KnowledgeHub

Jacob T. wants to read Junkyard Computing: Repurposing Discarded Smartphones to Minimize Carbon by Jennifer Switzer

No cover — Junkyard Computing: Repurposing Discarded Smartphones to Minimize Carbon (Paper, 2023, ACM ASPLOS)

Junkyard Computing: Repurposing Discarded Smartphones to Minimize Carbon by Jennifer Switzer, Gabriel Marcano, Ryan Kastner, and 1 other

Jacob T. reviewed HECO: Fully Homomorphic Encryption Compiler by Alexander Viand

HECO: Fully Homomorphic Encryption Compiler (2023, Arxiv)

In recent years, Fully Homomorphic Encryption (FHE) has undergone several breakthroughs and advancements, leading to …

An improvement in usability

4 stars

This paper covers a compiler for more traditional imperative code to be converted to optimized (and batched) FHE operations via the SEAL library. The frontend is Python, which is then converted to multiple simplification and optimization passes in the CPP MLIR.

Both synthetic/toy examples and more real-world applications are created in pure imperative implementations (that require non-performant emulation steps), compiled with HECO, and built with FHE optimizations manually. The HECO performance is close to the hand-optimized in most scenarios, even edging it out in a few.

Hopefully the spread of this tool will help FHE reach the masses.

Jacob T. reviewed Two-in-One: A Model Hijacking Attack Against Text Generation Models by Wai Man Si

Machine learning has progressed significantly in various applications ranging from face recognition to text generation. …

Less plausible than Adversarial Reprogramming

3 stars

This paper covers a highly-effective (85%+) hijack attack where training data is tainted by an adversary, and then the model can be cajoled into performing other types of tasks. While this work is a steep closer to a more general-type of attack, the model is less plausible than inference-time attacks popularized in the Adversarial Reprogramming literature.

Jacob T. reviewed Adversarial Reprogramming of Neural Networks by Gamaleldin F. Elsayed

Deep neural networks are susceptible to \emph{adversarial} attacks. In computer vision, well-crafted perturbations to images …

The first "RCE" against ML that I came across

5 stars

I have sent this paper to a number of people over the years from when it first came out, I am surprised there is less attention to this type of attack, despite being a white-box model. This is the first class of attack that lets the attacker reprogram an image classification model to perform an attacker-determined task (e.g., turning an image classifier into a counter task).

Reviewing this paper 5 years after its release, it still stands up, and I see there is a small field of work in this lineage that includes similar attacks against NLP classifiers. I would count this paper as the starting point for this class of attack, which is an impressive and high-impact field.

Jacob T. reviewed Freaky Leaky SMS: Extracting User Locations by Analyzing SMS Timings by Evangelos Bitsikas

Short Message Service (SMS) remains one of the most popular communication channels since its introduction …

An improvement over the state-of-the-art with real-world consequences

3 stars

While silent SMSes have been used by authorities for quite some time to geolocate cell-phones, this work puts a less powerful capability into the hands of anyone. By training a ML model on the RTT from sending a silent SMS to phones in different [known] locations, a temporal map of the GSM network can be made to later classify RTTs when targeting a victim phone and approximate their location to country/region.

Without cooperation of the cell infrastructure it's pretty coarse-grained, but still a scary way to figure out where a target of interest is without alerting them.

Jacob T. wants to read Freaky Leaky SMS: Extracting User Locations by Analyzing SMS Timings by Evangelos Bitsikas

Short Message Service (SMS) remains one of the most popular communication channels since its introduction …

Yikes!

Jacob T. wants to read Two-in-One: A Model Hijacking Attack Against Text Generation Models by Wai Man Si

Machine learning has progressed significantly in various applications ranging from face recognition to text generation. …

Surprising that what is essentially a RCE in an ML model attack has gotten so little attention. Looks like a nice continuation from the image classification attacks.

Jacob T. wants to read Nougat: Neural Optical Understanding for Academic Documents by Lukas Blecher

No rating

Scientific knowledge is predominantly stored in books and scientific journals, often in the form of …

A @casey recommendation

Jacob T. reviewed An Audacious Plan to Halt the Internet's Enshittification Cory Doctorow by Cory Doctorow

The enshittification of the internet follows a predictable trajectory: first, platforms are good to their …

Very timely

5 stars

In this talk, @pluralistic@mamot.fr covers the basic premise of enshittification, how the internet giants have lobbied to change the rules that let them get big to stifle competition, and finally, what can be done about it.

I only recently was made aware of the term enshittification, but had seen the decay of online platforms hasten, be it Twitter, FB, Reddit, Google, etc. The term and "play book" was helpful to draw connections between the behaviors on disparate sites.

I am slightly less hopeful than the author about reversing the course on some of these, but I guess there's something to be said about me posting this review on a distributed, federated service not part of big tech.

Overall a great talk, wake up call, and pointer to some hopeful directions.

Jacob T. started reading Sparks by Ian Johnson

Sparks (2023, Oxford University Press, Incorporated)

Sparks by Ian Johnson

Jacob T. reviewed Language Modeling Is Compression by Grégoire Delétang

It has long been established that predictive models can be transformed into lossless compressors and …

Really nice way to formalize a collective intuition

4 stars

This paper formally equates (lossless) compression algorithms with LLM/learning. While the Hutter Prize has postulated the connection, this paper shows how an LLM can act as a better compressor for multi-modality data than the domain specific standards of today. The authors also use the gzip compression algorithm as a generative model, with rather poor success, but build a mathematical framework to build on.

The paper also covers tokenization as compression, which is something that's been lacking in a lot of other scientific discourse on this subject. Overall a nice read, 4* only because it ends abruptly without fully exploring the space of compressors as generative models.

Jacob T. wants to read Language Modeling Is Compression by Grégoire Delétang

It has long been established that predictive models can be transformed into lossless compressors and …

Suggestion from co-worker, especially since it seems to overlap with my blog.thinkst.com/2023/06/meet-zippy-a-fast-ai-llm-text-detector.html work

Jacob T. wants to read Universal and Transferable Adversarial Attacks on Aligned Language Models by Andy Zou

No rating

Because "out-of-the-box" large language models are capable of generating a great deal of objectionable content, …

I skimmed the top-level summary when it came out, but it appears well worth a deeper read.

Jacob T. finished reading 3 YEARS IN CHINA: A TALE OF BUILDING A REAL FULL SPEED ANTI-CENSORSHIP ROUTER by KaiJern Lau

No rating

Reversing GFW (Great FireWALL) is not a new topic, but it evolved over the years. …

Interesting story and insider view of the great firewall of China and bypass techniques.

Jacob T. reviewed IPvSeeYou: Exploiting Leaked Identifiers in IPv6 for Street-Level Geolocation by Erik Rye

We present IPvSeeYou, a privacy attack that permits a remote and unprivileged adversary to physically …

Simple concept, powerful results

4 stars

Basically this research combines a legacy IPv6 addressing scheme where the MAC address is put into the address with crowd-sourced WiFi network scanning geo-location databases. The trick is figuring out the delta in MACs between the WAN and WLAN adaptors, but they are usually close, so with some clustering, they were able to get 39m accuracy for ~12M routers in over 100 countries.

Crazy to think putting a MAC address in a world-routing IP address was ever considered a good idea, but with networking gears' long life cycle, it will be a long-lasting mistake!

User Profile

Jacob T.'s books

To Read (View all 11)

Currently Reading

Read (View all 31)

User Activity

Jacob T. wants to read Junkyard Computing: Repurposing Discarded Smartphones to Minimize Carbon by Jennifer Switzer

Junkyard Computing: Repurposing Discarded Smartphones to Minimize Carbon by Jennifer Switzer, Gabriel Marcano, Ryan Kastner, and 1 other

Jacob T. reviewed HECO: Fully Homomorphic Encryption Compiler by Alexander Viand

An improvement in usability

4 stars

Jacob T. reviewed Two-in-One: A Model Hijacking Attack Against Text Generation Models by Wai Man Si

Less plausible than Adversarial Reprogramming

3 stars

Jacob T. reviewed Adversarial Reprogramming of Neural Networks by Gamaleldin F. Elsayed

The first "RCE" against ML that I came across

5 stars

Jacob T. reviewed Freaky Leaky SMS: Extracting User Locations by Analyzing SMS Timings by Evangelos Bitsikas

An improvement over the state-of-the-art with real-world consequences

3 stars

Jacob T. wants to read Freaky Leaky SMS: Extracting User Locations by Analyzing SMS Timings by Evangelos Bitsikas

Jacob T. wants to read Two-in-One: A Model Hijacking Attack Against Text Generation Models by Wai Man Si

Jacob T. wants to read Nougat: Neural Optical Understanding for Academic Documents by Lukas Blecher

Jacob T. reviewed An Audacious Plan to Halt the Internet's Enshittification Cory Doctorow by Cory Doctorow

Very timely

5 stars

Jacob T. started reading Sparks by Ian Johnson

Sparks by Ian Johnson

Jacob T. reviewed Language Modeling Is Compression by Grégoire Delétang

Really nice way to formalize a collective intuition

4 stars

Jacob T. wants to read Language Modeling Is Compression by Grégoire Delétang

Jacob T. wants to read Universal and Transferable Adversarial Attacks on Aligned Language Models by Andy Zou

Jacob T. finished reading 3 YEARS IN CHINA: A TALE OF BUILDING A REAL FULL SPEED ANTI-CENSORSHIP ROUTER by KaiJern Lau

Jacob T. reviewed IPvSeeYou: Exploiting Leaked Identifiers in IPv6 for Street-Level Geolocation by Erik Rye

Simple concept, powerful results

4 stars