Simons Institute for the T @SimonsInstitute, Twitter Profile

Simons Institute for the Theory of Computing @SimonsInstitute

4 weeks ago

1/2 "This is kind of shocking." Mikhail Belkin of @UCSD on how adding the same set of vectors to the activations of an LLM's transformer blocks can steer its output from, say, either English to Chinese, or Python to C++, at the Simons Institute. Video: simons.berkeley.edu/talks/mikhail-…

3 11 81 24K 58

Download Image

Simons Institute for the Theory of Computing @SimonsInstitute

4 weeks ago

2/2 Mikhail Belkin spoke of how to determine such "steering" vectors using white-box LLMs and recursive feature machines at the Simons Institute's Smale@95: A Conference in Honor of Steve Smale. Video: simons.berkeley.edu/talks/mikhail-…

0 1 10 1K 10

Download Image

The Free Press @TheFP

2 weeks ago

“We have tacitly abandoned certain public spaces to the most disordered and depraved among us because enforcing the law feels mean and makes us uncomfortable,” writes @katrosenfield.

166 1K 5K 1.4M 499