dron @_dron_h

math/music/ai nerd | research @GoodfireAI | prev cambridge, bair, polaris | giving a semantics to the syntax

garden.dronhazra.com

Joined April 2019

Tweets

2K
Followers

317
Following

439
Likes

50K

Michael Pearce @_MichaelPearce

a week ago

Excited to share our work digging into how Evo 2 represents species relatedness or phylogeny. Genetics provides a good quantitative measure of relatedness, so we could use it to probe the model and see if its internal geometry reflects it.

Goodfire @GoodfireAI

a week ago

10 54 367 70K 190

1 8 38 6K 7

dron @_dron_h

a week ago

i saw early versions of this work when i was still in school and it made waiting to join this team very difficult... very cool results! @_MichaelPearce

Goodfire @GoodfireAI

a week ago

10 54 367 70K 190

0 0 13 587 1

Goodfire @GoodfireAI

a week ago

Arc Institute trained their foundation model Evo 2 on DNA from all domains of life. What has it learned about the natural world? Our new research finds that it represents the tree of life, spanning thousands of species, as a curved manifold in its neuronal activations. (1/8)

10 54 367 70K 190

Liv @livgorton

2 weeks ago

What if adversarial examples aren't a bug, but a direct consequence of how neural networks process information? We've found evidence that superposition – the way networks represent many more features than they have neurons – might cause adversarial examples.

15 42 396 54K 243

Goodfire @GoodfireAI

2 weeks ago

New research! Post-training often causes weird, unwanted behaviors that are hard to catch before deployment because they only crop up rarely - then are found by bewildered users. How can we find these efficiently? (1/7)

10 44 379 44K 199

Jack Merullo @jack_merullo_

4 weeks ago

Could we tell if gpt-oss was memorizing its training data? I.e., points where it’s reasoning vs reciting? We took a quick look at the curvature of the loss landscape of the 20B model to understand memorization and what’s happening internally during reasoning

14 50 509 43K 397

dron @_dron_h

4 weeks ago

heck of a first week

Goodfire @GoodfireAI

4 weeks ago

4 3 55 4K 6

1 0 15 891 2

Curt Tigges @CurtTigges

4 weeks ago

Some neat results from hacking on gpt-oss at the Goodfire internal hackathon this week: 1. MoE experts are... actually experts? 2. The model seems to know which experts it's going to use for a token from the very first layer of the model. Here we see the "business expert":

5 6 52 14K 22

Nick @nickcammarata

3 months ago

if you really understand a neural network you should be able to explain and edit anything in the model by directly manipulating the activation tensor. we made a demo of this with diffusion models

Goodfire @GoodfireAI

3 months ago

39 98 923 174K 587

14 22 389 27K 140

Goodfire @GoodfireAI

3 months ago

We created a canvas that plugs into an image model’s brain. You can use it to generate images in real-time by painting with the latent concepts the model has learned. Try out Paint with Ember for yourself 👇

39 98 923 174K 587

Goodfire @GoodfireAI

4 months ago

We're publishing new queryable datasets to help researchers explore interpretable features in DeepSeek R1.

4 12 153 17K 58

max "activating examples" loeffler @maxsloef

4 months ago

i've added a little more to our recent deepseek r1 SAE launch :)

Goodfire @GoodfireAI

4 months ago

4 12 153 17K 58

2 2 43 2K 4

Goodfire @GoodfireAI

5 months ago

Today, we're announcing our $50M Series A and sharing a preview of Ember - a universal neural programming platform that gives direct, programmable access to any AI model's internal thoughts.

43 114 1K 339K 583

Lee Sharkey @leedsharkey

5 months ago

I've got some big personal news: I'm joining @GoodfireAI to lead a fundamental interpretability research team in London! This has been a while coming /n

15 6 352 30K 39

dron @_dron_h

5 months ago

r1: <completely breaks>. ahem. well. nevertheless,

Goodfire @GoodfireAI

5 months ago

r1: <completely breaks>. ahem. well. nevertheless, https://t.co/BMqJq5mZsq

2 6 62 10K 13

1 2 34 4K 2

dron @_dron_h

5 months ago

i've been working on this for the past few months! excited to share some initial results we've found trying to interpret a big reasoning model

Goodfire @GoodfireAI

5 months ago

21 72 631 113K 381

1 0 39 2K 5

Transluce @TransluceAI

5 months ago

To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇

10 66 338 195K 241

Kevin Meng @mengk20

5 months ago

AI models are *not* solving problems the way we think using Docent, we find that Claude solves *broken* eval tasks - memorizing answers & hallucinating them! details in 🧵 we really need to look at our data harder, and it's time to rethink how we do evals...