“How will my model behave if I change the training data?”
Recent(-ish) work w/ @logan_engstrom: we nearly *perfectly* predict ML model behavior as a function of training data, saturating benchmarks for this problem (called “data attribution”).
Had a great time at @SimonsInstitute talking about new & upcoming work on meta-optimization of ML training
tl;dr we show how to compute gradients *through* the training process & use them to optimize training. Immediate big gains on data selection, poisoning, attribution & more!
Had a great time at @SimonsInstitute talking about new & upcoming work on meta-optimization of ML training
tl;dr we show how to compute gradients *through* the training process & use them to optimize training. Immediate big gains on data selection, poisoning, attribution & more!
Very cool work from Logan and the gang! One of these problems, indiscriminate data poisoning, has been one of my favorite mysteries in robustness -- and they did an order of magnitude better than we could previously! Looking forward to checking it out in more detail.
Very cool work from Logan and the gang! One of these problems, indiscriminate data poisoning, has been one of my favorite mysteries in robustness -- and they did an order of magnitude better than we could previously! Looking forward to checking it out in more detail.
After some very fun years at MIT, I'm really excited to be joining CMU as an assistant professor in Jan 2026! A big (huge!) thanks to my advisors (@aleks_madry@KonstDaskalakis), collaborators, mentors & friends.
In the meantime, I'll be a Stein Fellow at Stanford Statistics.
Announcing a deadline extension for the ATTRIB workshop! Submissions are now due September 25th, with an option to submit October 4th if at least one paper author volunteers to be an emergency reviewer. More info here: attrib-workshop.cc
At #ICML2024 ? Our tutorial "Data Attribution at Scale" will be to tomorrow at 9:30 AM CEST in Hall A1!
I will not be able to make it (but will arrive later that day), but my awesome students @andrew_ilyas@smsampark@logan_engstrom will carry the torch :)
How is an LLM actually using the info given to it in its context? Is it misinterpreting anything or making things up?
Introducing ContextCite: a simple method for attributing LLM responses back to the context: gradientscience.org/contextcite
w/ @bcohenwang, @harshays_, @kris_georgiev1
57K Followers 568 FollowingAssistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Joining @NYU_Courant September 2026. Co-EiC @TmlrOrg. I lead @TheSalonML.
4K Followers 927 FollowingResearch fellow at Flatiron Institute, working on understanding optimization in deep learning. Previously: PhD in machine learning at Carnegie Mellon.
61K Followers 12K FollowingAI policy researcher, wife guy in training, fan of cute animals and sci-fi, Substack writer, stealth-ish non-profit co-founder
49K Followers 9K FollowingI lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
5K Followers 889 FollowingFaculty at @ELLISInst_Tue & @MPI_IS, leading the AI Safety and Alignment group.
PhD from @EPFL supported by Google & OpenPhil PhD fellowships.
10K Followers 4K Followingsth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
43 Followers 148 FollowingML @scale_AI | ex Applied Scientist @amazon / CS @mit | Friends - if you find my professional account, don't send it in the twitter group 🙏
57K Followers 568 FollowingAssistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Joining @NYU_Courant September 2026. Co-EiC @TmlrOrg. I lead @TheSalonML.
4K Followers 927 FollowingResearch fellow at Flatiron Institute, working on understanding optimization in deep learning. Previously: PhD in machine learning at Carnegie Mellon.
11K Followers 723 Following"If there is not folly in the world, then the world itself is folly. You must understand that mistakes are not always regrets." - Paul Tobin, Bandette🤠
24K Followers 120 FollowingGrowSF is the #1 voter guide in San Francisco, trusted by hundreds of thousands of San Franciscans. You deserve a city that works.
15K Followers 6K FollowingI build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
88K Followers 2K FollowingWriting a data-driven newsletter about economics @ https://t.co/IanQ9oPoPi | Nuance? In this economy? | Full Employment Stan, Brazilian Coffee Tariff Victim
156K Followers 36 FollowingI have a place where I say complicated things about philosophy and science. That place is my blog. This is where I make terrible puns.
18K Followers 736 FollowingThis is the OFFICIAL Twitter of the ISO Class 1 Cambridge, MA Fire Department. Follow us for news & fire safety information. This account is not monitored 24/7.
2K Followers 1K FollowingAssistant professor @UMichCSE @UMich; previously @SimonsInstitute @UCBerkeley @Princeton @Tsinghua_Uni. Theoretical and scientific foundations of deep learning.
3K Followers 216 FollowingNeural network speedrunner and community-funded open source researcher. Set the CIFAR-10 record several times. Send me consulting/contracting work!