Tony Chen @tonychenxyz
Next: CS PhD @princetonCS. Prev: Undergrad @columbia. Current: Inference research intern @togethercompute. tonychen.xyz Joined January 2015-
Tweets128
-
Followers598
-
Following1K
-
Likes744
Announcing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models. Built in…
Check out our new paper “Generative Modeling of Weights: Generalization or Memorization?” — we find that current diffusion-based neural network weight generators often memorize training checkpoints rather than learning a truly generalizable weight distribution!
Check out our new paper “Generative Modeling of Weights: Generalization or Memorization?” — we find that current diffusion-based neural network weight generators often memorize training checkpoints rather than learning a truly generalizable weight distribution!
It's exciting to apply diffusion models to new domains! But it requires careful evaluation, esp. regarding memorization. Our paper highlights this need. Shout-out to @zeng_boya for leading this work. paper: arxiv.org/abs/2506.07998 project page: boyazeng.github.io/weight_memoriz… video:…
Can GPT, Claude, and Gemini play video games like Zelda, Civ, and Doom II? 𝗩𝗶𝗱𝗲𝗼𝗚𝗮𝗺𝗲𝗕𝗲𝗻𝗰𝗵 evaluates VLMs on Game Boy & MS-DOS games given only raw screen input, just like how a human would play. The best model (Gemini) completes just 0.48% of the benchmark! 🧵👇
Conference reviewing should be in the form of annotating on paper like google doc comments. So you know reviewers actually read the paper and less of over-general AI generated comments. And it’s smoother experience writing reviews too without jumping between paper and writing.
Very productive conversations with @melissapan @IntuitMachine @sh_reya @tonychenxyz @cyrusnewday. My tl;dr -> There are at least 4 different concepts here, and it's essential to study them separately. 1) Structured programming to fully express your intent or control on the…
Very productive conversations with @melissapan @IntuitMachine @sh_reya @tonychenxyz @cyrusnewday. My tl;dr -> There are at least 4 different concepts here, and it's essential to study them separately. 1) Structured programming to fully express your intent or control on the…
Glad to see Arena-Hard as the flagship benchmark for Qwen3! All of us really appreciate the works Qwen team have done, their works are amazing! 🙌 Evaluate your model on Arena-Hard and our recently released Arena-Hard-v2.0 at github.com/lmarena/arena-…
Glad to see Arena-Hard as the flagship benchmark for Qwen3! All of us really appreciate the works Qwen team have done, their works are amazing! 🙌 Evaluate your model on Arena-Hard and our recently released Arena-Hard-v2.0 at github.com/lmarena/arena-…
Claude can play Pokemon, but can it play DOOM? With a simple agent, we let VLMs play it, and found Sonnet 3.7 to get the furthest, finding the blue room! Our VideoGameBench (twenty games from the 90s) and agent are open source so you can try it yourself now --> 🧵
Introducing Open Deep Research! A fully open-source Deep Research tool that: • writes comprehensive reports • does multi-hop search and reasoning • generates cover images & pod-casts! We’re releasing everything: evaluation dataset, code and blog.🔥 Example output report👇
Life update: I’ll be joining @PrincetonCS as a PhD student starting fall 2025! It was a very very difficult decision. I enjoyed people I talked to and their research at every place. My biggest thank you to everyone who has helped me along the journey! Excited for what’s to come!
LLM agents are still in their early stages—struggling with simple tasks while holding scary levels of access and control. We need to focus on better safeguards and reliability before scaling their power.
LLM agents are still in their early stages—struggling with simple tasks while holding scary levels of access and control. We need to focus on better safeguards and reliability before scaling their power.
Rolling out a new inference stack for DeepSeek R1 @togethercompute that gets up to 110 t/s on the 671B parameter model!
Do you remember when you joined X? I do! #MyXAnniversary
"I'm too slow at ML research" - every researcher ever. Over years of trying different strategies, I've landed on a few that have really helped me. I've written them down here, hoping it helps others & becomes a community resource! open.substack.com/pub/miachiquie…

Celesta Capital @CelestaCapital
2K Followers 1K Following Global deep tech venture capital firm, enabling innovators and business builders at the frontiers of emerging technology. Team: @anandc @1sriram
Dieti @Dieti147
46 Followers 2K Following
OpheliaHume @88X5o9ajkki4r
38 Followers 2K Following
Saber Darabi @SADarabi
311 Followers 7K Following
bhargav#~ @bhargav_17889
37 Followers 528 Following
Rupert Wu @rhubarbwu
122 Followers 616 Following Researcher @togethercompute; MS '24 @UofTCompSci/@VectorInst
Hangliang Ding @_foreverpiano
107 Followers 908 Following Undergraduate in @Tsinghua_Uni | interested ML system
Joe Mayo @JoeMayo
16K Followers 7K Following Author and Independent Consultant Recent books: - Programming the Microsoft Bot Framework/MSPress - C# Cookbook/O'Reilly Agents, AI, Generative AI, MCP, RAG
Igor Kan @1gor_kan
5K Followers 8K Following ⊣|─ math, physics, stats, ai @UofT ─⊗─ building @lesenheit @styles_lab ─▭─ https://t.co/EuZcbXZ0wZ _/ _ philosophy, ancient near east lit.,history ⏚ :wq && exit
Lekan @lekan_digital
1K Followers 925 Following interests: cs, physics, 3d, ml & sustainable computing. prev: swe+pm @microsoft, research @stanford, ug @pitzercollege. atm: not building, but keen to chat
Nishit Anand @nishitanand99
100 Followers 2K Following MS CS @umdcs | Former ML Research - @iitdelhi, @IIITDelhi | Computer Vision | Multimodal LLMs | Photography
''Ryan'' Zheyuan Lai @ryanzylai
38 Followers 422 Following Statistics Undergrad @NUSingapore | Stochastic Process | Machine Learning | Optimization
Giorgos Kappes @GiorgosKappes
8 Followers 187 Following Postdoc Researcher @ CSL, University of Ioannina | Co-founder @ Polytropo Systems | Cloud Systems Builder
Boya Zeng @zeng_boya
58 Followers 169 Following PhD student @PrincetonCS (Fall 2025). Previous: BAS & BA @Penn. Interested in data, generative models, and many other topics in AI.
Linden Li @lindensli
2K Followers 662 Following Research Platform @OpenAI. Previously @DbrxMosaicAI, @NVIDIA, CS @Stanford.
Ben Athiwaratkun @ben_athi
872 Followers 704 Following Leading Turbo Team @ Together AI. prev: @awscloud @MSFTResearch, @Cornell PhD.
FayRobinson @4lGT0TgTZdB2Q
108 Followers 4K Following Therapist | Professional feelings translator 💬🔍
Chuang Gan @gan_chuang
9K Followers 484 Following Faculty Member at UMass Amherst; Principal researcher at MIT-IBM Watson AI Lab; Homepage: https://t.co/Pc8WeREfTz
Dongdong Sun @BillySun12345
7 Followers 74 Following ML Engineer @ SF | @nyuniversity | AI & Cybersecurity
Aryia Dattamajumdar @AryiaDm
350 Followers 317 Following ML @meta @AIatMeta prev @apple @metlife // @ucberkeley computer science alum🎓 @dailycal alum🗞️ Life Enthusiast, Author, Inventor, Die-Hard Foodie 😊 #GoBears
T.S.V.R @RautiainenTouko
166 Followers 1K Following aerial intelligence @kovadefence // ultra runner
Xindi Wu @cindy_x_wu
4K Followers 1K Following PhD student @PrincetonCS | Interning @nvidia | Data-centric multimodal ml | prev @roboVisionCMU @CMU_Robotics | @RealityLabs @Snapchat | 🏎️
Zhou Yu @Zhou_Yu_AI
12K Followers 1K Following Founder of https://t.co/9KM4uFScMi, Associate Professor at Columbia. Making ai agent design and deployment easy and fast! Forbes 30 under 30.
PolyWallet @BlandaRess18913
18 Followers 670 Following ⚡ Rapid Crypto Growth! Aim for 50-100000 USDT Daily Potential. Secure & Swift Earning Awaits You! Unlock High Rewards Now. 💰🚀
Peter Morales @PeterMoralesX
346 Followers 3K Following Founder, CEO of Code Metal. Interested in development at the edge? DM.
Yoram Bachrach @yorambac
3K Followers 7K Following Research Scientist at Meta (prev Google DeepMind and Microsoft Research). Working on LLM Agents and Multi-Agent Systems.
James Morrison Rubin @import_jmr
7K Followers 6K Following Product Lead | Google Gemini Prev: Launched @aws Trainium, @alexa99 Echo Show 5 Tweets are my own. Retweets are not endorsements. Joyful Learning Machines
Nikita @RoundKubik
7 Followers 268 Following
無聊來看看 @KennChou1
35 Followers 816 Following
Irqiedorv @Irqiedorv0818
46 Followers 1K Following
The Lone Ranger @AbdullahMdKhan
172 Followers 7K Following
Ryan Morey @RyanMorey
594 Followers 3K Following living imaginary under truthful circumstances | software engineer in connectomics @PrincetonNeuro | bad chess player
Georges Harik @gharik
4K Followers 4K Following early google employee. worked on ai, gmail. like to invest and think about ai. https://t.co/PYDDPR00kL
MMM @MMM1897775
9 Followers 3K Following
taesiri @taesiri
852 Followers 5K Following Research Scientist @ EA Sports, VLMs, Evals, All opinions are my own.
Yash Malik @_yash_malik_
88 Followers 1K Following ML @AmazonScience Scaling RL for LLMs Prev @Google, SC
Yong @YongXien
1 Followers 3K Following
Celesta Capital @CelestaCapital
2K Followers 1K Following Global deep tech venture capital firm, enabling innovators and business builders at the frontiers of emerging technology. Team: @anandc @1sriram
Rupert Wu @rhubarbwu
122 Followers 616 Following Researcher @togethercompute; MS '24 @UofTCompSci/@VectorInst
Elad Hazan @HazanPrinceton
14K Followers 217 Following machine learning and optimization @PrincetonCS & Google DeepMind Princeton, dad^3
Guangxuan Xiao @Guangxuan_Xiao
3K Followers 697 Following Ph.D. student at @MITEECS Prev: CS & Finance @Tsinghua_Uni
Hangliang Ding @_foreverpiano
107 Followers 908 Following Undergraduate in @Tsinghua_Uni | interested ML system
Claude @claudeai
108K Followers 1 Following Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8dz3D or download the app.
Lekan @lekan_digital
1K Followers 925 Following interests: cs, physics, 3d, ml & sustainable computing. prev: swe+pm @microsoft, research @stanford, ug @pitzercollege. atm: not building, but keen to chat
Sukjun (June) Hwang @sukjun_hwang
3K Followers 300 Following ML PhD student @mldcmu advised by @_albertgu
Kevin Lu @_kevinlu
9K Followers 216 Following @thinkymachines. formerly: - @openai: RL, synthetic data, efficient models - @berkeley_ai: decision transformer, universal computation
vincent @vvhuang_
1K Followers 439 Following understanding models @TransluceAI, writing https://t.co/M7hdeAExFk previously: hotel manager @MIT, math @0xPARC
''Ryan'' Zheyuan Lai @ryanzylai
38 Followers 422 Following Statistics Undergrad @NUSingapore | Stochastic Process | Machine Learning | Optimization
Kexin Huang @KexinHuang5
4K Followers 640 Following PhD Student @Stanford CS with @jure; AI + Biomedicine
Dynamics Lab @DynamicsLab_AI
5K Followers 166 Following An applied research company building at the frontier of AI
tomaarsen @tomaarsen
4K Followers 347 Following Sentence Transformers, SetFit & NLTK maintainer Machine Learning Engineer at 🤗 Hugging Face
Boya Zeng @zeng_boya
58 Followers 169 Following PhD student @PrincetonCS (Fall 2025). Previous: BAS & BA @Penn. Interested in data, generative models, and many other topics in AI.
Linden Li @lindensli
2K Followers 662 Following Research Platform @OpenAI. Previously @DbrxMosaicAI, @NVIDIA, CS @Stanford.
Arcee.ai @arcee_ai
4K Followers 416 Following Optimize cost & performance with AI platforms powered by our industry-leading SLMs: Arcee Conductor for model routing, & Arcee Orchestra for agentic workflows.
Chuang Gan @gan_chuang
9K Followers 484 Following Faculty Member at UMass Amherst; Principal researcher at MIT-IBM Watson AI Lab; Homepage: https://t.co/Pc8WeREfTz
Dongdong Sun @BillySun12345
7 Followers 74 Following ML Engineer @ SF | @nyuniversity | AI & Cybersecurity
Delta Institute @DeltaInstitutes
1K Followers 39 Following Supporting exceptional researchers/engineers, from academia to industry and beyond.
Aryia Dattamajumdar @AryiaDm
350 Followers 317 Following ML @meta @AIatMeta prev @apple @metlife // @ucberkeley computer science alum🎓 @dailycal alum🗞️ Life Enthusiast, Author, Inventor, Die-Hard Foodie 😊 #GoBears
Kris Selberg @SelbergKris
549 Followers 258 Following building ai agents for enterprises in sf | prev #1 followed non-fiction book TikToker | cs @princeton
T.S.V.R @RautiainenTouko
166 Followers 1K Following aerial intelligence @kovadefence // ultra runner
Assert Labs @assert_labs
15 Followers 2 Following Building tools to make software verifiably correct
Sabri Eyuboglu @EyubogluSabri
1K Followers 308 Following Working on language model memory. CS PhD student @Stanford working with @HazyResearch and @james_y_zou. 🪬
Pete Koomen @koomen
11K Followers 1K Following GP @ycombinator, cofounded @optimizely https://t.co/5hkCw4y95d
spark @sparkjsdev
899 Followers 1 Following Three.js-native 3D Gaussian splatting renderer Docs/examples @ https://t.co/paCxUjmG9B | Github @ https://t.co/afLJjWD6zO | Discord @ https://t.co/tdbgq3nHbd
rajan agarwal @_rajanagarwal
5K Followers 1K Following RL @amazonscience, se @uwaterloo, scholar @neo, prev @trykino @hitachi
Shuchao Bi @shuchaobi
13K Followers 688 Following Research @Meta Superintelligence Labs, RL/post-training/agents; Previously Research @OpenAI on multimodal and RL; Opinions are my own.
Omar Shaikh @oshaikh13
1K Followers 839 Following member of sociotechnical staff @Stanford - previously @GeorgiaTech
Nimit Kalra @ ICML 20... @qw3rtman
1K Followers 927 Following research @haizelabs, prev @citadel, @utaustin currently feynman technique-ing my way through life
Xindi Wu @cindy_x_wu
4K Followers 1K Following PhD student @PrincetonCS | Interning @nvidia | Data-centric multimodal ml | prev @roboVisionCMU @CMU_Robotics | @RealityLabs @Snapchat | 🏎️
Princeton University ... @PrincetonGrad
2K Followers 234 Following The Princeton Graduate School prepares & inspires the world’s most promising emerging scholars to serve humanity through impact and leadership.
Yoram Bachrach @yorambac
3K Followers 7K Following Research Scientist at Meta (prev Google DeepMind and Microsoft Research). Working on LLM Agents and Multi-Agent Systems.
James Morrison Rubin @import_jmr
7K Followers 6K Following Product Lead | Google Gemini Prev: Launched @aws Trainium, @alexa99 Echo Show 5 Tweets are my own. Retweets are not endorsements. Joyful Learning Machines
Rosmine @rosmine_b
2K Followers 483 Following ML researcher. LLMs + RL + Code gen. Tweets express the views of my employer (myself). DM me ML questions