Naman Jain @StringChaos
PhD @UCBerkeley; Research @cursor_ai | Projects: LiveCodeBench, DeepSWE, R2E-Gym, GSO, Syzygy, LMArena Coding | Past: @MetaAI @AWS @MSFTResearch @iitbombay | naman-ntc.github.io | Berkeley | Joined March 2018
Tweets: 519 | Followers: 2K | Following: 1K | Likes: 5K
Interested in building and benchmarking deep research systems? Excited to introduce DeepScholar-Bench, a live benchmark for generative research synthesis, from our team at Stanford and Berkeley! 🏆Live Leaderboard guestrin-lab.github.io/deepscholar-le… 📚 Paper: arxiv.org/abs/2508.20033 🛠️…
MoE layers can be really slow. When training our coding models @cursor_ai, they ate up 27–53% of training time. So we completely rebuilt it at the kernel level and transitioned to MXFP8. The result: 3.5x faster MoE layer and 1.5x end-to-end training speedup. We believe our…
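A rough sanity check of the numbers in the post above, sketched with Amdahl's law. The assumption here (not stated in the post) is that only the MoE layer gets faster and the rest of the training step is unchanged; the function name is illustrative.

```python
def end_to_end_speedup(fraction: float, layer_speedup: float) -> float:
    """Amdahl's law: overall speedup when only `fraction` of runtime gets faster."""
    return 1.0 / ((1.0 - fraction) + fraction / layer_speedup)


# MoE share of training time quoted in the post: 27% to 53%.
# MoE layer speedup quoted in the post: 3.5x.
low = end_to_end_speedup(0.27, 3.5)
high = end_to_end_speedup(0.53, 3.5)
print(f"{low:.2f}x to {high:.2f}x")  # prints "1.24x to 1.61x"
```

Under that assumption, the quoted 1.5x end-to-end speedup falls inside the range implied by a 3.5x faster MoE layer.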
After three intense months of hard work with the team, we made it! We hope this release can help drive the progress of Coding Agents. Looking forward to seeing Qwen3-Coder continue creating new possibilities across the digital world!
We just released the evaluation of LLMs on the 2025 IMO on MathArena! Gemini scores best, but is still unlikely to achieve the bronze medal with its 31% score (13/42). 🧵(1/4)
Can data owners & LM developers collaborate to build a strong shared model while each retaining data control? Introducing FlexOlmo💪, a mixture-of-experts LM enabling: • Flexible training on your local data without sharing it • Flexible inference to opt in/out your data… https://t.co/Vnaaq6c6If
It's easy to confuse Best@K vs Pass@K—and we've seen some misconceptions about our results. Our 59% on SWEBench-Verified is Pass@1 with Best@16, not Pass@8/16. Our Pass@8/16 is 67%/71%. So how did we achieve this? DeepSWE generates N candidate solutions. Then, another LLM… https://t.co/vikctUOMUh
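A minimal sketch of the distinction drawn in the post above, using toy data. Pass@K counts a task as solved if any of K candidates passes the tests, while Best@K submits only the one candidate a selector ranks highest. The `scores` values stand in for a hypothetical selector; the post is truncated, so the selection method DeepSWE actually uses is not reproduced here.

```python
from typing import Sequence


def pass_at_k(passed: Sequence[bool], k: int) -> bool:
    """Solved if ANY of the first k candidates passes the tests."""
    return any(passed[:k])


def best_at_k(passed: Sequence[bool], scores: Sequence[float], k: int) -> bool:
    """Solved only if the single top-scored candidate among the first k passes."""
    top = max(range(min(k, len(passed))), key=lambda i: scores[i])
    return passed[top]


# Toy data (made up for illustration): 3 tasks, 4 candidates each.
tasks = [
    {"passed": [False, True, False, False], "scores": [0.2, 0.9, 0.1, 0.3]},
    {"passed": [False, False, True, False], "scores": [0.8, 0.1, 0.4, 0.2]},
    {"passed": [False, False, False, False], "scores": [0.5, 0.5, 0.5, 0.5]},
]

pass_rate = sum(pass_at_k(t["passed"], 4) for t in tasks) / len(tasks)
best_rate = sum(best_at_k(t["passed"], t["scores"], 4) for t in tasks) / len(tasks)
print(f"Pass@4 = {pass_rate:.2f}, Best@4 = {best_rate:.2f}")
```

Best@K is scored like a single submission, so it can never exceed Pass@K on the same candidates. In the second toy task a passing candidate exists, but the selector picks a failing one.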
DeepSWE is a new state-of-the-art open-source software engineering model trained entirely using reinforcement learning, based on Qwen3-32B. together.ai/blog/deepswe Fantastic work from @togethercompute @Agentica_‼ https://t.co/mLAbi2HD2Z
Claude is hyped to hear that its small business is getting the public recognition it deserves https://t.co/TwnzY0ruvG
RLVR is not just about RL, it's more about VR! Particularly for LLM coding, good verifiers (tests) are hard to get! In our latest work, we ask 3 questions: How good are current tests? How do we get better tests? How much does test quality matter? leililab.github.io/HardTests/
Questions to ask 1. can we see a "commit history?" (only 2 commits in repo) 2. what level of supervision was provided? 3. the paper is 4 dense pages. was it outlined first in a lean friendly way and then formalization took place? 4. which models were used in the formalization?
Day 3 of drilling down into popular benchmarks for models/agents. Benchmark #3: LiveCodeBench Developed by researchers at UC Berkeley, MIT, and Cornell, this benchmark evaluates LLM code-generation skills and continually expands with new problems drawn from programming contests…
Introducing Code Researcher - a deep research agent for large systems code and commit history. aka.ms/coderesearcher Achieves a 58% crash resolution rate on a benchmark of crashes in the Linux kernel, a complex codebase with 28M LOC & 75K files.
Ensuring construct validity is becoming increasingly more complex as we move towards more real-world evaluation setups. We should routinely inspect benchmark solutions to ensure intended goal is being met!!

Manish Shetty @slimshetty_
1K Followers 768 Following PhD @UCBerkeley | AI4Code & Evals | Projects: GSO, R2E, Syzygy, LMArena RepoChat, AIOpsLab | prev @googledeepmind @msftresearch
Mayur Naik @AI4Code
2K Followers 300 Following Misra Family Professor @CIS_Penn. I do research on neurosymbolic AI and cybersecurity.
Shreya Shankar @sh_reya
48K Followers 690 Following on the CS faculty job market | PhD @Berkeley_EECS, building https://t.co/PmuOqAYt6q | teaching https://t.co/CTWJ6z0JEg | formerly ML eng & undergrad @Stanford
Sumanth @sumanthd17
4K Followers 2K Following Building Models @sarvamai PhD’ing @iitmadras @AI4Bharat, Google PhD Fellow, Past life - @GoogleAI @Mila_Quebec @IIITSC
Conor Power @conor_power23
2K Followers 699 Following PhD student @BerkeleySky + https://t.co/jDsPgbj1nT. Former senior SWE on Microsoft Cosmos. Databases 🐘 and distributed systems 🕰️ with some theory 🧮 thrown in.
Dhruv Agarwal @agdhruv
729 Followers 214 Following PhD @Cornell. Past: @MSFTResearch, @GoogleDeepMind, @ashokauniv. Sports fan!
Swapnil Gandhi @sw2pnil
623 Followers 967 Following PhD Student in ML Systems @Stanford CS 🌲 ◦ Formerly: @MSFTResearch, @IIScBangalore
Shadaj Laddad @ShadajL
3K Followers 294 Following Senior Scientist at AWS. Building https://t.co/Ax69nGtiH4, a framework for correct distributed systems. PhD from @Berkeley_EECS. Co-organizer https://t.co/mV8bqpr3KF
Parth Thakkar @parth007_96
2K Followers 2K Following @Meta | Previously @IllinoisCS @MSFTResearch @IBMResearch | LLMs + code
Sriram Rajamani @SriramRajamani
3K Followers 466 Following Geek, technologist, research junkie. Dad, husband, son, brother & uncle. CVP, Microsoft Research. Working with wonderful colleagues and friends.
Arkil Patel @arkil_patel
1K Followers 1K Following CS PhD Student at Mila and McGill | Worked at AllenNLP and Microsoft Research
Divy Thakkar @divy93t
9K Followers 2K Following strategy + programs for Gemini, advancing human-centered llms. Ph.D @CityStGeorges . Personal views.
Siddhartha Gairola @sidgairo18
1K Followers 1K Following 🏔️📍🇩🇪 @ELLISforEurope 🇪🇺 PhD Student @cvml_mpiinf at MPI-INF & IST-A 物の哀れ ✨
Harshita Diddee @ihsrahedid
897 Followers 830 Following LTI PhD @SCSatCMU, AS-Intern @amazon Search| Prev: RF at @MSFTResearch | Interested in Data Quality Estimation
Harshit Joshi @harshitj__
2K Followers 372 Following CS phd @StanfordNLP, @StanfordOVAL | prev: @MSFTResearch | LLM systems for knowledge access, discovery and curation
Gargi Balasubramaniam @gargi_balasu
3K Followers 2K Following Research Engineer @GoogleDeepMind, @SiebelScholars '23, MS CS UIUC @IllinoisCS, Gold Medalist CS'20 BITS Pilani Goa, Prev @Meta, @AmazonScience, @Microsoft, 🎶
Saksham @sgdescent
1K Followers 2K Following Interested in making LLMs go brrrrr x+1: MS @LTIatCMU x: LLM @Zomato x-N: https://t.co/ht5ObQh7RV & Program Synthesis with LLMs @ProseMsft
swyx 🇸🇬 @swyx
125K Followers 3K Following achieve ambition with intentionality, intensity, & integrity - @smol_ai - @dxtipshq - @sveltesociety - @aidotengineer - @coding_career - @latentspacepod
Tommy @Shaughnessy119
58K Followers 3K Following Founding Partner @Delphi_Ventures | Co-Founder @Delphi_Digital | Host @PodcastDelphi | Built @VenturesRobot | Not Investment Advice | All My Own Opinions
JessicaNancy @1vOoUfSii511j
2 Followers 194 Following Be a girl with a mind, a woman with attitude, and a lady with class.
Prakash Kagitha @prakashkagitha
295 Followers 2K Following Research Assistant @DrexelUniv. Improving planning with LLMs. Previously, Lead Data Scientist @ https://t.co/yohjwHWaMY
Aman Karmani @tmm1
8K Followers 4K Following building Cursor @anysphere. full stack tinkerer and perf nerd. formerly vp of infra @github + ruby-core committer. founder @getchannels + ffmpeg committer.
Ashwinee Panda @PandaAshwinee
3K Followers 723 Following Postdoc of @tomgoldsteincs, PhD @princeton, @Cal alum, currently working on LLMs
Dan Roy @roydanroy
57K Followers 2K Following ML / AI researcher. Research Director and Canada CIFAR AI Chair, @VectorInst. Professor, @UofT (Statistics/CS).
SherryThompson @Leu17xilQ1M21
5 Followers 705 Following You can’t dim my light — I shine too bright.
ʟɪsᴀ @lisacheng
8K Followers 4K Following 10+ yrs in crypto. Blockchain Architect @ AI Co. Ethereum & Mastercoin alum. 2 exits. Burned, rebuilt, still here.
Klequd @Klequd1953620
3 Followers 184 Following Focused on investing in U.S. stocks, happy to discuss stock market trends.
Delphine @ArnoldJerd57774
35 Followers 2K Following
Rochelle @Barqou019
14 Followers 903 Following She shines not because she wants to be seen, but because she cannot help it.
Jeanette Larsen @jeanettelarsen_
54 Followers 3K Following Business entrepreneur from Sweden, currently living in California
Victor Hugo @VictorHugo45995
0 Followers 7K Following
VenusReed @2xmm9HO8q8yzzJ
15 Followers 1K Following
Audrey @Kxo5SeP01X7GsGC
13 Followers 899 Following
OffenHerz @gblHstYE460U48
8 Followers 334 Following
Unkonwn @Unkonwn15330541
3 Followers 291 Following
Keivan @Keivansamani
258 Followers 370 Following Building a Panacea | prev: DPhil Nuclear Fusion @UniOfOxford | President @blockchainox | Fellow @OxFoundry
BM building AI @BMAIengineer
82 Followers 4K Following University student. Trying to build. Networking. Interests in AI Research,startups,software,CS and emerging technologies.
Xiangzhe Xu @XiangzheX
194 Followers 392 Following Ph.D. student @PurdueCS. 2025Intern at @MSFTResearch. I do research that helps developers—from pros to vibe coders to agent builders.
Prof. Dr. Fabio Rocha... @fabiorochasilva
531 Followers 6K Following 🎓phd statistical science 👨🏻🏫 Professor at CEFETMG Brazil
Roro Saxen @saxen_roro
8 Followers 257 Following
SP500_Insider🇺🇸 @Wawha002923
52 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
Henry Hong @HenryHong217654
0 Followers 73 Following
Tiexu @Tiexu044881
23 Followers 1K Following
Debjyoti Ray @Dev5811
11 Followers 628 Following
Alex Boruch-Gruszecki @abgruszecki
74 Followers 81 Following Postdoc in Arjun Guha's group, investigating how AI and our understanding of programming languages can help build the future of programming
social_media@neural-d... @NeuralDispatch1
104 Followers 1K Following
Stuart Sul @stuart_sul
1K Followers 122 Following ml research @cursor_ai, cs @Stanford, mlsys @HazyResearch
Boardy @Boardy_ai
3K Followers 3K Following 💬 Text me on WhatsApp to get warm intros to the right people to scale your ventures. I know who you need. 📲 +1-415-969-9735 30k+ Substack | 50k+ LinkedIn
Mukund Shankar @MukundShankar3
0 Followers 260 Following
sirapon @sira0818299162
35 Followers 1K Following
Eric Tchirnhausen @tchirnhaus20039
25 Followers 5K Following Like to try new things you never know; trying to prove all software can be automated 😅 😅 😅 | ML/AI, | C++/Java/Go | GitHub : Dyl777
degen_bobo 🤖💎 @agostino90
635 Followers 4K Following
Kritika Prakash @kritipraks
10K Followers 1K Following Researcher and artist. 2nd year Computer Science PhD student @UChicago. Machine Learning and Causality for Healthcare.
Yann LeCun @ylecun
949K Followers 764 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
François Chollet @fchollet
572K Followers 813 Following Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.
Manish Shetty @slimshetty_
1K Followers 768 Following PhD @UCBerkeley | AI4Code & Evals | Projects: GSO, R2E, Syzygy, LMArena RepoChat, AIOpsLab | prev @googledeepmind @msftresearch
Zachary Lipton @zacharylipton
63K Followers 2K Following Cofounder & CTO: @AbridgeHQ, Professor: CMU/@acmi_lab, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷
MIT CSAIL @MIT_CSAIL
326K Followers 21K Following MIT's Computer Science & Artificial Intelligence Laboratory (CSAIL). Media Inquiries: [email protected] Check out the latest CSAIL content ⬇️
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Bojan Tunguz @tunguz
252K Followers 8K Following ML ex Nvidia. Creator of @trainxgb. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
Jelani Nelson @minilek
27K Followers 265 Following Professor and Department Chair @Berkeley_EECS. Research Scientist (part-time) @GoogleAI. Founder @addiscoder. 🇻🇮🇺🇸🇪🇹
Mayur Naik @AI4Code
2K Followers 300 Following Misra Family Professor @CIS_Penn. I do research on neurosymbolic AI and cybersecurity.
Shreya Shankar @sh_reya
48K Followers 690 Following on the CS faculty job market | PhD @Berkeley_EECS, building https://t.co/PmuOqAYt6q | teaching https://t.co/CTWJ6z0JEg | formerly ML eng & undergrad @Stanford
Sumanth @sumanthd17
4K Followers 2K Following Building Models @sarvamai PhD’ing @iitmadras @AI4Bharat, Google PhD Fellow, Past life - @GoogleAI @Mila_Quebec @IIITSC
Andrew Ng @AndrewYNg
1.3M Followers 1K Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs
Shruti Rijhwani @shrutirij
6K Followers 546 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball player
Jia-Bin Huang @jbhuang0604
65K Followers 284 Following
Kayo Yin @kayo_yin
15K Followers 695 Following PhD student @berkeley_ai @berkeleynlp. AI alignment & signed languages. Prev @carnegiemellon @polytechnique, intern @msftresearch @deepmind. 🇫🇷🇯🇵
Michael Black @Michael_J_Black
84K Followers 702 Following Director, Max Planck Institute for Intelligent Systems (@MPI_IS). Chief Scientist @meshcapade. Building 3D digital humans using vision, graphics, and learning.
Gautam Kamath @thegautamkamath
57K Followers 568 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Joining @NYU_Courant September 2026. Co-EiC @TmlrOrg. I lead @TheSalonML.
Swarat Chaudhuri @swarat
3K Followers 625 Following Professor at @UTCompSci, Research Scientist at @GoogleDeepmind. Automated Reasoning + Machine Learning + Programming Languages.
Conor Power @conor_power23
2K Followers 699 Following PhD student @BerkeleySky + https://t.co/jDsPgbj1nT. Former senior SWE on Microsoft Cosmos. Databases 🐘 and distributed systems 🕰️ with some theory 🧮 thrown in.
Aman Karmani @tmm1
8K Followers 4K Following building Cursor @anysphere. full stack tinkerer and perf nerd. formerly vp of infra @github + ruby-core committer. founder @getchannels + ffmpeg committer.
jeremy @jerhadf
2K Followers 1K Following clauding @AnthropicAI. personal views only. prev @hume_ai @elicitorg @ai_risks @QualiaRI @dartmouth
Andrew Milich @milichab
4K Followers 1K Following @cursor_ai, former CEO @skiffprivacy (acquired by @notionhq)
Simon Sarris @simonsarris
77K Followers 970 Following 🕯 In labouring to be concise, I become obscure. 🕯 Alchemist, sacred things, making things 🕯 The map is mostly water. 🌜 I make GoJS: https://t.co/7yYIMFfAtd
Alex Boruch-Gruszecki @abgruszecki
74 Followers 81 Following Postdoc in Arjun Guha's group, investigating how AI and our understanding of programming languages can help build the future of programming
eric zakariasson @ericzakariasson
39K Followers 446 Following @cursor_ai & tinkering. gone colfing at https://t.co/KFkkR5CJVl
Micah Hill-Smith @_micah_h
680 Followers 935 Following Co-founder & CEO @ArtificialAnlys. Previously @McKinsey.
Arnab Nandi ⭐️ @arnabdotorg
3K Followers 3K Following human-in-the-loop data infra | Prof, Ohio State CS | Co-founder, @thesteamfactory, @hackohio, @mobikitinc (acq.)
Stuart Sul @stuart_sul
1K Followers 122 Following ml research @cursor_ai, cs @Stanford, mlsys @HazyResearch
Yury Zemlyanskiy @_theopompus
242 Followers 471 Following Research Scientist at @cursor_ai. Ex @augmentcode, Google Research, and @ShaLabUSC.
pash @pashmerepat
10K Followers 466 Following currently head of ai @cline | prev @meta knowledge graph | creator of vault // @usc alum
Paul Christiano @paulfchristiano
3K Followers 0 Following
Lorenz Kuhn @_lorenzkuhn
1K Followers 751 Following Reasoning Research @OpenAI | o1-preview through o3
Borys Minaiev @bminaiev
1K Followers 260 Following Building reasoning models @OpenAI. ICPC World Champion
Arash Vahdat @ArashVahdat
10K Followers 869 Following Research Director, leading fundamental generative AI research (GenAIR) @nvidia research, views are my own.
jianlin.su @Jianlin_S
3K Followers 14 Following Grad is all you need @Kimi_Moonshot Blog: https://t.co/YVxsWylklA , Cool Papers: https://t.co/scS1n1oyaO
Crystal @crystalsssup
11K Followers 597 Following Staff @Kimi_Moonshot prev. co-maker of ModelizeAI & gemsouls "Personality goes a long way" @UCSanDiego
Joel Becker @joel_bkr
3K Followers 2K Following move fast and fix things @METR_evals. 'soccer'-me @MessiSeconds.
Clara Na @claranahhh
967 Followers 594 Following PhD student at @LTIatCMU / @SCSatCMU she/her, prev. @UVA and intern @ai2_allennlp @/clara on https://t.co/GHxXbrRHSB and @/clarana on https://t.co/47UIhMGaRd
Kevin Frans @kvfrans
4K Followers 503 Following @berkeley_ai @reflection_ai prev mit, read my thoughts: https://t.co/7CZsOTrKRA
𝔊𝔴𝔢𝔯𝔫 @gwern
61K Followers 104 Following Internet besserwisser; pedantic, mean reply guy. 𝘞𝘢𝘵𝘢𝘴𝘩𝘪 𝘬𝘪𝘯𝘪𝘯𝘢𝘳𝘪𝘮𝘢𝘴𝘶! (Follow requests ignored due to terrible UI.)
kalomaze @kalomaze
18K Followers 2K Following ML researcher (@primeintellect), speculator • extremely silly jester
Aurko Roy @aurko79
2K Followers 227 Following ML research | @AIatMeta (2025-2025) | @GoogleDeepmind (2023-2025) | @GoogleAI (Brain) (2017-2023) | CS PhD @Georgiatech | CS @IITKanpur
Sanjana Sharma @_keysarasara
1K Followers 2K Following Done healing my inner child, next up is my inner teen. Her highness demands a sword.
Jediah Katz @jediahkatz
3K Followers 339 Following software @cursor_ai. prev @figma / taught CS @penn. native new yorker, check out https://t.co/pkRbuZuX1R
Jürgen Schmidhuber @SchmidhuberAI
163K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
Ameen Patel @Ameen_ml
1K Followers 2K Following LLM Inference & Serving @togethercompute, prev @AmazonScience, @uwaterloo
cat @_catwu
39K Followers 355 Following claude code pm @anthropicai prev: @indexventures, @dagster, @scale_ai
Shuchao Bi @shuchaobi
13K Followers 689 Following Research @Meta Superintelligence Labs, RL/post-training/agents; Previously Research @OpenAI on multimodal and RL; Opinions are my own.
Michiel de Jong @michielsdj
555 Followers 335 Following Research Scientist at @cursor_ai, formerly @augmentcode and @usc
Pengfei Liu @stefan_fee
4K Followers 791 Following Associate Prof. at SJTU, leading GAIR Lab (https://t.co/Nfd8KmZx3B) Co-founder of Inspired Cognition, Postdoc at @LTIatCMU, Previously FNLP, @MILAMontreal,
Ryo Lu @ryolu_
55K Followers 2K Following Head of Design @Cursor_ai. Early @NotionHQ, @Stripe, built startups. I make a world where anyone can make software. Aspiring k-pop idol.
Charlie Ruan @charlie_ruan
635 Followers 470 Following CS PhD Student @UCBerkeley @BerkeleySky | prev @CSDatCMU, @CornellCIS
Jason Lee @jasondeanlee
18K Followers 4K Following Associate Professor at UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning.
thebes @voooooogel
15K Followers 885 Following "peaceful, albeit ominous" ꙮ website → https://t.co/aykxqKippW ꙮ games → https://t.co/3Pz19vHOwd ꙮ 💞💍📝 @holotopian ꙮ she/they 🏳️⚧️
Simon Boehm @Si_Boehm
3K Followers 267 Following
Chris Albon @chrisalbon
89K Followers 3K Following Director of ML and Data Eng @Wikimedia Foundation. We host Wikipedia.
Arjun Panickssery @panickssery
4K Followers 2K Following Researching scalable oversight @MATSprogram | prev @METR_Evals @ai_risks | spaced repetition | AI safety | https://t.co/mc28sVZYOC