Princeton NLP Group @princeton_nlp
Princeton NLP Group led by @prfsanjeevarora @danqi_chen @karthik_r_n nlp.cs.princeton.edu Princeton, NJ Joined August 2020
Tweets 258
Followers 5K
Following 61
Likes 281
AlgoTune is a benchmark that penalizes expensive models, since we give each model a budget of $1 to solve each task. Cool to see open weight models doing well! x.com/ori_press/stat…
What happens if you compare LMs on SWE-bench without the fancy scaffolds? Our new leaderboard “SWE-bench (bash only)” shows you which LMs are the best at getting the job done with just bash. More on why this is important 👇
Shoutout to all the @Princeton researchers participating in @icmlconf #ICML2025 Browse through some of the cutting edge research from AI Lab students, post-docs and faculty being presented this year: pli.princeton.edu/blog/2025/prin…
As we optimize model reasoning over verifiable objectives, how does this affect human understanding of said reasoning to achieve superior collaborative outcomes? In our new preprint, we investigate human-centric model reasoning for knowledge transfer 🧵:
Improved reasoning increases performance on benchmarks, but are models able to pass their knowledge onto humans? 🧐 We evaluate models’ communication abilities in teaching novel solutions to users! See our new paper!
Introducing SWE-bench Multilingual: a new eval in the SWE-bench family to test LLM coding abilities in *9* programming languages, fully integrated with SB so it can plug into existing workflows. Claude 3.7 gets 43% on SB Multilingual vs 63% on SB Verified, a 20 pt drop!🧵
Join us on May 21st. I'll talk about how we built SWE-bench & SWE-agent and what I'm excited about for the future of autonomous AI systems.
Our warmest congratulations to @danqi_chen, @stanfordnlp grad and now Associate Professor at @PrincetonCS and Associate Director of @PrincetonPLI on her stunning @iclr_conf keynote!
Claude can play Pokemon, but can it play DOOM? With a simple agent, we let VLMs play it, and found Sonnet 3.7 to get the furthest, finding the blue room! Our VideoGameBench (twenty games from the 90s) and agent are open source so you can try it yourself now --> 🧵
Can language models effectively impersonate you to family and friends? We find that they can: 44% of the time, close friends and family mis-identify Llama-3.1-8b as human… 🧵👇
Congrats on the Verified and Multimodal SWE-bench numbers. venturebeat.com/ai/zencoders-c…
We just updated the SWE-bench Multimodal leaderboard. Congrats to Globant, Zencoder, and the Agentless team from UIUC for their strong results.
🤔 Ever wondered how prevalent some type of web content is during LM pre-training? In our new paper, we propose WebOrganizer which *constructs domains* based on the topic and format of CommonCrawl web pages 🌐 Key takeaway: domains help us curate better pre-training data! 🧵/N
This Tuesday (Feb 18), @_carlosejimenez will discuss SWE-bench and the future of codegen evals, as part of the Conference on Synthetic Software in NYC. @KLieret will also be there. RSVP: lu.ma/k2q27yi3
SWE-agent 1.0 is the open-source SOTA on SWE-bench Lite! Tons of new features: massively parallel runs; cloud-based deployment; extensive configurability with tool bundles; new command line interface & utilities.
🚀 Introducing Goedel-Prover: A 7B LLM achieving SOTA open-source performance in automated theorem proving! 🔥 ✅ Improving +7% over previous open source SOTA on miniF2F 🏆 Ranking 1st on the PutnamBench Leaderboard 🤖 Solving 1.9X total problems compared to prior works on Lean…
Congrats to o3-mini on setting a new high score on SciCode!! R1 clocks in at an impressive 4.6%, matching Claude 3.5. SciCode is our super-tough programming benchmark written by PhDs in various scientific domains.
SciCode is our super tough coding benchmark testing the abilities of LMs to program code based on research in physics/biology/material science/... o1 is the SoTA with 7%. To make it easier to use we're putting it into the Inspect AI format, as a few groups were asking for this.
Congrats to the DeepSeek team on the impressive SWE-bench results!

(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
Delip Rao e/σ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Jacob Andreas @jacobandreas
20K Followers 951 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Bill Yuchen Lin @billyuchenlin
23K Followers 3K Following Building Grok @xAI. Affiliate Assistant Prof @UW; Focusing on Grok Code for Macrohard now. Ex: @allen_ai, Google AI, Meta FAIR.
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Sewon Min @sewon__min
13K Followers 815 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Sebastian Ruder @ ACL @seb_ruder
92K Followers 1K Following Research Scientist @AIatMeta • Ex @Cohere @GoogleDeepMind
Sebastian Gehrmann @sebgehr
6K Followers 2K Following Head of Responsible AI, CTO office, @Bloomberg. (he/him) Formerly LLMs @ Google Brain / Harvard. views my own
Jay Alammar @JayAlammar
46K Followers 1K Following Writer https://t.co/TquuQXlLOJ. O'Reilly Author https://t.co/Fl3uPAZHLg. LLM Builder @Cohere. Visualizing AI one concept at a time.
Ofir Press @OfirPress
15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Mark Dredze @mdredze
6K Followers 783 Following John C Malone Professor at @JohnsHopkins @JHUCompSci @jhuclsp @jhumceh; Part time @techatbloomberg (tweets my own) @mdredze.bsky.social🦋
Vivek Gupta @keviv9
3K Followers 5K Following Assistant Professor @SCAI_ASU; PostDoc @cogcomp @Penn, ed-@UUtah,@iitkanpur. @Bloomberg @MSFTResearch Fellow; ex-@MetaAI @IBM @Verisk @samsungresearch @Synopsys
Shaily @shaily99
7K Followers 2K Following PhD @LTIatCMU. Prev: @allen_ai @GoogleAI @MSFTResearch. #NLProc. Often ranting about research.
rishi @RishiBommasani
6K Followers 2K Following Societal/economic impacts of AI; AI policy & governance @StanfordHAI Stanford CS PhD w/ @percyliang @jurafsky Cornell CS undergrad w/ @clairecardie
Greg Durrett @gregd_nlp
8K Followers 894 Following Associate professor at NYU (Courant CS + Center for Data Science) | advisor for @bespokelabsai | large language models and NLP | he/him
Weijia Shi @WeijiaShi2
9K Followers 1K Following PhD student @uwnlp @allen_ai | Prev @MetaAI @CS_UCLA | 🏠 https://t.co/Q6Mzg8ow2j
Yonatan Belinkov @boknilev
5K Followers 1K Following Assistant professor of computer science @TechnionLive. #NLProc
Xiaoyue Xu @xiaoyue02_xu
23 Followers 213 Following CS undergrad @Tsinghua_Uni | Seeking 25 fall PhD position in nlp | 🔗 https://t.co/yQea7xlnJ9
pku_whzhang @PKUBrian
0 Followers 41 Following PhD candidate at Peking University @PKU1898. Focusing on LLMs and Autonomous Agents.
Husary Fall @husaryfall
90 Followers 990 Following Maintenance Supervisor|CMMS manager |INDUSTRIAL DESIGNER
smile @Smilex_P
236 Followers 5K Following
云创兽Ai @Frawal7909
0 Followers 112 Following 🌟 focusing on dividend stocks lover, independent girl! open to insights. DM me about economic cycles! 📊 #Nasdaq #Stocks
Goddy Snow @GoddySnow62732
4 Followers 140 Following
Liu He (Helium) @Heliummn
1 Followers 84 Following PhD Applicant (Fall '26) in Computational Social Science | Social NLP|SpeechLLM | XAI | MS@StudyatUSTC, prev intern. @Baidu_Inc’s ERNIE Bot 🤖🗣️👥
Patang Maja @MajaPatang
69 Followers 1K Following
아아 @aa164919269577
0 Followers 82 Following
Jason @Jason27627351
8 Followers 320 Following
Ivan @entrophyx
5 Followers 1K Following
RuthBunyan @fg0eWs80A58wh2D
276 Followers 4K Following
Nafise Sadat Moosavi @NafiseSadat
473 Followers 385 Following Lecturer (~Assistant Prof.) in NLP @SheffieldNLP @shefcompsci, Muslim Iranian woman. We remain true to the pledge
Atharva Mehta @atharva20038
0 Followers 26 Following
Junyi Zhang @Levi_JYZhang
1 Followers 27 Following UCLA Master, PLUS Lab, advised by Prof. Violet Peng
Song Mei @Song__Mei
3K Followers 690 Following Assistant Professor at UC Berkeley, Department of Statistics and EECS. Researcher at OpenAI working on LLM training.
Tapon K. Ray @realTaponRay
7 Followers 727 Following BTech CS @VITAPuniversity #HCI #Agent automation #Algorithm →Aim: Assembling human as superHuman, aim=∫(∞)dt 🎩
ycs @ycs091216
4 Followers 304 Following
Yixiao @y1xia0w
2 Followers 50 Following
Saber Darabi @SADarabi
314 Followers 7K Following
Prathyaksh N @n_prathyaksh
11 Followers 945 Following
Camille @han_zi60291
0 Followers 38 Following
Haoyu Dong @HaoyuDong9
157 Followers 272 Following I am a senior researcher at Microsoft. #SpreadsheetLLM
Dr. Eng. NADIA GHEZAI... @Nadia_GHEZing
29 Followers 785 Following
TienDat @TienDat011000
3 Followers 271 Following
بُوَيْحِث @ALba7ith98
573 Followers 2K Following
Luke @lukenlp57
0 Followers 160 Following
WeiCUI6 @Cui6Wei
34 Followers 752 Following Systems Software Engineer @NVIDIA. Prev @UofT @UCLA @KITE_UHN @Tesla @Samsung @Apple. Working on @NVIDIAGFN
jake sparrow @NLP_beginner
6 Followers 273 Following
Dan @DanIskandarov
49 Followers 2K Following
Sherry Yukinoshita @SherryYuki99299
0 Followers 71 Following This world still seems to want to keep me tamed. Fine, just as it wishes? I'll struggle beautifully.
Leuve @Leuve1095409
45 Followers 1K Following
Rohan Shingade @shingade_rohan
29 Followers 182 Following
Adwawgu @Adwawgu285
31 Followers 1K Following
Gust Stokes @GustS56008
77 Followers 4K Following
Jordi Mas @jordimash
3K Followers 2K Following Passionate about technology, languages, collaboration and open source. Member of @softcatala & @gnome Open source contributions 🔨 : https://t.co/NDTmM57nXi
Yihuai Hong @YihuaiH91773
136 Followers 523 Following 1st Year Ph.D. student @NYU_Courant | Prev Intern @AlibabaGroup @UCL | Mechanistic Interpretability, LLM Safety, Post-training.
EMNLP 2025 @emnlpmeeting
15K Followers 50 Following EMNLP 2025 - The 2025 Conference on Empirical Methods in Natural Language Processing, 2025 Hashtag: #EMNLP2025 Dates: November 5-9 Submission Deadline: May 19th
Stanford NLP Group @stanfordnlp
171K Followers 295 Following Computational Linguists—Natural Language—Machine Learning @chrmanning @jurafsky @percyliang @ChrisGPotts @tatsu_hashimoto @MonicaSLam @Diyi_Yang @StanfordAILab
Ofir Press @OfirPress
15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
ACL 2025 @aclmeeting
22K Followers 52 Following Association for Computational Linguistics | ACL 2025 conference | The 63rd Annual Meeting of the ACL Hashtags: #NLProc #ACL2025NLP
Princeton Laboratory ... @PrincetonAInews
1K Followers 67 Following The Princeton Laboratory for Artificial Intelligence supports and expands the scope of AI research at Princeton.
Yoonsang Lee @yoonsang_
228 Followers 609 Following CS PhD @princeton_nlp @princetonPLI; prev @SeoulNatlUni
Yong Lin @Yong18850571
744 Followers 217 Following Postdoc Fellow @PrincetonPLI @Princeton. Co-leading the Goedel-Prover project. Apple AI/ML PhD Fellow 2023.
Adithya Bhaskar @AdithyaNLP
312 Followers 304 Following Second Year CS Ph.D. student at Princeton University (@princeton_nlp), previously CS undergrad at IIT Bombay
Luxi (Lucy) He @LuxiHeLucy
991 Followers 409 Following Princeton CS PhD @PrincetonPLI. Previously @Harvard ‘23 CS & Math.
Kilian Lieret @KLieret
882 Followers 40 Following Research Software Engineer at Princeton University. AI agents & benchmarks for software engineering.
Howard Yen @HowardYen1
237 Followers 238 Following
Tri Dao @tri_dao
32K Followers 632 Following Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.
Zirui "Colin" Wang @zwcolin
1K Followers 571 Following CS PhD Student @Berkeley_EECS; Prev. MS @princeton_nlp, BS @HDSIUCSD; '25 @siebelscholars; I work on multimodal models; He/Him.
Princeton PLI @PrincetonPLI
2K Followers 32 Following Princeton University initiative enhancing fundamental understanding of AI, enabling its use in academic disciplines, and examining AI's societal implications.
Alex Wettig @_awettig
2K Followers 584 Following PhD @Princeton trying to make sense of language models and their training data; trying to train agents @cursor_ai
Ellen Zhong @ZhongingAlong
8K Followers 887 Following Assistant Professor @PrincetonCS. #ai4science #proteins247 #cryoem #cryodrgn ❄️🐉 Prev: @MIT @DeepMind @DEShawResearch. Currently moonlighting @generate_biomed.
John Yang @jyangballin
4K Followers 783 Following 🌲 CS PhD @Stanford 🤖 SWE-bench + agent + smith 🎓 Prev. @princeton_nlp 🐯; @Berkeley_EECS 🐻
Vishvak Murahari @VishvakM
476 Followers 225 Following NLP + ML Ph.D. candidate @princeton_nlp Ex. @Google @allen_ai @Microsoft
Princeton University @Princeton
562K Followers 1K Following The official account of Princeton University. In the Nation’s Service and the Service of Humanity.
Princeton Engineering @EPrinceton
10K Followers 2K Following Princeton University School of Engineering and Applied Science. Engineering in the service of humanity.
Princeton Computer Sc... @PrincetonCS
6K Followers 195 Following The Department of Computer Science at Princeton University
Tianyu Gao @gaotianyu1350
5K Followers 903 Following CS PhD student @Princeton @Princeton_nlp @PrincetonPLI working on language models. Previously: @Tsinghua_Uni @TsinghuaNLP
Zexuan Zhong @ZexuanZhong
3K Followers 700 Following @xAI post-trained Grok 3&4; scaling up RL for Grok-next | prev @PrincetonCS
"Tony" Runzhe Yang @RunzheYang
230 Followers 235 Following Machine Learning & Computational Neuroscience Ph.D. @PrincetonCS @PrincetonNeuro
Yangsibo Huang @YangsiboHuang
4K Followers 703 Following research scientist @googledeepmind. gemini thinking & coding. phd @princeton. opinions are my own.
AmsterdamNLP @AmsterdamNLP
4K Followers 332 Following Tweeting about NLP research, events and opportunities in Amsterdam -- run by @wzuidema and others.
Institute for Advance... @the_IAS
27K Followers 382 Following Latest news, research, and campus updates from one of the world's leading centers for theoretical research and intellectual inquiry.
Sanjeev Arora @prfsanjeevarora
25K Followers 101 Following Director, @PrincetonPLI and Professor @PrincetonCS. Seeks math/conceptual understanding of deep learning and large AI models. Also on the "other" social network
UMD Department of Com... @umdcs
6K Followers 90 Following Official feed for the @UofMaryland's Department of Computer Science housed in the @iribecenter.
Howard Chen @__howardchen
1K Followers 1K Following PhDing @PrincetonPLI. Machine memory / control / agency.
Language Technologies... @LTIatCMU
12K Followers 237 Following The Language Technologies Institute in Carnegie Mellon University's @SCSatCMU
JHU CLSP @jhuclsp
7K Followers 6K Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw @[email protected]
UCL Natural Language ... @ucl_nlp
13K Followers 225 Following NLP/ML research group at @UCLCS, PIs: S. Riedel (@riedelcastro), P. Stenetorp, T. Rocktäschel (@_rockt), E. Grefenstette (@egrefen), P. Minervini (@pminervini)
CopeNLU @CopeNLU
4K Followers 314 Following University of Copenhagen Natural Language Understanding research group, led by @IAugenstein #NLProc #ML #dlearn Funded by @ERC_Research @DFF_raad @VILLUMFONDEN
Machine Learning at G... @mlatgt
7K Followers 431 Following The Machine Learning Center at Georgia Tech (ML@GT) is an interdisciplinary research center that trains the next generation of #machinelearning & #AI pioneers.
CambridgeNLP @cambridgenlp
9K Followers 199 Following The Natural Language Processing Group @Cambridge_Uni, Computer Science department #NLProc #ML. Account managed by @Eric_chamoun, @richarddm1, @pietro_lesci.
EdinburghNLP @EdinburghNLP
13K Followers 159 Following The Natural Language Processing Group at the University of Edinburgh.
WiML @WiMLworkshop
18K Followers 1K Following Women in Machine Learning organization. Maintains a list of women in ML. Profiles the research of women in ML. Annual workshop and other events.
CILVR @CILVRatNYU
2K Followers 13 Following CILVR at NYU https://t.co/PbvGtsBGvR CILVR Blog https://t.co/fyHd5zS3w2
USC NLP @nlp_usc
4K Followers 362 Following The NLP group at @USCViterbi. @DaniYogatama+@_jessethomason_+@jieyuzhao11+@robinomial+@swabhz+@xiangrenNLP at @CSatUSC + researchers @USC_ICT, @USC_ISI.
Griffiths Computation... @cocosci_lab
6K Followers 134 Following Tom Griffiths' Computational Cognitive Science Lab. Studying the computational problems human minds have to solve.
MIT CSAIL @MIT_CSAIL
326K Followers 21K Following MIT's Computer Science & Artificial Intelligence Laboratory (CSAIL). Media Inquiries: [email protected] Check out the latest CSAIL content ⬇️