Hao Peng @haopeng_nlp
Assistant Professor at UIUC CS Joined October 2020-
Tweets39
-
Followers612
-
Following102
-
Likes40
🧩New blog: From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones Do LLMs learn new skills through RL, or just activate existing patterns? Answer: RL teaches the powerful meta-skill of composition when properly incentivized. 🔗:husky-morocco-f72.notion.site/From-f-x-and-g…
So many works talking about entropy, but what is the **mechanism** of entropy in RL for LLMs? 🤔 Our work gives a principled understanding, as well as two tricks that get entropy **controlled** 🧵
Can entropy minimization alone improve LLM performance? And how far can they go without any labeled data? This work answers both: yes, and surprisingly far 🐮 At inference EM can beat GPT4o Claude 3 opus & Gemini 1.5 pro on challenging scientific coding w/o any data/model update
🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models” From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮 And this isn’t a one-off. The pattern holds across RL algorithms and models. 🧵A Deep Dive
💡We find that models “think” 💭 in English (or in general, their dominant language) when processing distinct non-English or even non-language data types 🤯 like texts in other languages, arithmetic expressions, code, visual inputs, & audio inputs ‼️ 🧵⬇️arxiv.org/abs/2411.04986
🚨 I’m on the job market this year! 🚨 I’m completing my @uwcse Ph.D. (2025), where I identify and tackle key LLM limitations like hallucinations by developing new models—Retrieval-Augmented LMs—to build more reliable real-world AI systems. Learn more in the thread! 🧵
I'm on the academic job market! I develop autonomous systems for: programming, research-level question answering, finding sec vulnerabilities & other useful+challenging tasks. I do this by building frontier-pushing benchmarks and agents that do well on them. See you at NeurIPS!
Wanna train PRMs but process labels, annotated manually or automatically, sound too expensive to you😖? Introduce Implicit PRM🚀 – Get your model free process rewards by training an ORM on the cheaper response-level data, with a simple parameterization at no additional cost💰!
Curious whether video generation models (like #SORA) qualify as world models? We conduct a systematic study to answer this question by investigating whether a video gen model is able to learn physical laws. Three are three key messages to take home: 1⃣The model generalises…
What If LLMs can cite the pre-training source(s) supporting their parametric knowledge? Won't this dramatically improve verifiability and trustworthiness? We aimed to answer this during my internship @allen_ai Paper: arxiv.org/abs/2404.01019 To be presented at #COLM Thread👇👇
🎯 Introducing SOLO, a single Transformer architecture for unified vision-language modeling. SOLO accepts both raw image patches (in pixels) and texts as inputs, without using a separate pre-trained vision encoder. Paper: arxiv.org/abs/2407.06438 Code: github.com/Yangyi-Chen/SO…
Language models excel at undergraduate exams, but how do they fare in research? SciCode challenges models with real research coding problems. Even the best models solve less than 5%. Very proud of @MinyangTian1 and @luyu_gao for leading the charge!
Language models excel at undergraduate exams, but how do they fare in research? SciCode challenges models with real research coding problems. Even the best models solve less than 5%. Very proud of @MinyangTian1 and @luyu_gao for leading the charge!
I'm joining the UIUC @UofIllinois this fall as an Assistant Professor in the iSchool, with an affiliation in Computer Science! My research passion lies in the intersection of NLP and the medical domain. I'm recruiting students for 2025! Check more info: yueguo-50.github.io.
From Claude100K to Gemini10M, we are in the era of long context language models. Why and how a language model can utilize information at any input locations within long context? We discover retrieval heads, a special type of attention head responsible for long-context factuality
Want to train an aligned LM in a new language 🌏 but don’t have preference data for training the reward model (RM)? 💡 Just use a RM for another language: it often works well, sometimes even BETTER than if you had a RM in your target language! 🤯 arxiv.org/abs/2404.12318
SWE-agent is our new system for autonomously solving issues in GitHub repos. It gets similar accuracy to Devin on SWE-bench, takes 93 seconds on avg + it's open source! We designed a new agent-computer interface to make it easy for GPT-4 to edit+run code github.com/princeton-nlp/…
Very proud of Eurus. A huge shoutout to @lifan__yuan and @charlesfornlp for leading this!
Very proud of Eurus. A huge shoutout to @lifan__yuan and @charlesfornlp for leading this!
Very proud of Eurus. A huge shoutout to @lifan__yuan and @charlesfornlp for leading this!
Very proud of Eurus. A huge shoutout to @lifan__yuan and @charlesfornlp for leading this!
Frontier models all have at least 100k context length, Gemini 1.5 has even 1m context. What about research and open source? Introducing Long Context Data Engineering, a data driven method achieving the first 128k context open source model matching GPT4-level Needle in a…
Large Language Model (LLM) agents promise to free us from mundane tasks, but how should they best interact with our world? Introducing CodeAct, an agent {framework, instruction-tuning dataset, model}, employs executable Python code to unify the actions of LLM agents. 🧵1/

Zilu Tang (Peter) @ N... @Zilu_Tang_Peter
233 Followers 626 Following Boston University NLP @llamagrp, ex-IBM Research, MIT-IBM Watson AI lab, Rice bioengineering 2018, made in China
Grad @Grad62304977
4K Followers 2K Following
Zhennan Shen @ZShen0521
58 Followers 415 Following SJTU-CS-B.e: 2021~2025 🇨🇳 @sjtu1896 WPI 2026 spring incoming Robot PhD 🇺🇸
Nandan Thakur @beirmug
2K Followers 3K Following PhD @uwaterloo🌲 IR & NLP | I like good evals🔎 l Prev: intern @DbrxMosaicAI @GoogleAI & RA @UKPLab | https://t.co/kxQprYr7Xn, https://t.co/YVvVjSyXOS, TREC-RAG and FreshStack! ✨
Hanwen Wang @hwwang06
20 Followers 332 Following Michael | '27 Undergrad @HKUSTCSE | '25 Exchange-in @UofIllinois @siebelschool
Sree Bhattacharyya @SreeBee11
128 Followers 783 Following PhD student @ISTatPennState • Multimodal AI x Affective Computing • ex-Software Engineer @Microsoft • volunteering @ai4all
Eawage @Eawage070602
60 Followers 2K Following
Hanson Tang @HansonTang928
19 Followers 217 Following Undergrad @UofIllinois | Intern @TAMU SKY Lab | Ex-Intern @sjtu1896 EPIC Lab
Valentina Tardelli @ValentinaT32922
93 Followers 6K Following
• @ixrxxir
1 Followers 7K Following
Ezio Wang @Ezio21084936435
2 Followers 206 Following
Lin Elaine @Elaine_Lin080
9 Followers 453 Following Surgical robotics 🦾 | Medical devices 🏥 | Working at Intuitive Surgical to shape the future of minimally invasive care
Jiang @Jiang69596
0 Followers 7 Following
Sacramento King @SSacrament94313
76 Followers 2K Following
Joe Nguyen @ntanh14
54 Followers 827 Following Ph.D. Student in Visual Language Navigation @ Oregon State University
Yoram Bachrach @yorambac
3K Followers 7K Following Research Scientist at Meta (prev Google DeepMind and Microsoft Research). Working on LLM Agents and Multi-Agent Systems.
Eran Hirsch @hirscheran
344 Followers 701 Following PhD candidate @biunlp ; Tweets about NLP, ML and research
s @mkvrandomworks
4 Followers 1K Following
Helena R.S @Helenaisgood
889 Followers 7K Following Mom of a beautiful twin, lover girl and a sweet soul ....#itistimeforpeace ✡️✡️
Vamsi Mokari @MokariVamsi
0 Followers 87 Following
taesiri @taesiri
854 Followers 5K Following Research Scientist @ EA Sports, VLMs, Evals, All opinions are my own.
Yash Malik @_yash_malik_
92 Followers 1K Following ML @AmazonScience Scaling RL for LLMs Prev @Google, SC
Aditya Sinha @adityaasinha
1K Followers 4K Following Research @Netflix, MS CS at UIUC | Previously @GoogleAI, @MSFTResearch | BITS Pilani, Goa.
Pocket @PocketPriors
2K Followers 2K Following
Basit Mustafa @moltar81435
417 Followers 8K Following introverted but willing to discuss sanctuary moon innovation/ai + dsop/ai delivery @ https://t.co/nQ5pf3TzTZ
Young @younqchan
327 Followers 5K Following Researcher working on Out-of-Distribution Generalizable Reasoning and AI Scientist from causality perspective.
Wen-Ding Li @xu3kev
3K Followers 6K Following LLM for code and reasoning. PhD student at Cornell. Previously Student Researcher at @google. Previously intern at @theteamatx.
Jieyu Zhao @jieyuzhao11
3K Followers 824 Following Assistant Prof. @CSatUSC, @USC || Postdoc @ClipUMD || PhD from @UCLANLP, @UCLA. #NLP, #ML, #TrustworthyNLP
Tzu-Han Lin @tzuhan_0316
53 Followers 746 Following Research intern @UVA. MS student @NTU_TW, advised by @YunNungChen. Interested in LLM reasoning & language agents.
Jinghan Zhang @jinghan23
108 Followers 130 Following CSE PhD student @hkust in her second year advised by @junxian_he . Machine learning, NLP. bluesky here: https://t.co/ECxlKtKTxz
King Hong Chuang @KingHongChuang
30 Followers 2K Following
Adam Falls @AdamFalls172137
56 Followers 4K Following
Ning Shi (Shining) @MrShininnnnn
106 Followers 525 Following PhD student @AmiiThinks @UAlbertaCS; Volunteer @ReviewAcl; Mentor UR2PhD @CRAtweets; Formerly @BAAIBeijing, @AlibabaGroup, @GTOMSCS, @iSchoolSU, @NYUSPS; #NLPer
brendan chambers @societyoftrees
522 Followers 1K Following interconnected systems | humans+computers | ai research + engineering | societyoftrees bsky social
Cheng Tan @chengtan9907
221 Followers 366 Following A fourth-year CS PhD. student at @ZJU_China and @Westlake_Uni || Supervised by: Stan Z. Li.
yashwanth @yashwanth__e
172 Followers 1K Following tech & cats, undergrad researcher, deepl for life, a lil too employed, gpu poor @ hostel room
Kelvin 🦖🤓 @kelvinhan
44 Followers 1K Following #NLProc PhD LORIA/CNRS/Université de Lorraine, I work on generation of questions, from structured and unstructured data. https://t.co/3mUSCnSHTf
Taoran Li @TaoranLi3
13 Followers 314 Following Undergrad @ZJU_China and @ECEILLINOIS | MS @ECEILLINOIS | Trustworthy ML& AI Safety
Shumo Chu @shumochu
6K Followers 825 Following brewing a stealth mode AI startup. ex prof. @UCSBCS, ph.d. @UWCSE, eng. @Google
Luheng He @LuhengH
879 Followers 478 Following
Tianyin Xu @tianyin_xu
5K Followers 1K Following Watchman in a cornfield @IllinoisCDS @ECEILLINOIS @ACMSIGOPS
Manling Li @ManlingLi_
8K Followers 735 Following Assistant Professor@NU, Amazon Scholar, Postdoc@Stanford, PhD@UIUC #NLP #CV Language+Vision/EmbodiedAI, Reasoning, Planning, Compositionality, Trustworthiness
Mistral AI @MistralAI
156K Followers 0 Following Frontier AI in your hands. https://t.co/VdyEwpQsiy Apps: https://t.co/1vZA5XdBYo https://t.co/rj5G4u5sHu
AI at Meta @AIatMeta
712K Followers 288 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
UIUC NLP @uiuc_nlp
1K Followers 138 Following Natural Language Processing research group at The University of Illinois Urbana-Champaign @IllinoisCS @UofIllinois
Minyang Tian @MinyangTian1
130 Followers 116 Following PhD candidate at UIUC, co-advised by @haopeng_nlp and Eliu Huerta @argonne and @UChicago
Mike Lewis @ml_perception
8K Followers 242 Following Llama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.
Akari Asai @AkariAsai
18K Followers 867 Following Incoming Assistant Professor @SCSatCMU & research scientist @allen_ai. akariasai @ 🦋
Terra Blevins @TerraBlvns
847 Followers 469 Following Postdoc @ViennaNLP and incoming asst professor @Northeastern @KhouryCollege || PhD @uwnlp || she/her
Lifan Yuan @lifan__yuan
2K Followers 137 Following PhD student @uiuc_nlp @GoogleDeepMind. Prev: @TsinghuaNLP
May Fung @May_F1_
2K Followers 519 Following Assistant Professor, Hong Kong University of Science and Technology CSE 💻 Human-Centric Trustworthy AI/ML/NLP | Reasoning and Agents
Eunsol Choi @eunsolc
6K Followers 911 Following on natural language processing / machine learning. assistant prof at @NYUDataScience @NYU_Courant prev @UTCompSci @googleai, @uwcse, @Cornell.
Tom Sherborne @tomsherborne
971 Followers 284 Following code MTS @cohere ex: @edinburghnlp @allen_ai @cambridgenlp @ucl @apple.
Nelson Liu @nelsonfliu
4K Followers 845 Following @stanfordnlp PhD student. tweets auto-deleted periodically.
Liwei Jiang @liweijianglw
2K Followers 540 Following 姜力炜 • Ph.D. candidate @uwnlp | visiting @stanford | @nvidia @allen_ai 🧊 advancing AI & understanding humans & benefiting society 🏔️ lifetime adventurer
Doug Downey @_DougDowney
376 Followers 215 Following Researching AI for Science @allen_ai, Prof @northwesterncs
Matt Gardner @nlpmattg
9K Followers 121 Following Researcher at @ScaledCognition. Formerly at Semantic Machines, @allenai (@ai2_allennlp, #nlphighlights).
Rowan Zellers @rown
14K Followers 974 Following multimodal @thinkymachines. I also like to climb rocks and throw pottery. https://t.co/5Er4j39K71 (he/him)
Swabha Swayamdipta @swabhz
7K Followers 475 Following Assistant Prof. @CSatUSC | Researcher in #NLProc | Previously @uwnlp @allenai
Maarten Sap (he/him) @MaartenSap
5K Followers 634 Following retiring X acct: find me @maartensap.bsky Working on #NLProc for social good. Currently at @LTIatCMU, previously at @UWNLP, @MSFTResearch, and @allen_ai. 🏳🌈
Sean Ren @xiangrenNLP
13K Followers 546 Following Building @SaharaLabsAI 🍦| Professor @USCViterbi @nlp_usc | @MIT TR 35 , @ForbesUnder30 | Prev: @allen_ai, @Snapchat, @Stanford, @UofIllinois
Jason Eisner @adveisner
8K Followers 558 Following Professor of CS at Johns Hopkins University, ACL Fellow. My tweets speak only for me.
Diyi Yang @Diyi_Yang
18K Followers 2K Following Assistant Professor @Stanford CS @StanfordNLP @StanfordAILab LLMs for Humans
Daniel Khashabi 🕊�... @DanielKhashabi
3K Followers 910 Following I play with intuitions and data. Now: @jhuclsp @jhucompsci Past: @allen_ai @uwnlp @Penn @cogcomp, @Illinois_Alma, @MSFTResearch He/Him
Mohit Bansal @mohitban47
11K Followers 722 Following Parker Distinguished Prof @UNC. PECASE/AAAI Fellow. Director https://t.co/5qlPVgnrlN (@unc_ai_group). Past @Berkeley_AI @TTIC_Connect @IITKanpur #NLP #CV #AI
Dipanjan Das @dipanjand
6K Followers 320 Following Researcher at @GoogleDeepmind. Factuality and Gemini x Search.
Pradeep Dasigi @pdasigi
1K Followers 509 Following Senior Research Scientist @allen_ai; #NLProc, Post-training for OLMo
Greg Durrett @gregd_nlp
8K Followers 894 Following Associate professor at NYU (Courant CS + Center for Data Science) | advisor for @bespokelabsai | large language models and NLP | he/him
Lucy Lu Wang @lucyluwang
2K Followers 1K Following I am at https://t.co/1LW9HK5FY0 Asst professor @UW_iSchool; @allen_ai @SemanticScholar #nlproc #healthinformatics #scinlp #bionlp #openaccess she/her
Scott Yih @scottyih
2K Followers 844 Following Research Scientist at Meta Fundamental AI Research (FAIR)
Yuntian Deng @yuntiandeng
8K Followers 3K Following Assistant Professor @UWaterloo | Visiting Professor @NVIDIA | Associate @Harvard | Faculty Affiliate @VectorInst | Former Postdoc @ai2_mosaic | PhD @Harvard
Ankur Parikh @ank_parikh
3K Followers 4K Following Staff Research Scientist at Google DeepMind. Former adjunct assistant prof at @NYU_Courant. PhD at @mldcmu. ML for Bio/Chem (Prev. NLP). All opinions my own.
Lianhui Qin @Lianhuiq
6K Followers 465 Following Assistant Professor at UCSD CSE. NLP, ML, AI. I’m recruiting PhD students.
Mandar Joshi @mandarjoshi_
2K Followers 497 Following Research Scientist at Google DeepMind. Formerly CS/NLP PhD student at the University of Washington, Seattle. Here for cats, NLP, and politics.
Iz Beltagy @i_beltagy
2K Followers 414 Following Cofounder @SpiffyAI, Research Lead building OLMo at @allenai_org, formerly @UTCompSci PhD.
Data Mining Group@UIU... @dmguiuc
667 Followers 89 Following led by Prof. Jiawei Han. Data Mining, AI, ML, NLP
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Hanna Hajishirzi @HannaHajishirzi
9K Followers 443 Following Sr. Director of AI at @allen_ai, Prof at @uw_cse, lead OLMo, Tulu
Allen School @uwcse
11K Followers 3K Following The Paul G. Allen School of Computer Science & Engineering educates tomorrow's innovators while developing solutions to humanity's greatest challenges.
Yizhe Zhang @YizheZhangNLP
1K Followers 533 Following Research Scientist at Apple MLR | ex-researcher @ Microsoft Research, Meta AI | PhD @ Duke University
Chi Han @Glaciohound
760 Followers 265 Following CS PhD student at UIUC, interested in language models and their understanding.