Tong Chen @tomchen0
PhD student @uwcse @uwnlp Joined February 2023-
Tweets135
-
Followers549
-
Following486
-
Likes182
š How to find more difficult/novel/salient evaluation data? ⨠Let the data generators find it for you! Introducing Data Swarms, multiple data generator LMs collaboratively search in the weight space to optimize quantitative desiderata of evaluation.
ā”š šš makes RL faster ā but at the cost of performance. We present š š„šš¬š”šš, the first šØš©šš§āš¬šØš®š«šš & š°šØš«š¤š¢š§š šš š«ššš¢š©š that applies šššš/š šš for rollout š°š¢šš”šØš®š š„šØš¬š¢š§š š©šš«ššØš«š¦šš§šš compared to šš šš! š Blog:ā¦
Remember āSon of Antonā from the Silicon Valley show(@SiliconHBO)? The experimental AI that āefficientlyā orders 4,000 lbs of meat while looking for a cheap burger and āfixesā a bug by deleting all the code? Itās starting to look a lot like reality. Even 18 months ago, my ownā¦
Remember āSon of Antonā from the Silicon Valley show(@SiliconHBO)? The experimental AI that āefficientlyā orders 4,000 lbs of meat while looking for a cheap burger and āfixesā a bug by deleting all the code? Itās starting to look a lot like reality. Even 18 months ago, my own⦠https://t.co/XsrYkqEIw0
WHY do you prefer something over another? Reward models treat preference as a black-boxš¶āš«ļøbut human brainsš§ decompose decisions into hidden attributes We built the first system to mirror how people really make decisions in our #COLM2025 paperšØPrefPalette⨠Why it mattersšš»š§µ
š¤ How do we train AI models that surpass their teachers? šØ In #COLM2025: āØDelta learning āØmakes LLM post-training cheap and easy ā with only weak data, we beat open 8B SOTA 𤯠The secret? Learn from the *differences* in weak data pairs! š arxiv.org/abs/2507.06187 š§µ below
Can data owners & LM developers collaborate to build a strong shared model while each retaining data control? Introducing FlexOlmošŖ, a mixture-of-experts LM enabling: ⢠Flexible training on your local data without sharing it ⢠Flexible inference to opt in/out your dataā¦
Can data owners & LM developers collaborate to build a strong shared model while each retaining data control? Introducing FlexOlmošŖ, a mixture-of-experts LM enabling: ⢠Flexible training on your local data without sharing it ⢠Flexible inference to opt in/out your data⦠https://t.co/Vnaaq6c6If
Reasoning benchmarks (e.g., MMLU Pro and GPQA) have seen little benefit from naive RAG. But can we flip this? š„Introducing CompactDS: ā Web-scale coverage ā Runs with just 100GB RAM ā Matches search engines The simplest RAG pipeline can even compete with agenticā¦
Worried about overfitting to IFEval? š¤ Use āØIFBench⨠our new, challenging instruction-following benchmark! Loved working w/ @valentina__py! Personal highlight: our multi-turn eval setting makes it possible to isolate constraint-following from the rest of the instruction š
Worried about overfitting to IFEval? š¤ Use āØIFBench⨠our new, challenging instruction-following benchmark! Loved working w/ @valentina__py! Personal highlight: our multi-turn eval setting makes it possible to isolate constraint-following from the rest of the instruction š
š”Beyond math/code, instruction following with verifiable constraints is suitable to be learned with RLVR. But the set of constraints and verifier functions is limited and most models overfit on IFEval. We introduce IFBench to measure model generalization to unseen constraints.
Are AI scientists already better than human researchers? We recruited 43 PhD students to spend 3 months executing research ideas proposed by an LLM agent vs human experts. Main finding: LLM ideas result in worse projects than human ideas.
Web data, the āfossil fuel of AIā, is being exhausted. Whatās next?š¤ We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats! arxiv.org/abs/2506.04689
Wanna š inside Internet-scale LLM training data w/o spending š°š°š°? Introducing infini-gram mini, an exact-match search engine with 14x less storage req than the OG infini-gram š We make 45.6 TB of text searchable. Read on to find our Web Interface, API, and more. (1/n) ā¬ļø
A bit late to announce, but Iām excited to share that I'll be starting as an assistant professor at the University of Maryland @umdcs this August. I'll be recruiting PhD students this upcoming cycle for fall 2026. (And if you're a UMD grad student, sign up for my fall seminar!)
LMs often output answers that sound right but arenāt supported by input context. This is intrinsic hallucination: the generation of plausible, but unsupported content. We propose Precise Information Control (PIC): a task requiring LMs to ground only on given verifiable claims.
LLMs are helpful for scientific research ā but will they continuously be helpful? Introducing šScienceMeter: current knowledge update methods enable 86% preservation of prior scientific knowledge, 72% acquisition of new, and 38%+ projection of future (arxiv.org/abs/2505.24302).
Next week on Wednesday, June 11th we're excited to welcome @StellaLisy for a session on "Spurious Rewards: Rethinking Training Signals in RLVR." Thanks to @AhmadMustafaAn1 for organizing this session! š„ Learn more: cohere.com/events/Cohere-ā¦
šØ New Paper! šØ Guard models slow, language-specific, and modality-limited? Meet OmniGuard that detects harmful prompts across multiple languages & modalities all using one approach with SOTA performance in all 3 modalities!! while being 120X faster š arxiv.org/abs/2505.23856
Thrilled to announce that I will be joining @UTAustin @UTCompSci as an assistant professor in fall 2026! I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! š¤ š¤

šš” @Kamooc6109
26 Followers 1K Following äøäŗŗåć«ćŖćć«ćÆ50幓ćÆććććć ćåćē¦ććŖćę²č¦³ćććŖććć£ćØę ¹ćę·±ćå¼µććć ćę ¹ćę·±ćå¼µć
Clara Smith @Nurulemylia8
113 Followers 5K Following Guiding @Elonmuskās vision for a better future through SpaceX, Tesla, Neuralink and more š I teach enthusiasts, dream chaser and innovation advocate š
Siffatjot Singh @siffatjot_singh
73 Followers 270 Following Running tests, breaking builds, sipping chai - SDE 2 @helloquash
mtg @mengtaigu
4 Followers 46 Following
Ulisses Walace @uwalace
85 Followers 47 Following
Guowei Xu @Kevin_GuoweiXu
879 Followers 331 Following Undergraduate student at Yao Class (Tsinghua University), interested in Language Models and Reinforcement Learning
degen_bobo š¤š @agostino90
638 Followers 4K Following
JP Liberte @jplibertee
7 Followers 100 Following
Boyuan Zheng@ICML @boyuan__zheng
774 Followers 804 Following Phd student at @osunlp | Research Intern at AI2 PRIOR @allen_ai | Previous: MS @jhuclsp; Intern @Amazon
Adam Zweiger @AdamZweiger
943 Followers 415 Following Rethinking how language models learn | Researcher @MIT_CSAIL
Guanghao Ye @guanghao_ye
396 Followers 804 Following PhD student @MIT_CSAIL working on optimization and LLM, intern @Bytedance Seed; previously at @UWCSE @Microsoft @Adobe
Mehul Damani @MehulDamani2
592 Followers 403 Following PhD-ing @mit @MIT_CSAIL | language models, reinforcement learning
Rod Pacocha @pacocha_ro82349
77 Followers 4K Following
Hao Cheng @kelvinih
89 Followers 133 Following Researcher @ Microsoft Research & Adjunct Faculty @ University of Washington
Mussa Kambi @MussaKambi77202
16 Followers 861 Following
Ibutui @Ibutui445864
91 Followers 2K Following
Joe Mayo @JoeMayo
16K Followers 7K Following Author and Independent Consultant Recent books: - Programming the Microsoft Bot Framework/MSPress - C# Cookbook/O'Reilly Agents, AI, Generative AI, MCP, RAG
Xinxi Lyu @XinxiLyu
81 Followers 30 Following PYI @allen_ai | Incoming PhD student @UofIllinois | BS/MS from @uwcse
Shannon Shen @shannonzshen
1K Followers 2K Following PhD Student @MIT_CSAIL | previously @allen_ai @semanticscholar @harvard @brownuniversity
Zoey Chen @ZoeyC17
974 Followers 561 Following PhD student at the University of Washington. I blog about computer vision, robotics and artificial intelligence at:https://t.co/wvaUVuFcWG
Helena R.S @Helenaisgood
889 Followers 7K Following Mom of a beautiful twin, lover girl and a sweet soul ....#itistimeforpeace ā”ļøā”ļø
Leo Liu @ZEYULIU10
1K Followers 2K Following PhD at UT Austin ex-{uw, isi, facebook} nlper Former intern @SFResearch
Kaiyuan Liu @KaiyuanLiu04
15 Followers 104 Following ICPC World Finalist, UW CS Undergrad looking for PhD position~
CV @CV6507645019208
2 Followers 19 Following
Victoria Graf @VictoriaWGraf
122 Followers 59 Following PhD student @uwnlp, Student Researcher @allen_ai, prev @princeton_nlp
Guang Yang @GuangYangNLP
27 Followers 70 Following PhD student @uwcse @uwnlp. I'm interested in AI for music, multimodal learning and NLP.
Pham Anh @PhamAnh38549729
0 Followers 13 Following
Slupui @Slupui92905
63 Followers 3K Following
Yuetai Li @yuetai12575
225 Followers 570 Following Second year PhD @UW | Post-Training, LLM reasoning and synthetic dataset. https://t.co/cYAkbnCsCp Open to chat and collaborate!
Gantavya Bhatt @BhattGantavya
709 Followers 2K Following Ph.D. Student @UW, @nvidia, working in data-efficient ML. Prev: @amazonscience, undergrad @iitdelhi. Passionate Photographer into Alpinism!
Cara @Cara1984320
367 Followers 6K Following
Zichun Yu @Zichun_Yu
136 Followers 82 Following Ph.D. student @LTIatCMU, working with Prof. Chenyan Xiong @XiongChenyan. Previously @TsinghuaNLP, working with Prof. Zhiyuan Liu @zibuyu9. LLM Pretrainer š
Liao Zhang @lideji1
72 Followers 1K Following Machine learning for theorem proving Neuro-symbolic learning
Tommy @Tommy_Tang_930
16 Followers 474 Following
Aladdin @king_quraishi1
145 Followers 7K Following
Ahmad Mustafa Anis @AhmadMustafaAn1
1K Followers 5K Following Computer Vision & Deep Learning @Roll_ai Deep Learning Enthusiastic Community Lead @Cohere_Labs Ex-Fellow @ PI School of AI
Kanwal Mehreen @KanwalMehreen2
33 Followers 171 Following
Neel Bhandari @NeelBhandari9
294 Followers 852 Following Masters Student @LTIatCMU | ML Scientist @PayPal | Open Research @CohereForAI Community | Previously External Research Student @MITIBMLab. Views my own.
shivansh puri @shivanshpu29280
178 Followers 3K Following
Nelson Liu @nelsonfliu
4K Followers 845 Following @stanfordnlp PhD student. tweets auto-deleted periodically.
Violet Peng @VioletNPeng
7K Followers 510 Following Associated Professor@UCLA-CS. Research NLP, AI creativity, controllable generation, model evaluation, computational journalism, event. (she/her/hers)
Robin Jia @robinomial
4K Followers 890 Following Assistant Professor @CSatUSC | Previously Visiting Researcher @facebookai | Stanford CS PhD @StanfordNLP
Ken Liu @kenziyuliu
2K Followers 875 Following CS PhD @StanfordAILab @StanfordNLP w/ @percyliang @sanmikoyejo. Past: DeepMind, CMU, USydney š¦šŗ
Jack Hessel @jmhessel
4K Followers 916 Following soon: @AnthropicAI. Seattle bike lane enjoyer. Opinions my own.
Boyuan Zheng@ICML @boyuan__zheng
774 Followers 804 Following Phd student at @osunlp | Research Intern at AI2 PRIOR @allen_ai | Previous: MS @jhuclsp; Intern @Amazon
Rima (Yining) Cao @YiningCao3
362 Followers 103 Following Human-Computer Interaction. Now second-year Ph.D. student @UCSanDiego @DesignLabUCSD. Master @UMich @umsi. Undergrad @Tsinghua_Uni
Guanghao Ye @guanghao_ye
396 Followers 804 Following PhD student @MIT_CSAIL working on optimization and LLM, intern @Bytedance Seed; previously at @UWCSE @Microsoft @Adobe
Skyler Hallinan @SkylerHallinan
232 Followers 268 Following Research Intern @samaya_AI | PhD student at @nlp_usc | Former: BS/MS student doing research in #NLProc at @uwcse @uwnlp | Previously research at @apple, @amazon
Mehul Damani @MehulDamani2
592 Followers 403 Following PhD-ing @mit @MIT_CSAIL | language models, reinforcement learning
OpenAI @OpenAI
4.3M Followers 3 Following OpenAIās mission is to ensure that artificial general intelligence benefits all of humanity. Weāre hiring: https://t.co/dJGr6Lg202
Eureka Labs @EurekaLabsAI
73K Followers 1 Following We are building a new kind of school that is AI native.
Gokul Swamy @g_k_swamy
4K Followers 1K Following phd candidate @CMU_Robotics. bs/ms @berkeley_ai. summers @GoogleAI, @msftresearch, @aurora_inno, @nvidia, @spacex. no model is an island. prefers email.
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Feng Yao @fengyao1909
1K Followers 635 Following Ph.D. student @UCSD_CSE | Intern @Amazon Rufus Foundation Model Ex. @MSFTResearch @TsinghuaNLP
Xinxi Lyu @XinxiLyu
81 Followers 30 Following PYI @allen_ai | Incoming PhD student @UofIllinois | BS/MS from @uwcse
Hao-Wen (Herman) Dong... @hermanhwdong
1K Followers 306 Following Assistant Professor at University of Michigan | PhD from UC San Diego | Human-Centered Generative AI for Content Creation
Jason Eisner @adveisner
8K Followers 558 Following Professor of CS at Johns Hopkins University, ACL Fellow. My tweets speak only for me.
JHU CLSP @jhuclsp
7K Followers 6K Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw @[email protected]
Yuhan Liu @YuhanLiu_nlp
462 Followers 886 Following CS PhD student @NYU_Courant advised by @eunsolc, previous intern @tsvetshop
Leo Liu @ZEYULIU10
1K Followers 2K Following PhD at UT Austin ex-{uw, isi, facebook} nlper Former intern @SFResearch
Shengjia Zhao @shengjia_zhao
52K Followers 231 Following Chief Scientist @ Meta MSL. Formerly MTS @ OpenAI, PhD @ Stanford. I train models. All opinions my own.
Zirui Liu @ziruirayliu
375 Followers 619 Following Assistant Professor of CS @UMNComputerSci | PhD @RiceUniversity
Lindia Tjuatja @lltjuatja
1K Followers 615 Following a natural language processor and āsensible linguistā. PhD-ing @LTIatCMU, previously BS-ing @UT_linguistics + @utexasece š¤ š¤š she/her
Victoria Graf @VictoriaWGraf
122 Followers 59 Following PhD student @uwnlp, Student Researcher @allen_ai, prev @princeton_nlp
Songlin Yang @SonglinYang4
12K Followers 3K Following research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust š³. she/her/hers
Infini-AI-Lab @InfiniAILab
1K Followers 37 Following
Jack Morris @jxmnop
45K Followers 979 Following research @cornell @meta // language models, information theory, science of AI
Sahil Verma @Sahil1V
593 Followers 1K Following PhD student @uwcse. Robustness and Interpretability. Currently at @MSFTResearch. Former intern at @amazon, @itsArthurAI. Undergrad @IITKanpur
Yuetai Li @yuetai12575
225 Followers 570 Following Second year PhD @UW | Post-Training, LLM reasoning and synthetic dataset. https://t.co/cYAkbnCsCp Open to chat and collaborate!
Xuandong Zhao @xuandongzhao
4K Followers 446 Following Postdoc@UC Berkeley CS; Research: ML, NLP, AI Safety
Yusan Lin @yusan_lin
8K Followers 385 Following Founder & CEO @mirrormirror_ai | Fashion & AI | Computer Science Ph.D. | Model @MDTagencyinc
Zoey Chen @ZoeyC17
974 Followers 561 Following PhD student at the University of Washington. I blog about computer vision, robotics and artificial intelligence at:https://t.co/wvaUVuFcWG
Linxing Preston Jiang @lpjiang97
326 Followers 186 Following PhD student @uwcse interested in theoretical neuroscience. Also @lpjiang97.bsky.social
Hattie Zhou @oh_that_hat
10K Followers 850 Following I want to understand things deeply and explain them well. Building friendly AI @AnthropicAI Give me anonymous feedback: https://t.co/7aBNrpbad8
Elias Stengel-Eskin @EliasEskin
2K Followers 1K Following NLP + AI assistant prof. @UTAustin CS, postdoc @uncnlp w/ @mohitban47, PhD @jhuclsp, @NSF grad fellow. Building communicative+collaborative AI.
Yucheng Lu @_yucheng_lu
404 Followers 798 Following Assistant Professor @nyushanghai in ML Systems. Prev @togethercompute.