Che-Ping Tsai @chepingt
PhD @mldcmu, interpretability and representation learning, machine learning theories. Joined November 2016-
Tweets61
-
Followers126
-
Following513
-
Likes624
[0/3] 🚀 Introducing Verlog – an open-source RL framework built specifically for training long-horizon, multi-turn LLM agents. 📊 Max episode length comparison: •VeRL / RAGEN → ~10 turns •verl-agent → ~50 turns •Verlog (ours) → 400+ turns 🔥 ⚙️ Technical foundation:…
Self-Questioning Language Models: LLMs that learn to generate their own questions and answers via asymmetric self-play RL. There is no external training data – the only input is a single prompt specifying the topic.
Yes, there is an official marking guideline from the IMO organizers which is not available externally. Without the evaluation based on that guideline, no medal claim can be made. With one point deducted, it is a Silver, not Gold.
Yes, there is an official marking guideline from the IMO organizers which is not available externally. Without the evaluation based on that guideline, no medal claim can be made. With one point deducted, it is a Silver, not Gold.
I'll be at ICML this week to present our take on "what we're really learning in representation learning and why it works." Our central argument: "Representations are learned from the association between input 𝑋 and context variable 𝐴"
Thrilled to share our #ICML2025 paper! We introduce a variational approach for speech language models, automating speech attribute learning to deliver more natural, human-like speech. Joint work b/w @LTIatCMU and @Apple Read it: arxiv.org/abs/2506.14767
Can LLM solve PDEs? 🤯 We present CodePDE, a framework that uses LLMs to automatically generate solvers for PDE and outperforms human implementation! 🚀 CodePDE demonstrates the power of inference-time algorithms and scaling for PDE solving. More in 🧵: #ML4PDE #AI4Science
Introducing e3 🔥 Best <2B model on math 💪 Are LLMs implementing algos ⚒️ OR is thinking an illusion 🎩.? Is RL only sharpening the base LLM distrib. 🤔 OR discovering novel strategies outside base LLM 💡? We answer these ⤵️ 🚨 arxiv.org/abs/2506.09026 🚨 matthewyryang.github.io/e3/
🔥Unlocking New Paradigm for Test-Time Scaling of Agents! We introduce Test-Time Interaction (TTI), which scales the number of interaction steps beyond thinking tokens per step. Our agents learn to act longer➡️richer exploration➡️better success Paper: arxiv.org/abs/2506.07976
Excited to share our work with my amazing collaborators, @Goodeat258, @SimulatedAnneal, @zicokolter, and Kaiming. In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,…
Data selection and curriculum learning can be formally viewed as a compression protocol via prequential coding. New blog (with @AllanZhou17 ) about this neat idea that motivated ADO but didn’t make it into the paper. yidingjiang.github.io/blog/post/curr…
In our #AISTATS2025 paper, we ask: when it is possible to recover a consistent joint distribution from conditionals? We propose path consistency and autoregressive path consistency—necessary and easily verifiable conditions. See you at Poster session 3, Monday 5th May.
Check out Runtian's thesis on contexture theory, which shows that many representation learning methods perform eigendecomposition on the context-induced linear operators. More papers coming soon—stay tuned!
Check out Runtian's thesis on contexture theory, which shows that many representation learning methods perform eigendecomposition on the context-induced linear operators. More papers coming soon—stay tuned!
Heading to #ICLR2025! Looking forward to discussions on LLMs (for tabular data), interpretability, and representation learning. I'll be presenting my internship project on LLMs for tabular anomaly detection — catch our poster on Sat, April 26 at 10am! Come say hi! @iclr_conf
Are current reasoning models optimal for test-time scaling? 🌠 No! Models make the same incorrect guess over and over again. We show that you can fix this problem w/o any crazy tricks 💫 – just do weight ensembling (WiSE-FT) for big gains on math! 1/N
Excited to share new work from my internship @GoogleAI ! Curious as to how we should measure the similarity between examples in pretraining datasets? We study the role of similarity in pretraining 1.7B parameter language models on the Pile. arxiv: arxiv.org/abs/2502.02494 1/🧵
Model-free deep RL algorithms like NFSP, PSRO, ESCHER, & R-NaD are tailor-made for games with hidden information (e.g. poker). We performed the largest-ever comparison of these algorithms. We find that they do not outperform generic policy gradient methods, such as PPO. 1/N
To trust LLMs in deployment (e.g., agentic frameworks or for generating synthetic data), we should predict how well they will perform. Our paper shows that we can do this by simply asking black-box models multiple follow-up questions! w/ @m_finzi and @zicokolter 1/ 🧵
Through 2024, scaling test-time compute has become key. But, what does it mean to use test-time compute effectively & efficiently + how to do it? 🤔 We wrote a blog post ✍️ with a conceptual perspective on this: blog.ml.cmu.edu/2025/01/08/opt… 🎯Answer: meta reinforcement learning 🧵⤵️
Introducing Content-Adaptive Tokenizer (CAT) 🐈! An image tokenizer that adapts token count based on image complexity, offering flexible 8x, 16x, or 32x compression! Unlike fixed-length tokenizers, CAT optimizes both representation efficiency and quality. Importantly, we use just…
We have just released SenSet, a novel list of 106 senescence marker genes. We hope this resource accelerates discoveries in aging research, cancer biology, and regenerative medicine. #senescence #aging #pulearning #gene-set #SenNet biorxiv.org/content/10.110…

Olga @Sefoj724626
22 Followers 1K Following In order to be irreplaceable one must always be different. — Coco Chanel
Trudy kidst @KidstTrudy
177 Followers 992 Following Crypto Trader| Researcher | Educational AI, RWA, DePin Content| Strategic Advisor | $5.5 M revenue
Andrew Rouditchenko �... @arouditchenko
445 Followers 544 Following PhD student at MIT working on multi-modal and multilingual speech. I was an intern at @AIatMeta and @Apple MLR.
Patrick Drake @time8machine
17K Followers 6K Following Neurodivergent physics student with a keen interest in multisensory integration and emergent perception. Exploring research on a proposed ‘sixth sense’. Δ
Ziqian Zhong @fjzzq2002
617 Followers 460 Following AI interp & alignment @CSDatCMU, prev @MIT @pika_labs
Mussa Kambi @MussaKambi77202
16 Followers 861 Following
Brandon Trabucco @brandontrabucco
1K Followers 354 Following Exploring new data @thinkymachines; @MLDCMU (PhD); @Berkeley_AI (Bachelors); @NDSEG Fellow; Singer and Composer
keesha @KeeshaBrown96
647 Followers 7K Following The wind is free to come and go, and we will meet when we are supposed to meet. If you decide to be brilliant, there is no mountain to block you, and no sea to
Emanuele Marconato @ema_marconato
623 Followers 794 Following Post-doc at @UniTrento_DISI. Previously, PhD in AI at @Unipisa and @UniTrento_DISI; visiting at @UCPH_Research. Travel addicted 🌴
Andreas Kirsch 🇺�... @BlackHC
14K Followers 6K Following My opinions only here. 👨🔬 RS @DeepMind, @Midjourney 1y 🧑🎓 DPhil @AIMS_oxford @UniofOxford 4.5y 🧙♂️ RE DeepMind 1y 📺 SWE @Google 3y 🎓 TUM 👤 @nwspk
Yoram Bachrach @yorambac
3K Followers 7K Following Research Scientist at Meta (prev Google DeepMind and Microsoft Research). Working on LLM Agents and Multi-Agent Systems.
Xiusi Chen @xiusi_chen
617 Followers 457 Following Postdoc @UofIllinois @uiuc_nlp, Ph.D. @UCLA, BS @PKU1898. RM-R1. Ex-Intern @AmazonScience (x2),@NECLabsAmerica. LLM, Neuro-Symbolic AI.
Zhiyu Zhang @imZhiyuZ
417 Followers 557 Following Machine learning learner. Postdoc at Carnegie Mellon University.
Ritabrata Ray @ritabrataray
245 Followers 2K Following PhD student @mldcmu, previously MS @Cornell, and undergrad @IITKgp
Jian Kang @jiank_uiuc
2K Followers 1K Following stats & data sci, comp sci @mbzuai | comp sci @uofr (on leave) | phd @siebelschool @ideaisailuiuc | ex-intern @aiatmeta modeling the interconnected world
Lucio Dery Jnr Mwinm @derylucio
537 Followers 991 Following
Sung-Feng Huang @SungFengHuang
73 Followers 255 Following National Taiwan University | Speech Processing & Machine Learning Lab @ntu_spml
Wei-Lin Chiang @infwinston
5K Followers 937 Following Building @lmarena_ai @UCBerkeley PhD in AI & systems
Calvin Luo @calvinyluo
825 Followers 225 Following PhD Student @BrownUniversity. Former @GoogleAI Resident. @UofT Alum.
Yash Savani @yashsavani_
280 Followers 693 Following PhD student @CSDatCMU with Zico Kolter | prev research scientist @abacusai, ml eng @primer_ai | prev prev CS+Stats @Stanford @StanfordAILab
rebecca yu @reb_yu
123 Followers 204 Following phd @mldcmu | bme/cs ‘21 @johnshopkins | co-founder @jhuwmw
veja @veja_xu
1 Followers 100 Following
Fahim Tajwar @FahimTajwar10
627 Followers 355 Following PhD Student @mldcmu @SCSatCMU BS/MS from @Stanford
Heng-Jui Chang @hjchang87
168 Followers 165 Following 🎓 PhD Candidate @MIT_CSAIL 🧪 Research Scientist Intern @AIatMeta
Sairslor @SairslorV4wA
29 Followers 556 Following
Euxhen Hasanaj @EuxhenH
83 Followers 124 Following Research Scientist @genbioai | PhD, Machine Learning @mldcmu, AI x Biotech
Yifei Wang @yifeiwang77
2K Followers 2K Following Postdoc @MIT_CSAIL. Self-supervised learning. Foundation Models. AI Safety. Prior BS+BA+PhD @PKU1898.
Youngseog Chung @YoungseogC
465 Followers 618 Following PhD student at @mldcmu, @AutonLab | Jazz enthusiast | Tennis player
Chao-Wei Huang @cwhuang_wh
213 Followers 1K Following PhD candidate at National Taiwan University. Former intern @AmazonScience and @AIatMeta. NLP, Retrieval, and Dialogue Systems.
Ethan (Yusheng) Su @thu_yushengsu
692 Followers 1K Following Researcher @AMD | Postdoc @ https://t.co/eNDjysZ4dD | Ph.D @TsinghuaNLP | Intern @Microsoft.
kovariance @kovariance
100 Followers 7K Following
Sukjun (June) Hwang @sukjun_hwang
3K Followers 300 Following ML PhD student @mldcmu advised by @_albertgu
Ashwinee Panda @PandaAshwinee
3K Followers 724 Following Postdoc of @tomgoldsteincs, PhD @princeton, @Cal alum, currently working on LLMs
Chen Wu @ChenHenryWu
612 Followers 598 Following phd student @CMU_Robotics | prev. undergrad @Tsinghua_Uni
Hao-Ping (Hank) Lee @hankhplee
794 Followers 503 Following PhD student @cmuhcii | user-centered privacy in AI | prev: @IBMResearch @MSFTResearch @brave @GeorgiaTech | @hankhplee.bsky.social 🦋
Sang Choe @sangkeun_choe
327 Followers 190 Following pretraining @anthropicAI | prev cs phd @carnegiemellon
Burak Varıcı @VariciBurak
141 Followers 414 Following Postdoc at @mldcmu / PhD at @rpi / 🇹🇷 /~burakvarici at Bluesky
Hongyi Wang @HongyiWang10
2K Followers 2K Following Assist. Prof. @RutgersCS; Head of Infra @genbioai; Ex @mldcmu @WisconsinCS
Arman Adibi @arman_adibi23
685 Followers 3K Following Assistant Professor, @AUG_Cyber |Postdoc @Princeton | Ph.D. from @Penn, @WarrenCntrPenn | Studying machine learning and optimization.
Sherry Tongshuang Wu @tongshuangwu
6K Followers 1K Following Assist. Prof @SCSatCMU , CS PhD @uwcse. HCI+AI, map general-purpose models to specific use cases! prev. intern @MSFTResearch @GoogleAI @Apple. She/her.
Logan Kilpatrick @OfficialLoganK
210K Followers 2K Following Lead product for @GoogleAIStudio + the Gemini API. My views!
Rishabh Agarwal @agarwl_
17K Followers 791 Following Reinforcement Learner, Adjunct Prof at McGill. Ex MSL Meta, DeepMind, Brain, Mila, IIT Bombay. NeurIPS Best Paper
Songlin Yang @SonglinYang4
12K Followers 3K Following research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
Hanna Hajishirzi @HannaHajishirzi
9K Followers 443 Following Sr. Director of AI at @allen_ai, Prof at @uw_cse, lead OLMo, Tulu
Akari Asai @AkariAsai
18K Followers 868 Following Incoming Assistant Professor @SCSatCMU & research scientist @allen_ai. akariasai @ 🦋
Zhiqing Sun @EdwardSun0909
19K Followers 1K Following Agents @Meta MSL TBD Lab. previously posttraining research @OpenAI train LLMs to do things: deep research, chatgpt agent, etc. CS PhD @LTIatCMU
Federico Baldassarre @BaldassarreFe
1K Followers 452 Following Postdoctoral Researcher @AIatMeta: DINOv3 and world models. PhD @kth_rpl: deep learning explainability, concept-based visual representations and reasoning.
Guangxuan Xiao @Guangxuan_Xiao
3K Followers 698 Following Ph.D. student at @MITEECS Prev: CS & Finance @Tsinghua_Uni
Dimitris Papailiopoul... @DimitrisPapail
20K Followers 1K Following Researcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); learning in context; thinking about reasoning; babas of Inez Lily.
Jiaxin Shi @thjashin
4K Followers 348 Following Research Scientist @GoogleDeepMind | prev @Stanford @MSRNE @VectorInst @RIKEN_AIP_EN @Tsinghua_Uni. Building probabilistic & algorithmic models for learning.
Demis Hassabis @demishassabis
489K Followers 146 Following Nobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
Laurens van der Maate... @lvdmaaten
4K Followers 2K Following Member of Technical Staff at Anthropic. Ex-Meta. t-SNE. Llama 3. DenseNet. Web-scale weakly supervised vision. CrypTen.
Jason Weston @jaseweston
13K Followers 713 Following @Meta+NYU. NLP from scratch(Pretrain+FT LLM) 2008, MemNet (pre-Transformer) 2015, DrQA(pre-RAG) 2017, BlenderBot(dialog pre-ChatGPT) 2018+,Self-Rewarding+more!
Ziqian Zhong @fjzzq2002
617 Followers 460 Following AI interp & alignment @CSDatCMU, prev @MIT @pika_labs
Alexander Wei @alexwei_
24K Followers 193 Following Reasoning @OpenAI. Co-built CICERO @MetaAI | @Berkeley_AI PhD '23 | @Harvard '20
Trapit Bansal @TrapitBansal
32K Followers 247 Following AI Research @Meta | Co-Creator of OpenAI o1 | Previously @OpenAI, @MSFTResearch, @GoogleAI, @facebook, @iiscbangalore, and undergrad @IITKanpur
Danny To Eun Kim @TEKnologyy
547 Followers 1K Following PhD student @LTIatCMU working with @841io on NLP & IR | Prev: MEng @ai_ucl
Diyi Yang @Diyi_Yang
18K Followers 2K Following Assistant Professor @Stanford CS @StanfordNLP @StanfordAILab LLMs for Humans
John Hewitt @johnhewtt
6K Followers 47 Following Assistant Prof @columbia CS. Visiting Researcher @ Google DeepMind. PhD from @stanfordnlp. Language x Neural Nets.
Dan Zhang @DZhang50
4K Followers 998 Following Gemini Model+HW Codesign @ Google DeepMind | Computer Architecture PhD @ UT Austin🤘 | Opinions stated here are my own.
DatologyAI @datologyai
2K Followers 11 Following DatologyAI builds tools to automatically select and optimize the best data on which to train AI models, leading to better, smaller models which train faster.
Seohong Park @seohong_park
4K Followers 532 Following Reinforcement learning | CS Ph.D. student @berkeley_ai
Emanuele Marconato @ema_marconato
623 Followers 794 Following Post-doc at @UniTrento_DISI. Previously, PhD in AI at @Unipisa and @UniTrento_DISI; visiting at @UCPH_Research. Travel addicted 🌴
Andreas Kirsch 🇺�... @BlackHC
14K Followers 6K Following My opinions only here. 👨🔬 RS @DeepMind, @Midjourney 1y 🧑🎓 DPhil @AIMS_oxford @UniofOxford 4.5y 🧙♂️ RE DeepMind 1y 📺 SWE @Google 3y 🎓 TUM 👤 @nwspk
Rishi Jha @rishi_d_jha
915 Followers 27 Following CS PhD student @Cornell_CS! Currently a Research Intern @Microsoft. Prev. @uwcse and UW Math.
Randall Balestriero @randall_balestr
4K Followers 221 Following AI Researcher: From theory to practice (and back) Postdoc @MetaAI with @ylecun PhD @RiceUniversity with @rbaraniuk Masters @ENS_Ulm @Paris_Sorbonne
Shirley Wu @ShirleyYXWu
3K Followers 295 Following CS PhD candidate @Stanford working w/ @jure & @james_y_zou on LLM agents and alignment | Prev USTC, Intern @MSFTResearch, @NUSingapore
Yoram Bachrach @yorambac
3K Followers 7K Following Research Scientist at Meta (prev Google DeepMind and Microsoft Research). Working on LLM Agents and Multi-Agent Systems.
Xiusi Chen @xiusi_chen
617 Followers 457 Following Postdoc @UofIllinois @uiuc_nlp, Ph.D. @UCLA, BS @PKU1898. RM-R1. Ex-Intern @AmazonScience (x2),@NECLabsAmerica. LLM, Neuro-Symbolic AI.
Ritabrata Ray @ritabrataray
245 Followers 2K Following PhD student @mldcmu, previously MS @Cornell, and undergrad @IITKgp
Dylan Foster 🐢 @canondetortugas
3K Followers 1K Following Foundations of RL/AI @MSFTResearch. Previously @MIT @Cornell_CS https://t.co/vQIdUzsw8B RL Theory Lecture Notes: https://t.co/bhgL3aKIk0
Xuandong Zhao @xuandongzhao
4K Followers 446 Following Postdoc@UC Berkeley CS; Research: ML, NLP, AI Safety
Beidi Chen @BeidiChen
15K Followers 399 Following Asst. Prof @CarnegieMellon, @amazon Scholar, Prev: Visiting Researcher @Meta, Postdoc @Stanford, Ph.D. @RiceUniversity, Large-Scale ML, a fan of Dota2.
Nicholas Boffi @nmboffi
895 Followers 893 Following Assistant Professor @mldcmu. Building generative models for science, engineering, and AI. Previously @Harvard, @MIT, @GoogleAI, @NYU_Courant.
Yunzhu Li @YunzhuLiYZ
7K Followers 543 Following Assistant Professor of Computer Science @Columbia @ColumbiaCompSci, Postdoc from @Stanford @StanfordSVL, PhD from @MIT_CSAIL. #Robotics #Vision #Learning
Yining Hong @yining_hong
3K Followers 170 Following 💻Postdoc in CS AI @stanford | 🤖embodied 3D foundation models | 3D-LLMs | embodied world models | Musician -🎸Multi-Instrumentalist & Composer | Metalhead 🤘🏼
Thomas Weng @thomas_weng
866 Followers 386 Following Robotics Research Scientist at The AI Institute | @CMU_Robotics PhD