NLP Group at @MIT_CSAIL! PIs: @yoonrkim @jacobandreas @lateinteraction @pliang279 @david_sontag, Jim Glass, @roger_p_levy · Cambridge, MA · Joined March 2025
For agents to improve over time, they can’t afford to forget what they’ve already mastered.
We found that supervised fine-tuning forgets more than RL when training on a new task!
Want to find out why? 👇
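For a concrete picture of what "forgets more" means, here is a minimal sketch (not the paper's actual harness) of how one might measure it: score the model on old tasks before and after new-task training and report the drop. `train` and `evaluate` are hypothetical stand-ins you supply.

```python
# Minimal sketch of measuring forgetting: compare old-task accuracy
# before and after training on a new task. `train` and `evaluate`
# are hypothetical callables, not the paper's actual harness.

def measure_forgetting(model, old_tasks, new_task, train, evaluate):
    """Return per-task accuracy drop on old tasks after new-task training."""
    before = {t: evaluate(model, t) for t in old_tasks}
    train(model, new_task)  # e.g., SFT, or an RL method such as PPO/GRPO
    after = {t: evaluate(model, t) for t in old_tasks}
    return {t: before[t] - after[t] for t in old_tasks}

# Run once with an SFT trainer and once with an RL trainer on copies of
# the same base model; the claim above is that the SFT drops are larger.
```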
Since my undergraduate days at CMU, I've been participating in puzzlehunts: complex, multi-step puzzles lacking well-defined problem statements, with creative and subtle hints and esoteric world knowledge, requiring language, spatial, and sometimes even physical…
✨New work on mathematical reasoning and attribution is now on arXiv! When given charts and questions, multimodal LLMs generate answers but often lack attribution (which granular chart elements drove the answer).
If it sounds interesting, please read arxiv.org/abs/2508.16850 🗞️
A bit late, but finally got around to posting the recorded and edited lecture videos for the **How to AI (Almost) Anything** course I taught at MIT in spring 2025.
YouTube playlist: youtube.com/watch?v=0MYt0u…
Course website and materials: mit-mi.github.io/how2ai-course/…
Today's AI can be…
It seems GPT-OSS is very prone to hallucinations… Check out our RLCR paper to see how we trained reasoning models to know what they don't know. Website 🌐 and code 💻 out today! rl-calibration.github.io 🚀
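The gist of RLCR, sketched under assumptions: reward the model not just for being right, but for stating a confidence that matches reality. A hedged sketch of such a calibration-aware reward, using a Brier-style penalty (the paper's exact reward shaping may differ):

```python
# Hedged sketch of a calibration-aware reward in the spirit of RLCR:
# correctness bonus plus a Brier-score penalty on the model's stated
# confidence. Not necessarily the paper's exact formulation.

def calibrated_reward(is_correct: bool, confidence: float) -> float:
    """Reward = correctness - (confidence - correctness)^2."""
    y = 1.0 if is_correct else 0.0
    brier = (confidence - y) ** 2  # 0 when stated confidence matches the outcome
    return y - brier               # confidently wrong answers are penalized most

# correct @ 0.9 confidence -> 0.99; wrong @ 0.9 -> -0.81; wrong @ 0.1 -> -0.01
```

Under this reward, the highest-scoring policy is one that answers correctly and says so, while hedging honestly when unsure.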
Scaling CLIP on English-only data is outdated now…
🌍We built a CLIP data-curation pipeline for 300+ languages
🇬🇧We train MetaCLIP 2 without compromising English-task performance (it actually improves!)
🥳It’s time to drop the language filter!
📝arxiv.org/abs/2507.22062
[1/5]
🧵
I'm currently in Vancouver for #ICML2025 this week and will present our work, "Understanding the Emergence of Multimodal Representation Alignment" later today at 4:30pm. Come by to chat!
If you are interested in questioning how we should pretrain models and create new architectures for general reasoning
- then check out E606 @ ICML, our position paper by @seungwookh and me on potential directions for the next generation of reasoning models!
Presenting our ICML spotlight poster today at 11am @ E-606 w/ @jyo_pari!
We need to fundamentally change how we train to achieve true reasoning.
Reward-based Pretraining (RPT) > Supervised Pretraining
Excited to be here at #ICML2025 to present our paper on 'pragmatic misalignment' in (deployed!) RAG systems: narrowly "accurate" responses that can be profoundly misinterpreted by readers.
It's especially dangerous for consequential domains like medicine! arxiv.org/pdf/2502.14898
I'll be presenting "(How) Do Language Models Track State" at ICML!
Come by our poster tomorrow, Tuesday July 15 from 4:30pm - 7pm to chat about LMs and whether/how they encode dynamic world models!
🔗 icml.cc/virtual/2025/p…
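One standard way to test whether an LM encodes dynamic world state is a linear probe on its hidden states. A minimal sketch under that assumption (not necessarily the paper's setup):

```python
# Hedged sketch of a state probe: train a linear classifier on hidden
# states to decode a world-state variable. Above-chance accuracy on
# held-out data is evidence the LM linearly encodes that state.
import torch
import torch.nn as nn

def train_state_probe(hidden_states, state_labels, n_classes, epochs=200, lr=1e-2):
    """hidden_states: (N, d) float tensor; state_labels: (N,) int tensor."""
    probe = nn.Linear(hidden_states.shape[1], n_classes)
    opt = torch.optim.Adam(probe.parameters(), lr=lr)
    for _ in range(epochs):
        opt.zero_grad()
        loss = nn.functional.cross_entropy(probe(hidden_states), state_labels)
        loss.backward()
        opt.step()
    with torch.no_grad():  # in practice, evaluate on a held-out split
        acc = (probe(hidden_states).argmax(-1) == state_labels).float().mean()
    return probe, acc.item()
```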
How do task vectors emerge during pretraining—and can they predict ICL performance?
Come see our ICML spotlight poster "Emergence and Effectiveness of Task Vectors in ICL" at 11am @ East Hall A-B (#E-2312) with @jinyeop_song!
🔗 icml.cc/virtual/2025/p…
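For readers new to task vectors, the basic recipe is to cache a hidden state from a few-shot prompt and inject it into a zero-shot run. A hedged sketch using generic forward hooks; the layer choice and injection site here are assumptions, not the paper's exact method:

```python
# Hedged sketch of the task-vector recipe: cache one hidden state from
# an ICL prompt, then add it back during a zero-shot forward pass.
import torch

@torch.no_grad()
def extract_task_vector(model, tokenizer, icl_prompt, layer):
    """Hidden state of the last prompt token at `layer`: a candidate task vector."""
    ids = tokenizer(icl_prompt, return_tensors="pt").input_ids
    hidden = model(ids, output_hidden_states=True).hidden_states
    return hidden[layer][0, -1]  # shape: (d_model,)

def inject_task_vector(block, vec):
    """Forward hook on a transformer block that adds `vec` at every position."""
    def hook(_module, _inputs, output):
        h = output[0] if isinstance(output, tuple) else output
        h = h + vec
        return (h,) + output[1:] if isinstance(output, tuple) else h
    return block.register_forward_hook(hook)  # call .remove() on the handle later

# Usage with a Hugging Face causal LM: extract from a few-shot prompt,
# register the hook on a middle block for a zero-shot query, and compare
# the model's answers with and without the injection.
```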
At #ICML 🇨🇦 this week.
I'm convinced that the core computations are shared across modalities (vision, text, audio, etc.). The real question is the (synthetic) generative process that ties them together.
Reach out if you have thoughts or want to chat!
I will be in Vancouver🇨🇦 for #ICML2025 this week and will present #SelfCite on Tuesday morning. Happy to chat and connect. See you there!
Blog post link: selfcite.github.io
Come check out our ICML poster on combining Test-Time Training and In-Context Learning for on-the-fly adaptation to novel tasks like ARC-AGI puzzles.
I will be presenting with @jyo_pari at E-2702, Tuesday 11-1:30!
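The core trick, sketched under assumptions (full fine-tuning of a per-task model copy with a user-supplied `loss_fn`; the actual system is more careful, e.g. parameter-efficient updates and data augmentation):

```python
# Hedged sketch of test-time training (TTT) on in-context demos:
# briefly fine-tune a per-task copy of the model on the task's
# demonstration pairs, then answer that task's query.
import copy
import torch

def ttt_predict(model, demos, query, loss_fn, steps=10, lr=1e-4):
    """Adapt a copy of `model` on (input, output) demos, then predict.

    demos: iterable of (x, y) pairs, e.g. ARC input/output grids.
    """
    m = copy.deepcopy(model)  # per-task copy; the base model stays frozen
    opt = torch.optim.AdamW(m.parameters(), lr=lr)
    m.train()
    for _ in range(steps):
        for x, y in demos:
            opt.zero_grad()
            loss_fn(m(x), y).backward()
            opt.step()
    m.eval()
    with torch.no_grad():
        return m(query)
```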