Meng Li @limengnlp

PhD student @unipotsdam, supervised by @davidschlangen. Working on NLP, ML and CogSci. Prev @LstSaar. Former NLP engineer. limengnlp.github.io Joined December 2021

Tweets: 72
Followers: 28
Following: 697
Likes: 926

Yuchen Jin @Yuchenj_UW

a week ago

Ilya Sutskever: bald
Demis Hassabis: bald
Noam Shazeer: bald
Greg Brockman: bald
forget AGI. forget curing cancer. cure baldness now. My hairline is on gradient descent.

396 346 7K 585K 655

MiniMax (official) @MiniMax__AI

3 months ago

Day 1/5 of #MiniMaxWeek: We’re open-sourcing MiniMax-M1, our latest LLM, setting new standards in long-context reasoning.
- World’s longest context window: 1M-token input, 80k-token output
- State-of-the-art agentic use among open-source models
- RL at unmatched efficiency:…

83 311 1K 1.8M 672

Qingxiu Dong @qx_dong

3 months ago

⏰ We introduce Reinforcement Pre-Training (RPT🍒), reframing next-token prediction as a reasoning task using RLVR
✅ General-purpose reasoning
📑 Scalable RL on web corpus
📈 Stronger pre-training + RLVR results
🚀 Allows allocating more compute to specific tokens

31 149 955 106K 823

Meng Li @limengnlp

3 months ago

In Hinton's NN class, there is an interesting tip for getting a geometric view of high-dimensional space. I think authors of interpretability papers do the opposite: they stare at LLMs and pray that they are linear and interpretable.

Haitham Bou Ammar @hbouammar

3 months ago

0 1 13 574 2

0 0 0 84 0

Arnaud Bertrand @RnaudBertrand

4 months ago

I just read this WSJ article on why Europe's tech scene is so much smaller than the US's and China's. I'm afraid that, like most articles on this topic, it largely misses the mark. Which in itself illustrates a key reason why Europe is lagging behind: when you fail to…

697 1K 7K 810K 4K

Khanh Nguyen @khanhxuannguyen

9 months ago

📢 I am on the JOB market this year 📢 I am looking for both faculty and research scientist positions. My research makes AI agents useful and safe for humans. I enable them to effectively convey uncertainty, ask for help, learn from human feedback, and pursue goals that benefit…

3 17 42 7K 7

Kaiser Sun @KaiserWhoLearns

4 months ago

Excited to be at #NAACL2025! Let’s meet (and grab a Char's Zaku sticker 🚀).
📅 May 4, 11–12, RepL4NLP: "Amuro&Char: Analyzing the Relationship between Pre-Training and Fine-Tuning"
📅 May 2, 12 PM, Ballroom B: "SHADES: Towards a Multilingual Assessment of Stereotypes in LLMs"

0 4 18 1K 0

Gianluca Bencomo @gianlucabencomo

5 months ago

Every ChatGPT query costs more energy than the entire life of a fruit fly.

1 3 17 1K 5

Justine Moore @venturetwins

6 months ago

AI phone agent realizes it is talking to a parrot

217 678 9K 772K 3K

DeepSeek @deepseek_ai

7 months ago

🚀 Day 0: Warming up for #OpenSourceWeek! We're a tiny team @deepseek_ai exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency. These humble building blocks in our online service have been documented,…

1K 3K 21K 2.5M 2K

DeepSeek @deepseek_ai

8 months ago

🚀 DeepSeek-R1 is here! ⚡ Performance on par with OpenAI-o1 📖 Fully open-source model & technical report 🏆 MIT licensed: Distill & commercialize freely! 🌐 Website & API are live now! Try DeepThink at chat.deepseek.com today! 🐋 1/n

2K 7K 37K 12.4M 10K

Meng Li @limengnlp

11 months ago

If gpt wins the Nobel Prize in Literature 2024 ... 😂

0 0 0 205 0

Jürgen Schmidhuber @SchmidhuberAI

11 months ago

The #NobelPrizeinPhysics2024 for Hopfield & Hinton rewards plagiarism and incorrect attribution in computer science. It's mostly about Amari's "Hopfield network" and the "Boltzmann Machine." 1. The Lenz-Ising recurrent architecture with neuron-like elements was published in…

211 1K 5K 1.1M 3K

Rafael Rafailov @ NeurIPS @rm_rafailov

a year ago

My Bet: Strawberry is algorithm distillation/procedural cloning. Everyone right now is coming up with ways to distill System 2 into System 1, but that will always be limited. We need to train the model to run the algorithms, not just outputs (and post-train with RL of course).

9 46 471 124K 385

Aryaman Arora @aryaman2020

a year ago

rip

7 7 116 48K 15

Johan Edstedt @Parskatt

a year ago

Pretty fun paper, finetuning llama to produce blender code for synthetic renderings

6 88 623 64K 378

Alison Gopnik @AlisonGopnik

a year ago

Good Scientific American piece on the idea of AGI. I think, and argue here, that it's incoherent: there is no general intelligence, natural or artificial, but different cognitive abilities that often trade off. scientificamerican.com/article/what-d…

8 32 97 15K 72

Tomer Ullman @TomerUllman

a year ago

cognitive scientist: so the lesson of Clever Hans is we need..
engineer: more horses
cognitive scientist:
engineer: stacked horses. parallel horses. pooled horses. horse dropout. RL with horses in the loop.
cognitive scientist:
engineer: Hans is All You Need

20 96 581 55K 95

Bobby Sparks @rbzsparks

a year ago

What do preschool learning experiences look like? We examined variability in children’s language environments in a preschool classroom using a new dataset of naturalistic egocentric videos. 🧵 OSF: osf.io/preprints/psya… 🔗: escholarship.org/uc/item/94j9m5… @cogsci_soc #CogSci2024