Wenjun Li @liwenjun2016

Ph.D. Student @sgSMU, LLM x RL. wenjunli-0.github.io Singapore Joined September 2016

Tweets

23
Followers

53
Following

50
Likes

32

Andrei Lupu @_andreilupu

6 months ago

Lol, smart authors 😂

4 6 146 27K 32

Download Image

1 0 3 160 0

Wenjun Li @liwenjun2016

5 months ago

A clearer and more direct explanation to Dr. GRPO.

Zichen Liu @zzlccc

5 months ago

A clearer and more direct explanation to Dr. GRPO.

3 50 271 22K 195

Download Image

0 0 7 200 1

Wenjun Li @liwenjun2016

6 months ago

🚀 Our new findings continue to unravel the mysteries of R1-Zero-like training! 📢 We identify that BIAS in GRPO leads to longer responses—so we fixed it. ✅ GRPO Done Right → 7B SOTA on AIME!

Zichen Liu @zzlccc

6 months ago

26 185 1K 295K 1K

Download Image

0 0 7 250 2

🪂Understanding R1-Zero-Like Training: A Critical Perspective * DeepSeek-V3-Base already exhibits "Aha moment" before RL-tuning?? * The ever-increasing output length in RL-tuning might be due to a BIAS in GRPO?? * Getting GRPO Done Right, we achieve a 7B AIME sota! 🧵 📜Full…

26 185 1K 295K 1K

Download Image

Wenjun Li @liwenjun2016

7 months ago

lol😂

0 0 0 42 0

Download Image

Wenjun Li @liwenjun2016

7 months ago

rule-based reward shaping is the reason behind

Zichen Liu @zzlccc

7 months ago

rule-based reward shaping is the reason behind

2 2 44 3K 8

Download Image

0 0 0 96 0

Wenjun Li @liwenjun2016

7 months ago

these are the singal words

Zichen Liu @zzlccc

7 months ago

these are the singal words

3 2 51 5K 8

Download Image

0 0 0 66 0

Wenjun Li @liwenjun2016

7 months ago

we foubd base models can directly perform CoT and self-reflection. Check out the example below.

Zichen Liu @zzlccc

7 months ago

we foubd base models can directly perform CoT and self-reflection. Check out the example below.

1 2 36 4K 5

Download Image

0 0 0 50 0

Wenjun Li @liwenjun2016

7 months ago

Surprisingly, Aha moment appears at epoch 0, the base models.

Zichen Liu @zzlccc

7 months ago

Surprisingly, Aha moment appears at epoch 0, the base models.

18 74 472 116K 480

0 0 1 62 0

Airdrop Inspector @airdropinspect

4 years ago

New airdrop: Kubinex Finance (KUBIX) Reward: 2500 KUBIX ($125) Rate: ⭐️ ⭐️⭐️⭐️ Remarks: 150,000,000 KUBIX to be Airdropped Distribution: From 20th May Bot Airdrop Link: t.me/airdropinspect… #Airdrop #Airdrops #Airdropinspector #BinanceSmartChain #BSC #BNB #Bitcoin #Crypto

414 5K 5K 0 11

Kubinex FInance @Kubinexfinance

4 years ago

KUBIX Limited Time Airdrop. Total Airdrop Reward 150,000,000 $KUBIX. Airdrop Duration 6 May to 15 May #DeFi #ido #BNB #BNB #BEP20 #BinanceSmartChain #Kubix #Kubinex #Crypto #ico #altcoin link.medium.com/kJPuK5rI2fb

1K 2K 3K 0 14

Deebee @Deebee0182395

2 Followers 203 Following

Zrirtirv @Zrirtirv92510

0 Followers 214 Following

Truiheg @Truiheg0274

0 Followers 133 Following

NLP/ML Researcher (working on developing GenAI and its human-centric applications) & Ex-@JD_Corporate @TencentGlobal @Sydney_Uni. Opinions are my own.

Liang Ding @liangdingNLP

781 Followers 2K Following NLP/ML Researcher (working on developing GenAI and its human-centric applications) & Ex-@JD_Corporate @TencentGlobal @Sydney_Uni. Opinions are my own.

Sethiski @SethiskiG4FtA_

55 Followers 4K Following Wealth is the test of a man's character.

Dung Doan @dungdx34

332 Followers 7K Following

Fahad Shah @sfahad

935 Followers 8K Following @Leadership @DataScience @HP @AzureML @Happily Married 😊

Avinandan Bose @avibose22

155 Followers 303 Following 3rd Year PhD @UWCSE | Visiting Researcher FAIR @AiatMeta I Prev. Research Engineer @sgSMU | CSE @IITKanpur '22

postdoc @OxCSML @NatureRecovery 🌱
AI for Social Good @barefootlaw_org 🌍
prev @ClopathLab @TheTeamAtX @ucl
@klarakaleb.bsky.social

Klara Kaleb @klarakaleb

496 Followers 3K Following postdoc @OxCSML @NatureRecovery 🌱 AI for Social Good @barefootlaw_org 🌍 prev @ClopathLab @TheTeamAtX @ucl @klarakaleb.bsky.social

AfSept. @Shown1088

13 Followers 315 Following

L @CodeTitanium

101 Followers 5K Following

Sidney Tio @SeedneyTio

19 Followers 128 Following Research Intern @SakanaAILabs | PhD Student in Reinforcement Learning, Curriculum Learning @sgSMU 🇸🇬

Carolyn @draincarolyn38

167 Followers 3K Following

Ivan Radkevich @radke149

39 Followers 1K Following La vie n'a pas d'prix Mais la muerte coute cher @UMNComputerSci

Grad @Grad62304977

4K Followers 2K Following

Bushrebate @bushrebate38761

0 Followers 10 Following

leloy! @leloykun

7K Followers 4K Following Math @ AdMU • NanoGPT speedrunner • Muon fan 🤍 • prev ML @ XPD • 2x IOI & 2x ICPC • https://t.co/nfO038itfn

oeohomos @joeohomos

48 Followers 2K Following

Jeff Coggshall @voxmenthe

2K Followers 8K Following Escaping local minima.

Alpay Ariyak @AlpayAriyak

3K Followers 3K Following Post-Training Lead @ Together AI | OpenChat Project Lead (#1 7B LLM on Arena for 2+ months, 2M+ downloads) | DeepCoder, DeepSWE

Zark Muckerburg @zplus1_

22 Followers 1K Following ai/ml open for research internship roles in computer vision (diffusion / rect flow specifically)

Mr. Jack Tung @MrJackTung

294 Followers 6K Following

Stephen Oates @stephenjaoates

810 Followers 7K Following

I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.

Ofir Press @OfirPress

15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.

Chris Song @fakechris

177 Followers 3K Following python java voip scala datamining

Xiaosen Zheng @xszheng2020

597 Followers 2K Following Researcher @ TikTok 📄 RegMix 💼 Past: PhD @sgSMU | Intern @SeaAIL 🧠 Interests: Data-Centric AI | Code AI

Runxin Xu @pigjunebaba

7K Followers 3K Following AI researcher @deepseek_ai | @PKU1898 | @SJTU1896 Opinions are my own.

Adam Falls @AdamFalls172137

53 Followers 4K Following

Robert Washbourne @rawsh0

202 Followers 2K Following ai @ zyphra

Penghui Qi @QPHutu

121 Followers 105 Following Senior Research Engineer @SeaAIL PhD student @NUSingapore Working on RL, LLM Reasoning, and MLSys.

🚀 AISecHub | AI & Cybersecurity | Discussing AI-driven threats, securing AI systems, and sharing insights on emerging challenges 💡

AISecHub @AISecHub

4K Followers 4K Following 🚀 AISecHub | AI & Cybersecurity | Discussing AI-driven threats, securing AI systems, and sharing insights on emerging challenges 💡

Eva Louise Marie Gabr... @e681554349

11 Followers 7K Following

Louella Gala ❤️ M... @GalaLouell59785

10 Followers 106 Following

succint_zk @KevinLove3971

222 Followers 7K Following zk money, zk frontier, zk life, zk_immune

On the quest to understand the fundamental mathematics of intelligence and of the universe with curiosity. https://t.co/mMchI2d4pg Upskilling @StanfordOnline

Burny - Effective Cur... @burny_tech

19K Followers 8K Following On the quest to understand the fundamental mathematics of intelligence and of the universe with curiosity. https://t.co/mMchI2d4pg Upskilling @StanfordOnline

Harpinder Jot Singh @singhhcoder

96 Followers 2K Following Building @qordinate_ai Past: AI @DevRev

🇸🇬Research Scientist at Sea AI Lab @SeaGroup; 👨🏻‍🎓PhD/BS from @Tsinghua_Uni and ex-@MSFTResearch; 🛡️Trustworthy AI and Generative Models.

Tianyu Pang @TianyuPang1

1K Followers 311 Following 🇸🇬Research Scientist at Sea AI Lab @SeaGroup; 👨🏻‍🎓PhD/BS from @Tsinghua_Uni and ex-@MSFTResearch; 🛡️Trustworthy AI and Generative Models.

Jun (Richard) Wang @AI_richard

35 Followers 1K Following

Morgan McGuire @morgymcg

3K Followers 4K Following Applied AI @weights_biases | ex-Facebook Safety | https://t.co/a7i7G5dkLG | 🇮🇪 | Came for the bants, stayed for the rants

Shital Shah @sytelus

13K Followers 11K Following Mostly research and code. If universe is an optimizer, what is its loss function? All opinions are my own.

Hamid Eghbalzadeh @heghbalz

3K Followers 6K Following AI Research , Opinions @ MyOwn.

Nathan Benaich @nathanbenaich

61K Followers 34K Following solo member of investment staff @airstreet @airstreetpress @stateofaireport @raais

Changyu Chen @Cameron_Chann

252 Followers 244 Following PhD student @sgSMU. RL x LLMs. Previously @NTUsg, @ZJU_China Post-training for Sailor2

Zichen Liu @zzlccc

3K Followers 356 Following PhD student, RL believer @SeaAIL @NUSingapore | 💻 📄 🏸

Varun Bhatt @vbhatt_cs

71 Followers 60 Following PhD Student at the University of Southern California

。。。。 @ElmaJob443574

83 Followers 620 Following 我想找你玩儿，但妈妈说女孩儿要矜持，所以我换头像暗示你。希望你能主动找我聊天，给我个面子，谢谢！

MoMo @XiaoPangHuBB

5 Followers 5K Following

Kai Wang @kaiwang_gua

963 Followers 340 Following Assistant Professor @ Georgia Tech CSE

Arunesh Sinha @aruneshsinha

147 Followers 114 Following Assistant Professor at Rutgers Business School, Interested in anything interesting!

Professor of Computer Science, Lee Kuan Yew Fellow, School of Computing and Information Systems, Singapore Management University

Pradeep Varakantham @PradeepVarakan1

148 Followers 180 Following Professor of Computer Science, Lee Kuan Yew Fellow, School of Computing and Information Systems, Singapore Management University

Jason Wei @_jasonwei

98K Followers 634 Following ai researcher @meta superintelligence labs, past: openai, google 🧠

Yi Tay @YiTayML

46K Followers 81 Following research scientist @googledeepmind ✨♊, model co-lead/captain of gemini deepthink imo gold medal 🥇, opinions are my own.

AI Prof @Stanford | CEO & Cofounder @InceptionAILabs
| Co-inventor of DDIM, FlashAttention, DPO, GAIL, and score-based/diffusion models

Stefano Ermon @StefanoErmon

20K Followers 373 Following AI Prof @Stanford | CEO & Cofounder @InceptionAILabs | Co-inventor of DDIM, FlashAttention, DPO, GAIL, and score-based/diffusion models

Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log

Lilian Weng @lilianweng

163K Followers 166 Following Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log

fly51fly @fly51fly

8K Followers 2K Following BUPT prof | Sharing latest AI papers & insights | Join me in embracing the AI revolution! #MachineLearning #AI #Innovation

Anna Goldie @annadgoldie

7K Followers 137 Following Senior Staff Research Scientist at @GoogleDeepMind. Prev: @AnthropicAI, @StanfordNLP, @MIT. AlphaChip co-lead.

Sidney Tio @SeedneyTio

19 Followers 128 Following Research Intern @SakanaAILabs | PhD Student in Reinforcement Learning, Curriculum Learning @sgSMU 🇸🇬

Quentin Gallouédec @QGallouedec

3K Followers 664 Following PhD - Research @huggingface 🤗 TRL lead maintainer 🇫🇷 in 🇨🇦

Penghui Qi @QPHutu

121 Followers 105 Following Senior Research Engineer @SeaAIL PhD student @NUSingapore Working on RL, LLM Reasoning, and MLSys.

Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc
Contact via email.
Writes @interconnectsai
Wrote The RLHF Book
Mountain runner

Nathan Lambert @natolambert

56K Followers 853 Following Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc Contact via email. Writes @interconnectsai Wrote The RLHF Book Mountain runner

Alexander Doria @Dorialexander

19K Followers 4K Following Reasoning models to come. Co-founder @pleiasfr

Chujie Zheng @ChujieZheng

6K Followers 301 Following Researcher @Alibaba_Qwen | GSPO, Qwen3, QwQ, ProcessBench | Opinions are my own

AI researcher. Interested in Reasoning, Multimodal. I direct TIGER-Lab. Author of PoT, MMMU, MMLU-Pro, MAmmoTH, LongRAG, MAP-Neo, YuE, VL-Rethinker

Wenhu Chen @WenhuChen

22K Followers 664 Following AI researcher. Interested in Reasoning, Multimodal. I direct TIGER-Lab. Author of PoT, MMMU, MMLU-Pro, MAmmoTH, LongRAG, MAP-Neo, YuE, VL-Rethinker

Runxin Xu @pigjunebaba

7K Followers 3K Following AI researcher @deepseek_ai | @PKU1898 | @SJTU1896 Opinions are my own.

Xiaosen Zheng @xszheng2020

597 Followers 2K Following Researcher @ TikTok 📄 RegMix 💼 Past: PhD @sgSMU | Intern @SeaAIL 🧠 Interests: Data-Centric AI | Code AI

Geoffrey Hinton @geoffreyhinton

498K Followers 28 Following deep learning

John Schulman @johnschulman2

65K Followers 1K Following Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music

PhD @PKU1898; Researcher @deepseek_ai; Recent: DeepSeek-R1/CoderV2/Math/V1/V2/V3, Mathshepherd, FairEval, Speculative Decoding.

Peiyi Wang @sybilhyz

11K Followers 302 Following PhD @PKU1898; Researcher @deepseek_ai; Recent: DeepSeek-R1/CoderV2/Math/V1/V2/V3, Mathshepherd, FairEval, Speculative Decoding.

Min Lin @mavenlin

179 Followers 203 Following

Chao Du @duchao0726

107 Followers 129 Following Research Scientist @SeaGroup; Prev Ph.D. @Tsinghua_Uni;

Junxian He @junxian_he

6K Followers 643 Following Assist. Prof @hkust. NLP/ML PhD @LTIatCMU.

Researcher @ TikTok 🇸🇬

📄 Sailor / StarCoder / OpenCoder
💼 Past: Research Scientist @SeaAIL; PhD @MSFTResearch
🧠 Contribution: @XlangNLP @BigCodeProject

Qian Liu @sivil_taram

4K Followers 743 Following Researcher @ TikTok 🇸🇬 📄 Sailor / StarCoder / OpenCoder 💼 Past: Research Scientist @SeaAIL; PhD @MSFTResearch 🧠 Contribution: @XlangNLP @BigCodeProject

Harpinder Jot Singh @singhhcoder

96 Followers 2K Following Building @qordinate_ai Past: AI @DevRev

Jun (Richard) Wang @AI_richard

35 Followers 1K Following

Morgan McGuire @morgymcg

3K Followers 4K Following Applied AI @weights_biases | ex-Facebook Safety | https://t.co/a7i7G5dkLG | 🇮🇪 | Came for the bants, stayed for the rants

Shital Shah @sytelus

13K Followers 11K Following Mostly research and code. If universe is an optimizer, what is its loss function? All opinions are my own.

Hamid Eghbalzadeh @heghbalz

3K Followers 6K Following AI Research , Opinions @ MyOwn.

Nathan Benaich @nathanbenaich

61K Followers 34K Following solo member of investment staff @airstreet @airstreetpress @stateofaireport @raais

Tianyu Pang @TianyuPang1

1K Followers 311 Following 🇸🇬Research Scientist at Sea AI Lab @SeaGroup; 👨🏻‍🎓PhD/BS from @Tsinghua_Uni and ex-@MSFTResearch; 🛡️Trustworthy AI and Generative Models.

Changyu Chen @Cameron_Chann

252 Followers 244 Following PhD student @sgSMU. RL x LLMs. Previously @NTUsg, @ZJU_China Post-training for Sailor2

Zichen Liu @zzlccc

3K Followers 356 Following PhD student, RL believer @SeaAIL @NUSingapore | 💻 📄 🏸

RS Intern @AIatMeta • PhDing @UBC @VectorInst • Undergrad @imperialcollege • Reinforcement Learning, Self-Improving AI, Open-endedness

Jenny Zhang @jennyzhangzt

4K Followers 917 Following RS Intern @AIatMeta • PhDing @UBC @VectorInst • Undergrad @imperialcollege • Reinforcement Learning, Self-Improving AI, Open-endedness

Ishita Mediratta @ishitamed

690 Followers 1K Following 🤖

Matt Fontaine @tehqin17

462 Followers 242 Following roboticist PhD trained @ USC, Algorithms Live! host, ICPC judge, USACO coach https://t.co/WaIWptKeot

Jack Parker-Holder @jparkerholder

9K Followers 779 Following Co-lead of Genie 3 @GoogleDeepMind & Honorary Associate Professor @UCL_DARK. Dad (👶🐶), CFC fan, BJJ. Views are my own :)

Associate Professor @USC.
PhD from @CMU_Robotics, MS from @MIT, MEng from #UTokyo, BS from @NTUA. Previously worked at @SquareEnix in Tokyo.

Stefanos Nikolaidis @snikolaidis19

1K Followers 181 Following Associate Professor @USC. PhD from @CMU_Robotics, MS from @MIT, MEng from #UTokyo, BS from @NTUA. Previously worked at @SquareEnix in Tokyo.

Varun Bhatt @vbhatt_cs

71 Followers 60 Following PhD Student at the University of Southern California

Professor, CS, U. British Columbia. CIFAR AI Chair, Vector Institute. Sr. Advisor, DeepMind | ML, AI, deep RL, deep learning, AI-Generating Algorithms (AI-GAs)

Jeff Clune @jeffclune

29K Followers 431 Following Professor, CS, U. British Columbia. CIFAR AI Chair, Vector Institute. Sr. Advisor, DeepMind | ML, AI, deep RL, deep learning, AI-Generating Algorithms (AI-GAs)

Kai Wang @kaiwang_gua

963 Followers 340 Following Assistant Professor @ Georgia Tech CSE

Tim Rocktäschel @_rockt

39K Followers 2K Following Director and Open-Endedness Team Lead @GoogleDeepMind, Professor of AI @AI_UCL, PI @UCL_DARK, Fellow @ELLISforEurope.

Open-Endedness RS @GoogleDeepMind. Building for an unspecifiable world | Unsupervised Environment Design, Game&Decision Theory, RL, AIS. prev @CHAI_Berkeley

Michael Dennis @MichaelD1729

4K Followers 813 Following Open-Endedness RS @GoogleDeepMind. Building for an unspecifiable world | Unsupervised Environment Design, Game&Decision Theory, RL, AIS. prev @CHAI_Berkeley

Fei Fang @fangf07

3K Followers 180 Following Associate Professor, Carnegie Mellon University

@Harvard Professor & Director Ctr for Computation & Society @HCRCS
@GoogleDeepMind Principal Scientist & Director #AIforGood #AIforhealth #AIforConservation

Milind Tambe (moving ... @MilindTambe_AI

8K Followers 270 Following @Harvard Professor & Director Ctr for Computation & Society @HCRCS @GoogleDeepMind Principal Scientist & Director #AIforGood #AIforhealth #AIforConservation

Arunesh Sinha @aruneshsinha

147 Followers 114 Following Assistant Professor at Rutgers Business School, Interested in anything interesting!

Pradeep Varakantham @PradeepVarakan1

148 Followers 180 Following Professor of Computer Science, Lee Kuan Yew Fellow, School of Computing and Information Systems, Singapore Management University

Minqi Jiang @MinqiJiang

6K Followers 880 Following

Bob Wen @BobWen16

2 Followers 18 Following

Assistant Professor @SheffieldNLP @sheffielduni, working on Multimodal LLM, RAG, Misinformation Detection, Recommender Systems, and AI for Science.

Delvin Ce Zhang @delvincezhang

25 Followers 60 Following Assistant Professor @SheffieldNLP @sheffielduni, working on Multimodal LLM, RAG, Misinformation Detection, Recommender Systems, and AI for Science.

Hugh Lee @HughLee65444358

4 Followers 26 Following

Chelsea Finn @chelseabfinn

82K Followers 399 Following Asst Prof of CS & EE @Stanford Co-founder of Physical Intelligence @physical_int PhD from @Berkeley_EECS, EECS BS from @MIT

No recent Favorites. New Favorites will appear here.