Linh Le @linhlpv
PhD student at A2I2 Reinforcement Learning, Adaptation and Generalization linhlpv.github.io Joined December 2015
200 Tweets 70 Followers 462 Following 2K Likes
I was happy to give a more technical talk on how we might create an AI at RLC-2025 and AGI-2025 (video below). The Oak Architecture: A Vision of Super-Intelligence from Experience As AI has become a huge industry, to an extent it has lost its way. What is needed to get us back on…
Finding an ML summer school has never been easier Here is a GitHub repo with a comprehensive list, with 50+ ML summer (and winter) schools all over the world (link in comments) Some of them are even free, few even offer scholarship so you don't have to pay absolutely anything
🚀 I'm excited to share our new paper: SegDAC: Segmentation-Driven Actor-Critic for Visual Reinforcement Learning 🧠 SegDAC combines large vision models with online RL to reason about its environment at the object and sub-object level, avoiding noisy pixel-level reasoning. 🛠️…
Humanoids finally move like humans… and can do more than copy. [Details + demos in thread 👇] A new framework, BeyondMimic, shows how to learn naturalistic whole-body control from human motion. But then goes further by composing those skills into versatile, zero-shot…
At what point does perf optimization get ridiculous. During my PhD, everything was 500-5000 sps. Then I got 10k and was very proud. Then 100k in early versions of PufferLib. Then 1M in 2.0... and now we're at up to 6M productive SPS on some RL envs
Fine-tuning pre-trained robotic models with online RL requires a way to train RL with expressive policies Can we design an effective method for this? We propose EXPO, a sample-efficient online RL algorithm that enables stable fine-tuning of expressive policy classes (1/6)
✨Introducing SENSEI✨ We bring semantically meaningful exploration to model-based RL using VLMs. With intrinsic rewards for novel yet useful behaviors, SENSEI showcases strong exploration in MiniHack, Pokémon Red & Robodesk. Accepted at ICML 2025🎉 Joint work with @cgumbsch 🧵
missing ICML, and I used this week to write my first technical blog on some recent thoughts on two different roles of simulators in RL and the confusions/misconceptions around them. Comments welcome! nanjiang.cs.illinois.edu/2025/07/16/sim…
The paper: arxiv.org/abs/2502.03349
Everyone knows action chunking is great for imitation learning. It turns out that we can extend its success to RL to better leverage prior data for improved exploration and online sample efficiency! colinqiyangli.github.io/qc/ The recipe to achieve this is incredibly simple. 🧵 1/N
Warm-start RL (WSRL) can learn to control a real robot in under 20 minutes! Deep RL is getting really fast. Warm-start from offline data + super-efficient online learning is increasingly making real world RL not just practical but pretty easy.
You don't _need_ a PhD (or any qualification) to do almost anything. A PhD is a rare opportunity to grow as an independent thinker in an academic environment, rather than immediately becoming a gear in a corporate agenda. It's definitely not for everyone!
📢📢 "Align Your Flow: Scaling Continuous-Time Flow Map Distillation" New flow map framework for state-of-the-art few-step generation, w/ the amazing @amsabour and @FidlerSanja. 🔥 Project page: research.nvidia.com/labs/toronto-a… 📜 Paper: arxiv.org/abs/2506.14603 🧵Thread below... (1/n)
🚨 Excited to share our new work: "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning"! 📈 We propose gradient interventions that enable stable, scalable learning, achieving significant performance gains across agents and environments! Details below 👇
Self-supervised representation learning looks a bit like RL. What if we literally use RL as a SSL method for visual representations? Turns out that it works quite well. In new work by @its_dibya, we show how this can be done: dibyaghosh.com/annotation_boo…
Normally, changing robot policy behavior means changing its weights or relying on a goal-conditioned policy. What if there was another way? Check out DynaGuide, a novel policy steering approach that works on any pretrained diffusion policy. dynaguide.github.io 🧵
Our view on test-time scaling has been to train models to discover algos that enable them to solve harder problems. @setlur_amrith & @matthewyryang's new work e3 shows how RL done with this view produces best <2B LLM on math that extrapolates beyond training budget. 🧵⬇️…
1/ How should RL agents prepare to solve new tasks? While prior methods often learn a model that predicts the immediate next observation, we build a model that predicts many steps into the future, conditioning on different user intentions: chongyi-zheng.github.io/infom.
Hierarchical methods for offline goal-conditioned RL (GCRL) can scale to very distant goals that stymie flat (non-hierarchical) policies — but are they really necessary? Paper: arxiv.org/abs/2505.14975 Project page: johnlyzhou.github.io/saw/ Code: github.com/johnlyzhou/saw Thread ↓
Phaidra is hiring a Research Scientist to work on sequential decision-making problems. I'm at the RLDM conference in Dublin this week. If you're attending and would like to learn more about the role or the company, feel free to reach out! job-boards.greenhouse.io/phaidra/jobs/4…

Ziyan "Ray" Luo @RLC'... @RayZiyan41307
73 Followers 168 Following Abstraction & RL / Ph.D. @Mila_Quebec, @mcgillu with @XujieSi & Doina Precup / Music: @SunsetRay_Ra / https://t.co/im1jR2Vend
Josephine Howe @howe_josep16896
79 Followers 4K Following
Iefohwe @Iefohwe1009054
125 Followers 3K Following
jessica🩶 @ds_jessica_
14K Followers 11K Following analytics lead & angel investor & advisor. always learning = business & innovation. doing #datascience
Théo Vincent @RLC @Theo_Vincent_
318 Followers 436 Following PhD student at @DFKI & @ias_tudarmstadt, working on RL 🤖 Previously master student at MVA @ENS_ParisSaclay & ENPC 🎓
Clarisse Wibault @ClarisseWibault
24 Followers 54 Following PhD Student @UniofOxford @FLAIR_Ox | Supervised by @maosbot @j_foerst |
Johan Obando-Ceron ... @johanobandoc
2K Followers 4K Following Graduate student @Mila_Quebec @UMontrealDIRO | RL/Deep Learning/AI | De Cali/Colombia pal’ Mundo 🇨🇴 | #JuntosProsperamos⚡#TogetherWeThrive| 🌱🌎
John Zhou @johnlyzhou
108 Followers 261 Following PhD student @UCLA, previously @Columbia | Scalable reinforcement learning
Geonwoo Cho @GeonwooC51050
12 Followers 104 Following Reinforcement Learning | CS Undergrad Ex Match Group / HyperConnect Machine Learning Software Engineer
Zhaolin Gao @GaoZhaolin
138 Followers 116 Following CS PhD Student @Cornell & @cornell_tech | GenAI Intern @Meta https://t.co/9mVl01Ilui
Nacho Mellado @uavster
3K Followers 763 Following Building your companion robot in public: https://t.co/pDyfICPowG Formerly Google X, Apple, https://t.co/CaT9ffzG6r,@PickNikRobotics, demoscene.
Laurence Feil @LaurenceFe87679
80 Followers 4K Following
TRUMP SUPPORTER ... @_TRUMP2025_
57 Followers 520 Following MAKE/ AMERICAN /GREAT/TRUMP 2016/TRUMP 2020/TRUMP https://t.co/Nl8u4TPjtf
PoppyThoreau @65oOMKs588d3x4
115 Followers 2K Following
Kory Mathewson @korymath
11K Followers 4K Following @GoogleDeepMind working on Veo + Flow -- getting great generative AI into the hands of great creative people
Shivam Vats @ShivaamVats
721 Followers 514 Following Postdoc @BrownBigAI Previously: PhD @CMU_Robotics, Maths @IITKgp, Core developer @SymPy
David van Dijk @david_van_dijk
5K Followers 4K Following Assistant Professor @Yale @YaleMed @YaleCSDept | ML/AI comp bio
Joe Mayo @JoeMayo
16K Followers 7K Following Author and Independent Consultant Recent books: - Programming the Microsoft Bot Framework/MSPress - C# Cookbook/O'Reilly Agents, AI, Generative AI, MCP, RAG
Evelyn @omara_evelyn55
395 Followers 3K Following
Alessandro Montenegro @montenegronwski
60 Followers 114 Following 💡PhD Student @polimi | 🤖Reinforcement Learning @rl3polimi | 📍Made in Italy, Rome.
Zhaochen Su @SuZhaochen0110
340 Followers 711 Following LLM/LVLM Knowledge & Reasoning | Incoming Ph.D. Student @hkust @hkustnlp | Previous Shanghai AI Lab.
R. Alessio @ BU @rssalessio
101 Followers 238 Following Postdoc at Boston University with Aldo Pacchiano (PLAIA Lab). Interested in RL, Bandit problems and Adaptive Control.
Laixi Shi @ShiLaixi
396 Followers 242 Following RL with uncertainty foundation; Assistant Professor in JHU ECE&DSAI @JohnsHopkins; Postdoc in @Caltech; Ph.D. in CMU (@CMU_ECE); BEng in Tsinghua @Tsinghua_Uni
Arip @machinestein
1K Followers 834 Following
Abhishek Sharma @sharma_abhishek
425 Followers 1K Following PhD Candidate @ Harvard SEAS. Research in Reinforcement Learning and Probabilistic ML
Mirco Mutti @mirco_mutti
631 Followers 641 Following Postdoc @TechnionLive. PhD from @polimi. Reinforcement learning, but without rewards.
Yinglun Zhu @yinglun122
360 Followers 404 Following Assistant Prof @UCRiverside. PhD @WisconsinCS. Researching Efficient ML, RL, and LLMs.
Lucas Alegre @lnalegre
164 Followers 442 Following Professor at @INF_UFRGS. Interested in multi-task and multi-objective reinforcement learning.
Haimin Hu @HaiminHu
533 Followers 304 Following Incoming Assistant Professor @JHUCompSci | PhD @Princeton ECE | MSE @Penn @GRASPlab | BEng @ShanghaiTechUni. I like robots (when they are safe).
evo @evo_agent
17 Followers 45 Following
Academic Giant @APremierWriter
68 Followers 733 Following For due assignments, essays, classes or any other academic task, exams included, just HMU
Amir-massoud Farahman... @SoloGen
6K Followers 2K Following Goal: Understanding the computational and statistical principles required to design adaptive agents. Associate Prof @polymtl @Mila_Quebec 🇨🇦 #MahsaAmini
Hao Sun - RL @HolarisSun
893 Followers 960 Following RS @GoogleDeepMind. Prev. PhD @CambridgeUni, #MMLab, B.Phys. @PKU1898
Kyoung Whan Choe @kywch500
855 Followers 1K Following Robot Learning Engineer @ https://t.co/wcLx79rCuW
Zhiyong Wang @Zhiyong16403503
782 Followers 4K Following Ph.D. candidate at CUHK. Former Visiting Scholar at Cornell. Working on reinforcement learning and multi-armed bandits.
nissymori @nissymori1
190 Followers 432 Following PhD candidate @UTokyo_News_en (Sugiyama-Yokoya-Ishida lab) Reinforcement Learning (RL) JAX-based RL Game AI Slack Community Vista
Ignacio Carlucho @i_carlucho
178 Followers 671 Following Assistant Professor at @HeriotWattUni and @NRobotarium Working on Robotics and Reinforcement learning.
Hue @__lily_ng__
1 Followers 14 Following
Zixuan Huang @ZixuanHuang15
287 Followers 381 Following Intern at Amazon FAR. PhD @UMRobotics. Former MS @CMU_Robotics
Levi Lelis @levilelis
700 Followers 541 Following Artificial Intelligence Researcher - Associate Professor - University of Alberta - Canada CIFAR AI Chair (he/him, ele/dele).
Bram Grooten @BramGrooten
638 Followers 547 Following 4th-year PhD candidate @TUeindhoven 🇳🇱 Looking for postdoc/industry positions! Deep Learning, RL, Sparse NNs, Robotics. Ex: @UAlberta 🇨🇦 @SonyAI_global🇨🇭
Arash Tavakoli @arshtvk
833 Followers 507 Following Reinforcement Learning, Staff Research Scientist @RiotGames. Spent time @MPI_IS, @ImperialCollege, @UCL, @USC, @GeorgiaTech, @Microsoft (MSR), @UAlberta (RLAI).
Thao Nguyen @thao_nguyen26
1K Followers 307 Following PhD student @uwcse & visiting researcher @AIatMeta. Formerly @GoogleAI Resident, @Stanford'19, @twosigma.
Daphne Cornelisse @daphne_cor
1K Followers 553 Following Ph.D. student @nyuniversity • Building human-like agents 🦋 https://t.co/BhKiCutsdY
Alexandre Brown ... @AlexandreBrown0
173 Followers 1K Following PhD student at @UMontreal and research @Mila_Quebec on robot learning
Jiaxun Cui 🐿️ @cuijiaxun
676 Followers 761 Following Ph.D. @utlarg @UTAustin 🤘 | Multi-agent Reinforcement Learning | Undergrad SJTU @sjtu1896 | Research Intern FAIR Labs @AIatMeta, Robert Bosch, Tencent AI Labs
Rui Shu @_smileyball
3K Followers 428 Following I draw smileyball https://t.co/VZJD2Av8PY Writing organic artisanal handcrafted code @OpenAI Previously doing the same @Stanford
Ziyan "Ray" Luo @RLC'... @RayZiyan41307
73 Followers 168 Following Abstraction & RL / Ph.D. @Mila_Quebec, @mcgillu with @XujieSi & Doina Precup / Music: @SunsetRay_Ra / https://t.co/im1jR2Vend
Qian Huang @qhwang3
14K Followers 331 Following prev @xai | CS PhD student @StanfordAILab (on leave)
Keerthana Gopalakrish... @keerthanpg
17K Followers 1K Following Mother of robots. Building Embodied AGI @DeepMind. Author of "AI for Robotics". Opinions my own.
Edward Grefenstette @egrefen
42K Followers 865 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.
Tabitha Edith Lee @TabulaRobot
930 Followers 592 Following Postdoc at @UMontreal & @Mila_Quebec in causal learning for robots and embodied AI. Prior stops at @CMU_Robotics, @nvidia, LM Space ATC, & Uber ATG.
Liliang Ren @liliang_ren
4K Followers 573 Following Senior Researcher at Microsoft GenAI | UIUC CS PhD graduate | Efficient LLM | NLP | Former Intern @MSFTResearch @Azure @AmazonScience
Christian Gumbsch @cgumbsch
191 Followers 169 Following Postdoc @UvA_Amsterdam | world models and sensorimotor abstractions |👾🤖🧠
Shuran Song @SongShuran
12K Followers 520 Following Assistant Professor @Stanford University working on #Robotics #AI #ComputerVision
Annie Chen @_anniechen_
1K Followers 406 Following PhD student @StanfordAILab. Prev: research @GoogleDeepMind, Stanford BS/MS
Nitish ⚡️ @nitishmutha
4K Followers 348 Following Co-founder and CTO @GenieAI - Building the world’s best AI Legal Drafter. @UCL alum.
Johan Obando-Ceron ... @johanobandoc
2K Followers 4K Following Graduate student @Mila_Quebec @UMontrealDIRO | RL/Deep Learning/AI | De Cali/Colombia pal’ Mundo 🇨🇴 | #JuntosProsperamos⚡#TogetherWeThrive| 🌱🌎
Kevin Ellis @ellisk_kellis
2K Followers 176 Following Cornell Computer Science, Assistant Professor. Program synthesis, AI
John Zhou @johnlyzhou
108 Followers 261 Following PhD student @UCLA, previously @Columbia | Scalable reinforcement learning
Geonwoo Cho @GeonwooC51050
12 Followers 104 Following Reinforcement Learning | CS Undergrad Ex Match Group / HyperConnect Machine Learning Software Engineer
Núria Armengol @NriaArmengol2
124 Followers 201 Following ETH/CLS PhD candidate focused on reinforcement learning and sports lover.
Eric Rosen @_ericrosen
1K Followers 608 Following Robotics Research Scientist @ Robotics and AI Institute (RAI) | Making robots smarter for everyone | CS PhD from @BrownUniversity 🤖
Robotic Systems Lab @leggedrobotics
16K Followers 173 Following The Robotic Systems Lab designs machines, creates actuation principles, and builds up control technologies for autonomous operation in challenging environments.
Sumeet Batra @SumeetBt
288 Followers 148 Following 5th year PhD Candidate at USC . Interested in robotics and generalist embodied agents. Inspired by neuroscience. Prev. 2X research intern at NVIDIA.
IEEE ICRA @ieee_ras_icra
12K Followers 82 Following #ICRA2025 IEEE International Conference on Robotics & Automation 19–23 May, Atlanta, USA
Georg Martius @GMartius
2K Followers 203 Following Researcher, interested in autonomous machine learning, reinforcement learning, robotics, 3d printing and more
Chris Paxton @chris_j_paxton
19K Followers 3K Following Mostly posting about robots. currently AI @agilityrobotics prev embodied AI @AIatMeta, @NVIDIAAI. All views my own. writing: https://t.co/iNLA4djfZo
Michael Black @Michael_J_Black
84K Followers 702 Following Director, Max Planck Institute for Intelligent Systems (@MPI_IS). Chief Scientist @meshcapade. Building 3D digital humans using vision, graphics, and learning.
ECML PKDD @ECMLPKDD
3K Followers 101 Following Official Twitter account of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. BlueSky: @ecmlpkdd.org
Ahmad Beirami @abeirami
10K Followers 4K Following sth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
Fan Nie @FanNie1208
753 Followers 354 Following AI @Stanford | Prev. @EPFL @SJTU1886 |Research in Reliable AI & Large Language Models
Hailey Nguyen @hailey_huong
241 Followers 215 Following Untangling the complexities of LLM alignment. Researcher @AIatMeta 🦙🦙🦙
Grace Liu @GraceLiu78
46 Followers 3 Following
Jiaxin Shi @thjashin
4K Followers 348 Following Research Scientist @GoogleDeepMind | prev @Stanford @MSRNE @VectorInst @RIKEN_AIP_EN @Tsinghua_Uni. Building probabilistic & algorithmic models for learning.
Luisa Zintgraf @luisa_zintgraf
5K Followers 501 Following Senior Research Scientist in the RL team @googledeepmind. PhD from @UniofOxford.
Chuang Gan @gan_chuang
9K Followers 484 Following Faculty Member at UMass Amherst; Principal researcher at MIT-IBM Watson AI Lab; Homepage: https://t.co/Pc8WeREfTz
Jacob Beck @jakeABeck
346 Followers 106 Following Let’s get agents to learn fast! 🤖🔥 Research Scientist @Oracle | PhD @UniOfOxford, MS & BS @BrownUniversity, Predoc @Microsoft
Lily Xu @lilyxu0
3K Followers 1K Following AI & decision making for planetary health. Postdoc @UniofOxford, incoming prof @Columbia IEOR, PhD @Harvard. Bridging research and practice through @EAAMO_ORG.
Kevin Wang @kevin_wang3290
239 Followers 177 Following CS @Princeton '22–'25, research Princeton RL + @princeton_nlp | prev quant intern @citsecurities
Kory Mathewson @korymath
11K Followers 4K Following @GoogleDeepMind working on Veo + Flow -- getting great generative AI into the hands of great creative people
Swaroop Mishra @Swarooprm7
13K Followers 813 Following MTS @Microsoft AI, Prev: RS @GoogleDeepMind (Gemini). Opinions my own.
JHU Computer Science @JHUCompSci
2K Followers 722 Following A diverse and collaborative community on the cutting edge of computing and technology within @HopkinsEngineer at @JohnsHopkins. https://t.co/3hwXFTdGyw