J.Nathan Yan @NathanYan2012
Research Scientist @GoogleDeepMind and Ph.D. @CornellCIS/@cornell_tech. I bake my own opinions. nathanyanjing.github.io New York Joined July 2014-
Tweets273
-
Followers856
-
Following2K
-
Likes670
It is the time to rethink the role of tokenization, and to tailor/develop the right model architecture to support the token-free models! Albert again did it!
It is the time to rethink the role of tokenization, and to tailor/develop the right model architecture to support the token-free models! Albert again did it!
wowowow!!! this is super cool!!!
wowowow!!! this is super cool!!!
Thanks @9to5mac for summarizing our research on TARFlow/STARFlow! It is an exciting direction of reviving normalizing flow with modern scalable techniques… and more will come!
Thanks @9to5mac for summarizing our research on TARFlow/STARFlow! It is an exciting direction of reviving normalizing flow with modern scalable techniques… and more will come!
I will be attending #CVPR2025 and presenting our latest research at Apple MLR! Specifically, I will present our highlight poster--world consistent video diffusion (cvpr.thecvf.com/virtual/2025/p…), and three workshop invited talks which includes our recent preprint ★STARFlow★! (0/n)
I will be attending #CVPR2025 and presenting our latest research at Apple MLR! Specifically, I will present our highlight poster--world consistent video diffusion (cvpr.thecvf.com/virtual/2025/p…), and three workshop invited talks which includes our recent preprint ★STARFlow★! (0/n)
🔥Thrilled to share that I’ll be joining the Computer Science Department at NYU Shanghai as an Assistant Professor starting Fall 2025! @nyushanghai 🎯 I’ll be recruiting PhD students across the entire NYU network—including @nyushanghai, @nyutandon, and @NYU_Courant—to build…
I will be attending #ICLR2025 in person during Apr 24-28, and presenting our research: DART: Denoising Autoregressive Transformer 📌Fri 25 Apr 3 p.m. +08 — 5:30 p.m. +08 This is my first time visiting Singapore, and I am looking forward to chatting with old and new friends!
I will be attending #ICLR2025 in person during Apr 24-28, and presenting our research: DART: Denoising Autoregressive Transformer 📌Fri 25 Apr 3 p.m. +08 — 5:30 p.m. +08 This is my first time visiting Singapore, and I am looking forward to chatting with old and new friends!
Today, we're releasing a new paper – One-Minute Video Generation with Test-Time Training. We add TTT layers to a pre-trained Transformer and fine-tune it to generate one-minute Tom and Jerry cartoons with strong temporal consistency. Every video below is produced directly by…
Some personal news: I recently joined Cursor. Cursor is a small, ambitious team, and they’ve created my favorite AI systems. We’re now building frontier RL models at scale in real-world coding environments. Excited for how good coding is going to be.
How well do data-selection methods work for instruction-tuning at scale? Turns out, when you look at large, varied data pools, lots of recent methods lag behind simple baselines, and a simple embedding-based method (RDS) does best! More below ⬇️ (1/8)
I've uploaded the latest slides & beamer source code to github.com/sustcsonglin/l…. Hopefully this repository will help train an LLM that generates Beamer slides better than I do :)
I've uploaded the latest slides & beamer source code to github.com/sustcsonglin/l…. Hopefully this repository will help train an LLM that generates Beamer slides better than I do :)
Introducing the first open-source implementation of native sparse attention: github.com/fla-org/native…. Give it a spin and cook your NSA model! 🐳🐳🐳
🚀 Announcing ASAP: asap-seminar.github.io! A fully virtual seminar bridging theory, algorithms, and systems to tackle fundamental challenges in Transformers. Co-organized by @simran_s_arora @Xinyu2ML @HanGuo97 Our first speaker: @heyyalexwang on Test-time Regression
Got talked into giving a DeepSeek talk this afternoon simons.berkeley.edu/workshops/llms… Not sure I have anything new to say here! But good excuse for me to read all the blogs.
🚀Thrilled to share our paper "DART" has been accepted by #ICLR2025! Congrats to my amazing collaborators @YuyangW95 @YizheZhangNLP @QihangZhang0224 @zdhnarsil Navdeep Jaitly @jsusskin @zhaisf! Please also check the updated version with more results at arxiv.org/abs/2410.08159
🚀Thrilled to share our paper "DART" has been accepted by #ICLR2025! Congrats to my amazing collaborators @YuyangW95 @YizheZhangNLP @QihangZhang0224 @zdhnarsil Navdeep Jaitly @jsusskin @zhaisf! Please also check the updated version with more results at arxiv.org/abs/2410.08159
spent the last month building my own framework to train a diffusion model from scratch. it was hard almost like i just learned to cast an ancient spell that requires lots of mysterious steps and ingredients. for a long time i was trying, and nothing happened. but when it…
In this video, I'll be deriving and coding Flash Attention from scratch. No prior knowledge of CUDA or Triton is required. Link to the video: youtu.be/zy8ChVd_oTM All the code will be written in Python with Triton, but no prior knowledge of Triton is required. I'll also…
The most beautiful thing on LLM reasoning is that the thought process is generated in an autoregressive way, rather than relying on search (e.g. mcts) over the generation space, whether by a well-finetuned model or a carefully designed prompt.
How far is an LLM from not only understanding but also generating visually? Not very far! Introducing MetaMorph---a multimodal understanding and generation model. In MetaMorph, understanding and generation benefit each other. Very moderate generation data is needed to elicit…
Experience Gemini 2.0 Flash Thinking—the fast and transparent reasoning model that reveals its thought process in real-time! This breakthrough brings us one step closer to deeper, more reliable AI understanding. Try it now!
Experience Gemini 2.0 Flash Thinking—the fast and transparent reasoning model that reveals its thought process in real-time! This breakthrough brings us one step closer to deeper, more reliable AI understanding. Try it now!

Ilse @3b13OpL78Drq4w4
28 Followers 1K Following
Ybirxawl @Ybirxawl0791
1 Followers 308 Following
Ayuni Raphael @AyuniRapha23052
1 Followers 535 Following
Maribelle @Yherveam177472
42 Followers 2K Following Like to talk Do not hold any investment products
Patrick Drake @time8machine
17K Followers 6K Following Neurodivergent physics student with a keen interest in multisensory integration and emergent perception. Exploring research on a proposed ‘sixth sense’. Δ
Joseph Imperial @josephimperial_
2K Followers 5K Following UKRI PhD Candidate @ARTAIBath @bathnlp @UniofBath 🇬🇧. Technical alignment, compliance, and AI safety. Research Faculty @NationalUPhil 🇵🇭. He/him.
WeiCUI6 @Cui6Wei
34 Followers 753 Following Systems Software Engineer @NVIDIA. Prev @UofT @UCLA @KITE_UHN @Tesla @Samsung @Apple. Working on @NVIDIAGFN
Zijie Huang@ACL2025(v... @HuangZi71008374
1K Followers 703 Following Research Scientist @GoogleDeepMind. Prev @UCLA @SJTU1896 @Amazon @Nvidia @Netflix; Work on #LLM, #AI4Science, #GraphML.
Etooirjar @Etooirjar30799
19 Followers 496 Following
Sanjid Hasan Al Rifat @SanjidHRifat
38 Followers 579 Following Working on clean energy in Bangladesh for Global. Co-Founder @ ZEROOZEN
Sinuo @TithueserhKjDm
43 Followers 806 Following Girls who love to laugh will never have bad luck. I also hope to meet my prince charming.
Eva Louise Marie Gabr... @e681554349
11 Followers 7K Following
Ayush Sharma @tyayush
62 Followers 625 Following Founder @varvya, Prev: @relayersoftware, @mostli. Art, Creativity, Design, and Learning!
Trevor Loy @trevorloy
20K Followers 2K Following VC @FlywheelVC. Lecturer, entrep mgmt fin & VC @Stanford. Expert witness. Prev: @NVCA @KauffmanFellows @Intel & 3x founder. I am "trevorloy" on all other apps.
Yucheng Lu @_yucheng_lu
403 Followers 798 Following Assistant Professor @nyushanghai in ML Systems. Prev @togethercompute.
ChristineLynd @8A8dPGA4nmOj3
108 Followers 4K Following Heart full of poetry & pockets full of seeds 📜🌱
Namau @Namau38084
24 Followers 311 Following
Shumo Chu @shumochu
6K Followers 825 Following brewing a stealth mode AI startup. ex prof. @UCSBCS, ph.d. @UWCSE, eng. @Google
Zefeng DU @seele_du
29 Followers 115 Following
Kavan Fatehi @FatehiKavan
317 Followers 181 Following PhD in Computer Science @ University of Nottingham
Lyndon_L @Lyndon2042
21 Followers 142 Following PhD student. CEGE @UMNews, Stats & Math & CS @Columbia. Bayesian statistics, applied causality (causal generative model), machine learning.
Tearleigh @Tearleigh2JX
46 Followers 5K Following
Manish Shetty @slimshetty_
1K Followers 767 Following PhD @UCBerkeley | AI4Code & Evals | Projects: GSO, R2E, Syzygy, LMArena RepoChat, AIOpsLab | prev @googledeepmind @msftresearch
abderrahim zine @abderrahimzine6
42 Followers 3K Following
ADAM @noadm19
114 Followers 8K Following
codergoose @codergoose
7 Followers 683 Following
Steven (Shaobo) Wang ... @ShaoboWang6
385 Followers 1K Following Ph.D Candidate @sjtu1896, Intern @yaledatascience and @Alibaba_Qwen. Exploring Data-Centric AI on Foundation Models.
JAEHYEONG_KIM @jhk40160806
1K Followers 7K Following I'm a technical imagineer: If MyBrain ideas + great Scientists meet, I & Scientists make New things, if MyBrain ideas can link(implant) Neuro into AI quantum 📩
Researcher @ai_science_
0 Followers 53 Following
Mengdi Xu @mengdixu_
2K Followers 936 Following Postdoc @StanfordSVL. Learning and Robotics. Ph.D. @CarnegieMellon. Prev. @GoogleDeepMind @MITIBMLab @Tsinghua_Uni.
Ningyu Zhang@ZJU @zxlzr
3K Followers 2K Following Associate Professor @ZJU_China. Research interests include NLP, LLM, KG, Agent, Knowledge Editing.
Jiajing Guo @jiajing_guo
659 Followers 727 Following Research Engineer @boschusa. HCI, intelligent interface, human-AI collaboration, @CornellInfoSci. Yoga and illustration.
Saksham Suri @_sakshams_
781 Followers 641 Following Research Scientist @AiatMeta. Previously PhD @UMDCS, @MetaAI, @AmazonScience, @USCViterbi, @IIITDelhi, @IBMResearch. #computervision #deeplearning
Charlie Snell @sea_snell
8K Followers 6K Following PhD student @berkeley_ai; research @cursor_ai; prev @GoogleDeepMind. My friend told me to tweet more. I stare at my computer a lot and make things
Sishoosh @SishooshF2E9zz
14 Followers 635 Following
Sukjun (June) Hwang @sukjun_hwang
3K Followers 300 Following ML PhD student @mldcmu advised by @_albertgu
Yoav Artzi @yoavartzi
17K Followers 183 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaC and @COLM_conf
Sebastian Riedel (@ri... @riedelcastro
17K Followers 460 Following Researcher in NLP/ML @deepmind, @ucl_nlp, @[email protected] on Mastodon
Yucheng Lu @_yucheng_lu
403 Followers 798 Following Assistant Professor @nyushanghai in ML Systems. Prev @togethercompute.
Yufei Wang @YufeiWang25
648 Followers 233 Following PhD in Robotics. Robot Learning. Robotics Institute, CMU.
Danfei Xu @danfei_xu
8K Followers 1K Following Faculty at Georgia Tech @ICatGT, researcher at @NVIDIAAI | Ph.D. @StanfordAILab | Making robots smarter | all opinions are my own
Wenxuan Zhou @Wenxuan_Zhou
3K Followers 330 Following Gemini Post-training @DeepMind. Prev: Research Scientist @Meta GenAI; Ph.D. in Robotics @CarnegieMellon;
Fei Xia @xf1280
9K Followers 761 Following Staff Research Scientist, TLM at @GoogleDeepMind, ✨♊, Gemini & Robotics, PhD from @StanfordAILab @StanfordSVL, previously @Tsinghua_Uni. #AGI through Embodiment
Jacob Austin @jacobaustin132
7K Followers 917 Following Research at @GoogleDeepMind. Currently making LLMs go fast. I also play piano and climb. NYC. Opinions my own
Manish Shetty @slimshetty_
1K Followers 767 Following PhD @UCBerkeley | AI4Code & Evals | Projects: GSO, R2E, Syzygy, LMArena RepoChat, AIOpsLab | prev @googledeepmind @msftresearch
John Schulman @johnschulman2
65K Followers 1K Following Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music
Yixin Zou @yixinzouu
2K Followers 1K Following Tenure-track faculty @maxplanckpress #MPI_SP | PhD @umsi | HCI, privacy, security | 🐈 mom of Lokum and Nala
Mengdi Xu @mengdixu_
2K Followers 936 Following Postdoc @StanfordSVL. Learning and Robotics. Ph.D. @CarnegieMellon. Prev. @GoogleDeepMind @MITIBMLab @Tsinghua_Uni.
Charlie Snell @sea_snell
8K Followers 6K Following PhD student @berkeley_ai; research @cursor_ai; prev @GoogleDeepMind. My friend told me to tweet more. I stare at my computer a lot and make things
Hank Couture @HankCouture
6K Followers 6K Following Investing in Students, Grads, Dropouts @_LeapYear_ . Former VP DoorDash. COO Fanatics
Sukjun (June) Hwang @sukjun_hwang
3K Followers 300 Following ML PhD student @mldcmu advised by @_albertgu
clem 🤗 @ClementDelangue
155K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
Hongjie Wang @HongjieWang3
323 Followers 288 Following PhD candidate @ Princeton University | Research Scientist Intern @ Meta GenAI | ex Research Scientist Intern @ Adobe | Efficient ML | Image & Video Generation
Kuan Fang @KuanFang
3K Followers 783 Following Assistant Professor @Cornell CS | robotics, machine learning, computer vision
Jiuhai Chen @JiuhaiC
678 Followers 2K Following CS Phd student @ UMD Ex-intern @Meta @Microsoft @Amazon On the industry job market
Debajyoti (Debo) Datt... @debo_datta_
303 Followers 2K Following Co-Founder @HippocraticAI | PhD @UVa | Ex Amazon (AWS) Interests: Differential Geometry, Tensor Decomposition, Large Language Models, Healthcare
Orion Weller @orionweller
2K Followers 947 Following PhD student @jhuclsp interning @AIatMeta FAIR. Prev intern @GoogleDeepMind, @samaya_ai, @allen_ai. Research: LLMs, RAG, and IR
Naoto Usuyama @naotous
1K Followers 704 Following Principal Researcher @Microsoft | AI for Health 🧬🔬 🎾 | Tokyo 🇯🇵 → Seattle 🏞️
Junheng Hao @Jeff_Haojh
193 Followers 430 Following Researcher at @Microsoft GenAI. PhD@UCLA. Ex-intern @MSFTResearch @IBMResearch @AmazonScience @NECLabsAmerica.
Han Shao @HanShao16
638 Followers 264 Following Assistant Professor @umdcs. Prev: Postdoc @Harvard, PhD @TTIC_Connect. Interested in machine learning theory problems.
Junxiong Wang @_junxiong_wang
1 Followers 3 Following PhD @CornellCIS, work on large language models
EMNLP 2025 @emnlpmeeting
15K Followers 50 Following EMNLP 2025 - The 2025 Conference on Empirical Methods in Natural Language Processing, 2025 Hashtag: #EMNLP2025 Dates: November 5-9 Submission Deadline: May 19th
The Nobel Prize @NobelPrize
1.2M Followers 497 Following The official feed of the Nobel Prize @NobelPrize #NobelPrize
Aishwarya Kamath @ashkamath20
8K Followers 622 Following Senior Research Scientist @GoogleDeepMind on the Gemini team. Multimodal Lead on Gemma 3. PhD at NYU with @ylecun Masters at UMass Amherst
Sameer Singh @sameer_
7K Followers 2K Following Cofounder/CTO @SpiffyAI and Prof at @UCIrvine, working on reliable LLMs, explanations for AI+ML, adversaries for NLP, and debugging/evaluation.
Daniel Fried @dan_fried
4K Followers 864 Following Assistant prof. @LTIatCMU @SCSatCMU. Working on NLP: language interfaces, applied pragmatics, language-to-code, grounding.
Hanlin Li @hanlinliii
921 Followers 498 Following She/her. assist prof @UTAustin. Social and economic impact of data, HCI, CSS
Wei Hu @weihu_
2K Followers 1K Following Assistant professor @UMichCSE @UMich; previously @SimonsInstitute @UCBerkeley @Princeton @Tsinghua_Uni. Theoretical and scientific foundations of deep learning.
YYYao @YYYao45
34 Followers 162 Following
Jiaxin Lu @jacinth_lu
295 Followers 367 Following PhD student @UTCompSci | previously ACM Class 2018, SJTU | film lover | she/her
Yihao Xue @xue_yihao65785
442 Followers 509 Following PhD @UCLA | LLM Reasoning · Safety · Robustness · Multimodal & Self-Supervised Learning | OpenAI Fellow | @GoogleResearch | Prev: @MITIBMLab · @Cisco Research
Simon Shaolei Du @SimonShaoleiDu
9K Followers 2K Following Assistant Professor @uwcse. Postdoc @the_IAS. PhD in machine learning @mldcmu.
Zhaoran Wang @zhaoran_wang
4K Followers 1K Following Associate Professor @NorthwesternU | PhD @Princeton | studying Reinforcement Learning
Nan Jiang @nanjiang_cs
10K Followers 73 Following machine learning researcher, with focus on reinforcement learning. assoc prof @ uiuc cs. Course on RL theory (w/ videos): https://t.co/vqVKwY4RJE
Haoxiang Wang @Haoxiang__Wang
1K Followers 941 Following NVIDIA Research Scientist. PhD from UIUC. Past intern at Apple/Amazon/Waymo.
Minchan Jeong @mc_jeong
112 Followers 197 Following Currently at @kaist_ai for Ph.D course. B.S at physics and mathematics in SNU.
Jian Zhao @jeffjianzhao
1K Followers 517 Following Prof@UWaterloo - Design, code & research; Make your data live! https://t.co/A0C3D0IOxS