Jaehoon Lee @hoonkp
Researcher in machine learning with background in physics; Member of Technical Staff @AnthropicAI; Prev. Research scientist @GoogleDeepMind/@GoogleBrain. jaehlee.github.io San Francisco Bay Area, CA Joined November 2009-
Tweets244
-
Followers1K
-
Following666
-
Likes224
Claude 4 models are here 🎉 From research to engineering, safety to product - this launch showcases what's possible when the entire Anthropic team comes together. Honored to be part of this journey! Claude has been transforming my daily workflow, hope it does the same for you!
Claude 4 models are here 🎉 From research to engineering, safety to product - this launch showcases what's possible when the entire Anthropic team comes together. Honored to be part of this journey! Claude has been transforming my daily workflow, hope it does the same for you!
@ethansdyer and I have started a new team at @AnthropicAI — and we’re hiring! Our team is organized around the north star goal of building an AI scientist: a system capable of solving the long-term reasoning challenges and core capabilities needed to push the scientific…
Tour de force led by @_katieeverett investigating the interplay between neural network parameterization and optimizers; the thread/paper includes lot of gems (theory insight, extensive empirics, and cool new tricks)!
It was a pleasure working on Gemma 2. The team is relatively small but very capable. Glad to see it get released. On the origin of techniques: 'like Grok', 'like Mistral', etc. is a weird way to describe them as they all originated at Google Brain/DeepMind and the way they ended…
It was a pleasure working on Gemma 2. The team is relatively small but very capable. Glad to see it get released. On the origin of techniques: 'like Grok', 'like Mistral', etc. is a weird way to describe them as they all originated at Google Brain/DeepMind and the way they ended…
We recently open-sourced a relatively minimal implementation example of Transformer language model training in JAX, called NanoDO. If you stick to vanilla JAX components, the code is relatively straightforward to read -- the model file is <150 lines. We found it useful as a…
Ever wonder why we don’t train LLMs over highly compressed text? Turns out it’s hard to make it work. Check out our paper for some progress that we’re hoping others can build on. arxiv.org/abs/2404.03626 With @blester125, @hoonkp, @alemi, Jeffrey Pennington, @ada_rob, @jaschasd
Is Kevin onto something? We found that LLMs can struggle to understand compressed text, unless you do some specific tricks. Check out arxiv.org/abs/2404.03626 and help @hoonkp, @alemi, Jeffrey Pennington, @ada_rob, @jaschasd, @noahconst and I make Kevin’s dream a reality.
This is an awesome opportunity to work with strong collaborators on an impactful science problem! Highly recommended!
This is an awesome opportunity to work with strong collaborators on an impactful science problem! Highly recommended!
Analyzing training instabilities in Transformers made more accessible by awesome work by @Mitchnw during his internship at @GoogleDeepMind! We encourage you to think more on understanding the fundamental cause and effect of training instabilities as the models scale up!
Analyzing training instabilities in Transformers made more accessible by awesome work by @Mitchnw during his internship at @GoogleDeepMind! We encourage you to think more on understanding the fundamental cause and effect of training instabilities as the models scale up!
This is amazing opportunity to work on impactful problems in Large Language Models with cool people! Highly recommended!
This is amazing opportunity to work on impactful problems in Large Language Models with cool people! Highly recommended!
Jasper @latentjasper talking about the ongoing journey towards BIG Gaussian processes! A team effort with @hoonkp, Ben Adlam, @shreyaspadhy and @zacharynado. Join us at NeurIPS GP workshop neurips.cc/virtual/2022/w…
Today at 11am CT, Hall J #806 we are presenting our paper on infinite width neural network kernels! We have methods to compute NTK/NNGP for extended set of activations + sketched embeddings for efficient approximation (100x) for compute intensive conv kernels! See you there!
Today at 11am CT, Hall J #806 we are presenting our paper on infinite width neural network kernels! We have methods to compute NTK/NNGP for extended set of activations + sketched embeddings for efficient approximation (100x) for compute intensive conv kernels! See you there!
Tired of tuning your neural network optimizer? Wish there was an optimizer that just worked? We’re excited to release VeLO 🚲, the first hyperparameter-free learned optimizer that outperforms hand-designed optimizers on real-world problems: velo-code.github.io 🧵
Very interesting paper by @jamiesully2, @danintheory and Alex Maloney investigating theoretical origin of neural scaling laws! Happy to read the 97p paper and learn about new tools in RMT and insights of how statistics of natural datasets are translated into power-law scaling.
Very interesting paper by @jamiesully2, @danintheory and Alex Maloney investigating theoretical origin of neural scaling laws! Happy to read the 97p paper and learn about new tools in RMT and insights of how statistics of natural datasets are translated into power-law scaling.
the deadline for applying to the OpenAI residency is tomorrow. if you are an engineer or researcher from any field who wants to start working on AI, please consider applying. many of our best people have come from this program! boards.greenhouse.io/openai/jobs/46… boards.greenhouse.io/openai/jobs/46…
🧮 I finally spent some time learning what exactly Neural Tangent Kernel (NTK) is and went through some mathematical proof. Hopefully after reading this, you will not feel all the math behind NTK is that scaring, but rather, quite intuitive. lilianweng.github.io/posts/2022-09-…
1/ Super excited to introduce #Minerva 🦉(goo.gle/3yGpTN7). Minerva was trained on math and science found on the web and can solve many multi-step quantitative reasoning problems.
1/ Super excited to introduce #Minerva 🦉(goo.gle/3yGpTN7). Minerva was trained on math and science found on the web and can solve many multi-step quantitative reasoning problems. https://t.co/0up7y13crm

Behnam Neyshabur @bneyshabur
29K Followers 858 Following Research @AnthropicAI (Co-lead Discovery team) 💼 Past: Gemini @GoogleDeepMind (Co-led Blueshift team) 🧠 LLM Reasoning / AI Scientist 🎒Traveling & Backpacking
Rosanne Liu @savvyRL
46K Followers 1K Following (On mat leave.) Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS.
Jascha Sohl-Dickstein @jaschasd
24K Followers 706 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.
Kyle Cranmer @KyleCranmer
17K Followers 3K Following Director Data Science Institute @UWMadison @datascience_uw. EiC @MLSTjournal. Physics, stats/ML/AI, open science. same handle @sigmoid.social and bsky
Natasha Jaques @natashajaques
30K Followers 1K Following Assistant Professor @uwcse and Staff Research Scientist at @GoogleAI. Let's get off this app: https://t.co/jbH2oAjbPN
Jeff Dean @JeffDean
365K Followers 6K Following Chief Scientist, Google DeepMind & Google Research. Gemini Lead. Opinions stated here are my own, not those of Google. TensorFlow, MapReduce, Bigtable, ...
Dimitris Papailiopoul... @DimitrisPapail
20K Followers 1K Following Researcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); learning in context; thinking about reasoning; babas of Inez Lily.
Jeremy Cohen @deepcohen
5K Followers 928 Following Research fellow at Flatiron Institute, working on understanding optimization in deep learning. Previously: PhD in machine learning at Carnegie Mellon.
Ben Poole @poolio
21K Followers 1K Following research scientist at google brain. phd in neural nonsense from stanford.
Jason Lee @jasondeanlee
18K Followers 4K Following Associate Professor at UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning.
Jeremy Howard @jeremyphoward
260K Followers 6K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Prev: professor @ UQ; Stanford fellow; @kaggle president; @fastmail/@enlitic/etc founder https://t.co/16UBFTX7mo
Sara Hooker @sarahookr
49K Followers 9K Following I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
Sam Power @sp_monte_carlo
19K Followers 7K Following Lecturer in Maths & Stats at Bristol. Interested in probabilistic + numerical computation, statistical modelling + inference. @OnlineMCSeminar. (he / him)
James Bradbury @jekbradbury
13K Followers 9K Following Compute at @AnthropicAI! Previously JAX, TPUs, and LLMs at Google, MetaMind/@SFResearch, @Stanford Linguistics, @Caixin.
trent e @_trente_
9K Followers 3K Following Building inference mods @concordanceai former @______jpg______ @yamfinance and misc defi things
Magnivel Internationa... @magnivel
110 Followers 3K Following Magnivel International is an Open Access publisher and international scientific conferences and expo Organizer.
Mahesh Raj @MaheshR53339165
5 Followers 484 Following
Koji 🐍 @cooosyku23
48 Followers 215 Following ソフトウェアエンジニア 💻 | AI信奉者 🧠 | 慶應義塾大学OB 🖋️ | アメリカ/イギリス帰国子女 🇺🇸🇬🇧
Icexief @Icexief6588309
28 Followers 1K Following
MMM @MMM1897775
9 Followers 3K Following
Brook Effertz @BrookEffer64936
76 Followers 3K Following
Zeyuan Allen-Zhu, Sc.... @ZeyuanAllenZhu
20K Followers 452 Following physics of language models @ Meta (FAIR, not GenAI) 🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS 🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
The Expensive Hooker @Trifenol
5K Followers 4K Following Mestrando em Matemática, Estatística e Computação na @usponline • Ciência de Dados @UFMSbr • Estudos de Mídia/@UFF_br • 🏳️🌈 • Capixaba #rstats
Mussa Kambi @MussaKambi77202
16 Followers 861 Following
Valentina Tardelli @ValentinaT32922
91 Followers 6K Following
Patrick Drake @time8machine
17K Followers 6K Following Neurodivergent physics student with a keen interest in multisensory integration and emergent perception. Exploring research on a proposed ‘sixth sense’. Δ
Gregor Bachmann @GregorBachmann1
372 Followers 384 Following I am a PhD student @ETH Zürich working on deep learning. MLP-pilled 💊. https://t.co/yWdDEV6Z15
Hassan Ahmed @HassanAhmedAI
100 Followers 2K Following AI/ML Engineer | AI x Healthcare | LLMs, NLP, Agents | Building HealthTech AI | Open to collab & remote roles LinkedIn ⬇ https://t.co/QPCmZMBPQ0
xyyzxxyzzxyy @xyyzxxyzzxyy
0 Followers 450 Following
arion das @ArionDas
834 Followers 8K Following gen ai intern @Techolution_com || research @ aiisc, usc || author @naacl || reviewer @aclmeeting, aia @COLM_conf
Beligrad @dentfenglin
8 Followers 221 Following
Pavel Izmailov @Pavel_Izmailov
8K Followers 1K Following Researcher @AnthropicAI 🤖 Incoming Assistant Professor @nyuniversity 🏙️ Previously @OpenAI #StopWar 🇺🇦
Advik Anand @goduchihaitachi
12 Followers 304 Following cofounder + ceo @ lumigenic therapeutics | building the future of agriculture
Joon Choi @JoonCho80234531
0 Followers 38 Following
Sleetew @SleetewBlm
34 Followers 1K Following
shaggydogai @shaggydogai
3 Followers 70 Following
Samira Daruki @SamiraDaruki
179 Followers 725 Following Learning and Training Gemini ♊, PreTraining 🤝 PostTraining RL, Science of Scaling, Model Design, Compute 🤝 Intelligence.
Arthur Douillard @Ar_Douillard
8K Followers 2K Following Distributed Learning @ deepmind | DiLoCo, DiPaCo. Continual Learning PhD @ Sorbonne
MB @mbitem
144 Followers 3K Following
Amy Lu @amyxlu
4K Followers 2K Following Senior Research Scientist @IsomorphicLabs building AI for drug discovery. Prev: CS PhD @berkeley_ai, @PrescientDesign, @insitro, @UofT. 🇨🇦
Anne Damilio @AnneDamilio
34 Followers 1K Following
김재오 @plain903
0 Followers 27 Following
Kiho Park @KihoPark_
440 Followers 218 Following @UChicago Stat PhD candidate advised by @victorveitch
Leynirth @LeynirthQgw
50 Followers 5K Following
Taishi Nakamura @Setuna7777_2
2K Followers 6K Following CS MS at @sciencetokyo_en Intern @SakanaAILabs
Peace Martins @PeaceMartins8
100 Followers 408 Following Believer ° God-Class || Global Citizen || Peace Advocate || 21st Century Think-Tank || Model United Nations Enthusiast
Yann LeCun @ylecun
949K Followers 764 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
AK @_akhaliq
425K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Google DeepMind @GoogleDeepMind
1.2M Followers 279 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
François Chollet @fchollet
572K Followers 813 Following Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.
Behnam Neyshabur @bneyshabur
29K Followers 858 Following Research @AnthropicAI (Co-lead Discovery team) 💼 Past: Gemini @GoogleDeepMind (Co-led Blueshift team) 🧠 LLM Reasoning / AI Scientist 🎒Traveling & Backpacking
Rosanne Liu @savvyRL
46K Followers 1K Following (On mat leave.) Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS.
Jim Fan @DrJimFan
325K Followers 3K Following NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
Sebastian Raschka @rasbt
355K Followers 1K Following ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
Percy Liang @percyliang
84K Followers 417 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Ferenc Huszár @fhuszar
42K Followers 1K Following Secular Bayesian. Professor of Machine Learning @Cambridge_CL. Talent aficionado at https://t.co/RbJkoLguey Alum of @Twitter, Magic Pony and @Balderton
Jascha Sohl-Dickstein @jaschasd
24K Followers 706 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.
Kyle Cranmer @KyleCranmer
17K Followers 3K Following Director Data Science Institute @UWMadison @datascience_uw. EiC @MLSTjournal. Physics, stats/ML/AI, open science. same handle @sigmoid.social and bsky
Natasha Jaques @natashajaques
30K Followers 1K Following Assistant Professor @uwcse and Staff Research Scientist at @GoogleAI. Let's get off this app: https://t.co/jbH2oAjbPN
The Founders' Tribune @foundertribune
18K Followers 3 Following A new Op-Ed platform for founders. One essay every Sunday.
Pavel Izmailov @Pavel_Izmailov
8K Followers 1K Following Researcher @AnthropicAI 🤖 Incoming Assistant Professor @nyuniversity 🏙️ Previously @OpenAI #StopWar 🇺🇦
will grathwohl @wgrathwohl
4K Followers 252 Following graduated from high school, college, and even grad school
Edgar Shaghoulian @eshaghoulian
2K Followers 148 Following Assistant professor of physics @UCSC. Interested in black holes and quantum cosmology.
Kevin Patrick Murphy @sirbayes
61K Followers 530 Following Research Scientist at Google DeepMind. Interested in Bayesian Machine Learning.
Sebastien Bubeck @SebastienBubeck
56K Followers 1K Following I work on AI at OpenAI. Former VP AI and Distinguished Scientist at Microsoft.
Brandon Amos @brandondamos
20K Followers 2K Following research scientist @MetaAI (FAIR) | optimization, machine learning, control, transport | PhD from @SCSatCMU
하이브레인넷 @HiBrainNet
4K Followers 353 Following 안녕하세요? 고급두뇌를 위한 네트워크 ! 하이브레인넷(http://t.co/8tZnlo9w5D)입니다. 트위터를 통해 교수/연구원 채용정보를 제공합니다. #하이브레인넷_ #hibrainnet 태그로 검색하세요~
Nat McAleese @__nmca__
14K Followers 353 Following Research @AnthropicAI. Previously @OpenAI, @DeepMind. Views my own.
DeepSeek @deepseek_ai
973K Followers 0 Following Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Arthur Douillard @Ar_Douillard
8K Followers 2K Following Distributed Learning @ deepmind | DiLoCo, DiPaCo. Continual Learning PhD @ Sorbonne
Jeremy Bernstein @jxbz
6K Followers 605 Following 🧪 @thinkymachines ✍️ anon feedback @ https://t.co/RIhBhjMRdD
Surya Bhupatiraju @suryabhupa
2K Followers 511 Following research engineer @GoogleDeepMind working on language models | previously CS @MIT
SSI Inc. @ssi
102K Followers 0 Following A straight shot to safe superintelligence. Join us https://t.co/hHla3vusDE.
Interesting things @awkwardgoogle
3.9M Followers 140 Following Posting interesting things from all corners of the internet.
Kevin Slagle @kjslag
123 Followers 228 Following AI Research Engineer @magicailabs. Former quantum physics professor @RiceECE
Brian Lester @blester125
458 Followers 242 Following Senior Research Engineer at Google Deep Mind working on parameter-efficient adaptation and few-shot generalization, mostly within NLP. View are my own. he/him
Ashutosh Mehra @ashutoshmehra
2K Followers 7K Following Senior Principal Scientist at Adobe. Working on Acrobat AI Assistant, LLMs, and document ML.
Adam Roberts @ada_rob
8K Followers 718 Following ai researcher & engineer @ Google DeepMind :: ♫ Co-lead Magenta & Lyria :: 📝 Foundational LLM work: TL for T5 & C4, core PaLM :: 👨💻 TL for T5x & SeqIO
Joshua Batson @thebasepoint
4K Followers 677 Following trying to understand evolved systems (🖥 and 🧬) interpretability research @anthropicai formerly @czbiohub, @mit math
Chanwoo Park @chanwoopark20
1K Followers 1K Following Games, Multi-agent (gen) AI | @speedrun SR003 | @mit EECS Ph.D. Candidate
Joshua Achiam @jachiam0
22K Followers 1K Following Freedom, flourishing, and abundance. Head of Mission Alignment at @openai. Main author of https://t.co/cKuSh21yaz
Logan Kilpatrick @OfficialLoganK
210K Followers 2K Following Lead product for @GoogleAIStudio + the Gemini API. My views!
Hiroki Naganuma @_Hiroki11x
991 Followers 789 Following PhD Candidate at @UMontreal ,@Mila_Quebec / HPC, Generalization, Large Scale Optimization / ex- @AIatMeta, @GoogleDeepMind, @MSFTResearch, @IBMResearch
Jason Weston @jaseweston
13K Followers 704 Following @MetaAI+NYU. NLP from scratch(Pretrain+FT LLM) 2008, MemNets (pre-Transformer) 2015, DrQA(pre-RAG) 2017, BlenderBot(dialog pre-ChatGPT) 2018+,Self-Reward+ more!
Zeyuan Allen-Zhu, Sc.... @ZeyuanAllenZhu
20K Followers 452 Following physics of language models @ Meta (FAIR, not GenAI) 🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS 🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
Chief AI Officer @chiefaioffice
36K Followers 1K Following Track the latest funding in AI → https://t.co/NlbhFKCLNf
골빈해커 @golbin
16K Followers 756 Following Code addict, AI/ML believer, 25+ years’ coding experienced start-up guy, Built a unicorn and building another big one!
Ludwig Schmidt @lschmidt3
6K Followers 426 Following Assistant professor at @Stanford and member of the technical staff at @AnthropicAI.
Katie Everett @_katieeverett
3K Followers 632 Following Machine learning researcher @GoogleDeepMind + PhD student @MIT. Opinions are my own.
Sharad Vikram @sharadvikram
2K Followers 585 Following Researcher @ Google Deepmind. I work on JAX + Pallas (https://t.co/lPMsq3yzgL) and Gemini. In the past I worked on Oryx and TFP. I like learning.
Horace He @cHHillee
39K Followers 534 Following @thinkymachines Formerly @PyTorch "My learning style is Horace twitter threads" - @typedfemale
Startup Archive @StartupArchive_
96K Followers 2 Following Archiving the world's best startup advice for future generations of founders | New project: @foundertribune
Almost Sure @Almost_Sure
8K Followers 233 Following George Lowther, Author of Almost Sure blog, on maths, probability and stochastic calculus. Also on YouTube https://t.co/VyOijwbe9l
TuringPost @TheTuringPost
76K Followers 13K Following Newsletter exploring AI&ML - AI 101, Agentic Workflow, Business insights. From ML history to AI trends. Led by @kseniase_ Know what you are talking about👇🏼
typedfemale @typedfemale
38K Followers 534 Following a really exciting new account "advanced pytorch user" - @cHHillee alt: @typedalt
Charlie Snell @sea_snell
8K Followers 6K Following PhD student @berkeley_ai; research @cursor_ai; prev @GoogleDeepMind. My friend told me to tweet more. I stare at my computer a lot and make things
Eugene Lee @egslee
782 Followers 195 Following Software Engineer at Google. I work on large scale and low latency data platforms. Opinions are my own.
Peter J. Liu @peterjliu
8K Followers 2K Following AI research-eneur. Hiring eng: https://t.co/fv5QBjsv90. Was Research Scientist @ Google Brain / DeepMind, language model research. 🇨🇦🇺🇸
Eater SF @eatersf
137K Followers 381 Following Food news and dining guides for San Francisco. Download the Eater app ⬇️