Zachary Charles @MatharyCharles
distributed machine learning @ google | sometimes mathematician | zachcharles.com | Seattle | Joined September 2012
Tweets: 791
Followers: 1K
Following: 406
Likes: 1K
Reading "Factoring Rational Polynomials over the Complexes" and this part jumped out at me. Is this "random Monte-Carlo NC" computational complexity class something well studied?
It will take me years to unravel the linguistic rapture of this tweet. It'll be 2030 and I'll still be talking about chubby cubby appetizers.
I'm beginning to think the JAX api docs aren't rendering correctly on my phone. Just a vague suspicion.
I've seen people (e.g. Goedel v2 report) use model averaging to mitigate this issue - but cannot personally vouch for how well this works!
Evergreen synopsis from @_katieeverett on scaling exponents - optimizers just often seem not to change them (though there are some recent theoretical works that purport to change this) x.com/_katieeverett/…
I would love to see this done, but with the kind of evaluation methodology from the Algo Perf work. Though I guess one of the key takeaways from these two new works is that doing this in a "scaling aware" way requires new methodology. https://t.co/KOVeOjUiiY
I thought the fundamental insight from the model soup work was that you can bootstrap your own model collection just by changing training hparams - why isn't this used anywhere? E.g. From the (very cool) Goedel-Prover-V2 paper - no actual model soups, just model averaging.
I read the whole book a few years back. Graeber's explanations for why businesses would tolerate bullshit jobs were unconvincing - he basically invokes a nebulous "corporate feudalism" concept. More interesting were the insights on how a person feeling that their job was bs would…
Great read. Theoretical physics is so hard right now in part because the standard model is so good at fitting current data. The tough balance between "generating useful hypotheses" and "fitting past data" is also something I struggle with in AI!
Getting small batch sizes to work in bfloat16 precision can be challenging. In our recent paper on batch size, we ran all experiments in float32, but memory-constrained settings demand lower precision. Here are two tricks that we used to enable bf16 training at small batch sizes:

Dimitris Papailiopoul... @DimitrisPapail
20K Followers 1K Following Researcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); learning in context; thinking about reasoning; babas of Inez Lily.
Amin Karbasi @aminkarbasi
11K Followers 3K Following Senior director of Cisco Foundation AI, Former Chief Scientist at Robust Intelligence. ex Professor at Yale University, ex staff research scientist at Google.
Alex Dimakis @AlexGDimakis
21K Followers 2K Following Professor, UC berkeley | Founder @bespokelabsai |
Peter Richtarik @peter_richtarik
8K Followers 649 Following Federated Learning Guru. Tweeting since 20.5.2020. Lived in 🇸🇰🇺🇸🇧🇪🇬🇧🇸🇦
Hongyi Wang @HongyiWang10
2K Followers 2K Following Assist. Prof. @RutgersCS; Head of Infra @genbioai; Ex @mldcmu @WisconsinCS
Konstantin Mishchenko @konstmish
7K Followers 652 Following Research Scientist @AIatMeta Previously Researcher @ Samsung AI Outstanding Paper Award @icmlconf 2023 Action Editor @TmlrOrg I tweet about ML papers and math
Fabian Pedregosa @fpedregosa
6K Followers 571 Following Keeping the gradients flowing since 2013. Loves open source. Sometime blogs and writes papers.
Ahmad Beirami @abeirami
10K Followers 4K Following sth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
Peter Kairouz @KairouzPeter
1K Followers 146 Following @GoogleAI researcher focusing on federated learning, security, and differential privacy.
Jason Lee @jasondeanlee
18K Followers 4K Following Associate Professor at UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning.
Ankit Pensia @subGaussian
464 Followers 562 Following Research Fellow at @SimonsInstitute| Theoretical machine learning and statistics
Arya Mazumdar @MountainOfMoon
4K Followers 355 Following Professor @UCSanDiego Dy. Director+AD Research NSF AI Inst https://t.co/wblPm6DhUX, UCSD Site Lead @encoreinstitut Information Theory, Coding Theory, Machine Learning
César A. Uribe @CesarAUribe
1K Followers 744 Following @RiceECE 🦉. 🇨🇴 Sometimes control theorist, sometimes optimizer, and sometimes ML or data scientist. 🏐 setter. Zizek groupie.
Kartik Sreenivasan @KartikSreeni
1K Followers 573 Following Research scientist at MosaicML/Databricks. PhD from UW-Madison. Interested in LLMs, optimization, and the meaning of life.
Aryan Mokhtari @AryanMokhtari
2K Followers 342 Following Associate Prof at UT Austin. Visiting Researcher at Google Research. Interested in Optimization and ML/AI.
Damek @damekdavis
6K Followers 1K Following Optimization and ML, Prof @Wharton Stats, Prev: Prof @Cornell, PhD @uclamath, https://t.co/bfOIEx0lHj, (not quite a) blog: https://t.co/RFKUB4qDKF
Sara Hooker @sarahookr
49K Followers 9K Following I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
Shiqian Ma @ShiqianMa
2K Followers 1K Following Professor@Rice University. PhD from Columbia IEOR. Work on optimization and machine learning.
Aakanksha Chowdhery @achowdhery
11K Followers 5K Following @Stanford @reflection_ai // Previously @GoogleDeepMind :: PaLM, Gemini // @MSFTResearch, @Princeton // views my own and subject to change
Emanuel Cammon @highsociety777
14 Followers 187 Following
Hiroki Naganuma @_Hiroki11x
975 Followers 789 Following PhD Candidate at @UMontreal ,@Mila_Quebec / HPC, Generalization, Large Scale Optimization / ex- @AIatMeta, @GoogleDeepMind, @MSFTResearch, @IBMResearch
Jeevesh Juneja @xdfbhkl
3 Followers 178 Following
Nadav Timor @NadavTimor
722 Followers 7K Following LLM inference, speculative decoding, open source. Built novel decoding algorithms – default in Hugging Face Transformers (147k+ ⭐). Making LLMs faster + cheaper
MayJoule @g5ifT84KVa793
0 Followers 82 Following
無 @xwuxwux
1 Followers 4K Following
Walker Baumbach @baumbach_w69109
103 Followers 4K Following
Eerajarj @Eerajarj654041
3 Followers 202 Following Focused on investing in U.S. stocks, happy to discuss stock market trends.
Ionut Calin @r72492642
6 Followers 411 Following
Rohan Varma @rvarm1
1K Followers 479 Following Research Engineer @Meta Superintelligence working on scaling. Former @PyTorch core developer
Todd Sherman @tdd
6K Followers 2K Following Product Lead for @YouTube Shopping. Previously Shorts, @Twitter, @Snapchat & @LockheedMartin. Interested in consumer products, platforms & AI.
Niki @IntenseRealist
276 Followers 3K Following
Richard Ngo @RichardMCNgo
62K Followers 2K Following studying AI and trust. ex @openai/@googledeepmind
Yash Malik @_yash_malik_
92 Followers 1K Following ML @AmazonScience Scaling RL for LLMs Prev @Google, SC
utkarsh @utkarsh_2105
656 Followers 2K Following he/him | CS undergrad BITS Pilani | (distributed, large, fast and so on) Systems for ML @MSFTResearch, prev @Inria
Thomas Möllenhoff @tmoellenhoff
794 Followers 602 Following Senior Research Scientist (tenured) @RIKEN_AIP @RIKEN_EN — PhD from @TU_Muenchen @tumcvg
Abhay Gupta @gupta__abhay
389 Followers 2K Following Scaling and efficiency lead @DbrxMosaicAI | Previously @CerebrasSystems @CMU_Robotics | Making GPUs and agents go brrrr !!
λux @novasarc01
20K Followers 2K Following tensor shepherd in a non-euclidean pasture | grazing on cuda cores
Agrippa @admiralagrippa
169 Followers 645 Following Acolyte of the Church of Rao. Spreading the gospel of Yuma @opentensor. τ/acc.
shreddington @shreddington4
41 Followers 2K Following
Adam Sadovsky @asadovsky
1K Followers 473 Following CVP, AI at Microsoft AI (past: @GoogleDeepMind, @Google)
Mowbray @MowbrayRv
21 Followers 2K Following $ whoami Grad student @iitmadras | prev @iiscbangalore $ cat research.txt RL | Controls $ echo "Shifting gears: from building machines → making them learn"
Areg Karapetyan @arnukk
38 Followers 263 Following Research Scientist at @NYUAbuDhabi. Researching in #Optimization, #Algorithms, #Theory and #AI with applications to #AutonomousSystems and #SmartCities.
Gustavo De Mari Perei... @guhdemari
470 Followers 5K Following machine learning, data science, software engineering
Ivy Xueqing Yang @YangIvyXQ
550 Followers 2K Following Incoming assistant professor of Economics&Finance @SIUE. Macro-finance. Topics in entrepreneurship. “Schue-Chueng Yang” 杨雪青
Léo Aparisi de Lanno... @LeoAparisidL
1K Followers 4K Following PhD student @UChi_Economics. Economics/History/Politics/Whatever.
rohan anil @_arohan_
25K Followers 2K Following
Bruno Batinica @bruno_batinica
21 Followers 2K Following
surya @suryaasub
267 Followers 844 Following gpu kernels @nvidia. prev. pytorch @aiatmeta, ml infra @pinteresteng. cs @georgiatech
Roger Jin @rogershijin
683 Followers 2K Following post-training @NousResearch. past: apple mle, google student researcher, mit math & cs
Teknium (e/λ) @Teknium1
50K Followers 5K Following Cofounder and Head of Post Training @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE
Distributed State @DistStateAndMe
3K Followers 2K Following cult leader/ exit liquidity at https://t.co/rH6EoYy4aK / basilica/ grail || Summoner of Divine Computation || Bittensor Maxi
Naoto Usuyama @naotous
1K Followers 702 Following Principal Researcher @Microsoft | AI for Health 🧬🔬 🎾 | Kamakura/Tokyo 🇯🇵 → Seattle 🏞️
DIENG Cheikh Ibra @dcheikhibra
56 Followers 2K Following Data & AI @ ENSAE 🤖 | From Dakar to Paris to the world 🌍 | Founder mindset ⚡ | (finance • media • sport • NLP • Crypto ) | Legacy. Growth. Impact.
Lain @not_so_lain
2K Followers 1K Following @huggingface fellow | Software engineer @ChonkieAI & MLE @ mermory PyTorchModelHubMixin hero
Christian @chrischarts8
211 Followers 3K Following mcgill md | built BreastGAN & BreastReconGAN | building PCare+
j @JC_9_1_1
33 Followers 612 Following
Dimitris Papailiopoul... @DimitrisPapail
20K Followers 1K Following Researcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); learning in context; thinking about reasoning; babas of Inez Lily.
Gautam Kamath @thegautamkamath
57K Followers 568 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Joining @NYU_Courant September 2026. Co-EiC @TmlrOrg. I lead @TheSalonML.
Amin Karbasi @aminkarbasi
11K Followers 3K Following Senior director of Cisco Foundation AI, Former Chief Scientist at Robust Intelligence. ex Professor at Yale University, ex staff research scientist at Google.
Alex Dimakis @AlexGDimakis
21K Followers 2K Following Professor, UC berkeley | Founder @bespokelabsai |
Peter Richtarik @peter_richtarik
8K Followers 649 Following Federated Learning Guru. Tweeting since 20.5.2020. Lived in 🇸🇰🇺🇸🇧🇪🇬🇧🇸🇦
Sebastian Raschka @rasbt
354K Followers 1K Following ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
Yann LeCun @ylecun
948K Followers 764 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Jonathan Frankle @jefrankle
20K Followers 725 Following Chief AI Scientist @databricks via MosaicML.
Dan Roy @roydanroy
57K Followers 2K Following ML / AI researcher. Research Director and Canada CIFAR AI Chair, @VectorInst. Professor, @UofT (Statistics/CS).
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Aaron Defazio @aaron_defazio
8K Followers 584 Following Research Scientist at Meta Superintelligence Labs working on optimization algorithms. Fundamental AI Research (FAIR) team
Hongyi Wang @HongyiWang10
2K Followers 2K Following Assist. Prof. @RutgersCS; Head of Infra @genbioai; Ex @mldcmu @WisconsinCS
Konstantin Mishchenko @konstmish
7K Followers 652 Following Research Scientist @AIatMeta Previously Researcher @ Samsung AI Outstanding Paper Award @icmlconf 2023 Action Editor @TmlrOrg I tweet about ML papers and math
Sebastien Bubeck @SebastienBubeck
56K Followers 1K Following I work on AI at OpenAI. Former VP AI and Distinguished Scientist at Microsoft.
Fabian Pedregosa @fpedregosa
6K Followers 571 Following Keeping the gradients flowing since 2013. Loves open source. Sometime blogs and writes papers.
Francesco Orabona @bremen79
8K Followers 411 Following Dad and associate professor at @KAUST_News. Formerly @BU_ece, @sbucompsc, @YahooResearch, @TTIC_Connect. ML theory&practice, obsessed with history of science
Ahmad Beirami @abeirami
10K Followers 4K Following sth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
Peter Kairouz @KairouzPeter
1K Followers 146 Following @GoogleAI researcher focusing on federated learning, security, and differential privacy.
Aaron Roth @Aaroth
11K Followers 634 Following CS prof at Penn. Amazon Scholar at AWS. Author of The Ethical Algorithm (w/ Michael Kearns). I study machine learning, privacy, game theory, and uncertainty.
Tengyu Ma @tengyuma
37K Followers 565 Following Assistant professor at Stanford; Co-founder of Voyage AI (https://t.co/wpIITHLgF0) ; Working on ML, DL, RL, LLMs, and their theory.
exciting!!excellent!!... @superexcitebike
4K Followers 2K Following the world’s cutest most adorable emo chiptune girl band 🥰 tweets by jas (she/her)
Nadav Timor @NadavTimor
722 Followers 7K Following LLM inference, speculative decoding, open source. Built novel decoding algorithms – default in Hugging Face Transformers (147k+ ⭐). Making LLMs faster + cheaper
Fred Zhang @FredZhang0
1K Followers 500 Following research scientist @googledeepmind, prev phd @berkeley_eecs, DM open
Andrei Semenov @AndreiSemenov17
127 Followers 247 Following MSc in Data Science at @EPFL_en | MIPT Alumnus
OpenAI @OpenAI
4.3M Followers 3 Following OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6Lg202
Rohan Varma @rvarm1
1K Followers 479 Following Research Engineer @Meta Superintelligence working on scaling. Former @PyTorch core developer
Run more buses and tr... @headwaysmatter
835 Followers 989 Following 'worst kind of pedant and bore that I have had the displeasure of encountering over the years' @headwaysmatter.bsky.social and @[email protected]
Katie Wilson for Seat... @wilsonformayor
1K Followers 288 Following Progressive coalition-builder, mom, running for Mayor of Seattle. It's time for new leadership. Come join the team!
Leonardo @leonardofed
7K Followers 382 Following Desi@n on the edge @primeintellect, @openprest, @ogcompanyops
Richard Ngo @RichardMCNgo
62K Followers 2K Following studying AI and trust. ex @openai/@googledeepmind
Grad @Grad62304977
4K Followers 2K Following
David Kroman @KromanDavid
13K Followers 2K Following City Hall reporter for @SeattleTimes | Send tips to: [email protected] | Formerly @Crosscut | Sorry for the baseball tweets.
batuhan the fal guy @isidentical
9K Followers 607 Following just a chill guy. @fal. fastest inference on multiple diffusion models. Python core developer / @thePSF fellow
Ashwinee Panda @PandaAshwinee
3K Followers 723 Following Postdoc of @tomgoldsteincs, PhD @princeton, @Cal alum, currently working on LLMs
Sarah Catanzaro @sarahcat21
14K Followers 1K Following “All methods are sacred if they are internally necessary” (GP @amplifypartners, prev @canvasvc; Head of Data @Mattermark; @palantirtech; @c4ads)
surya @suryaasub
267 Followers 844 Following gpu kernels @nvidia. prev. pytorch @aiatmeta, ml infra @pinteresteng. cs @georgiatech
Nikita Bier @nikitabier
582K Followers 2K Following head of product @x, advisor @solana, venture partner @lightspeedvp, ex-founder @gasappteam (acq by discord), ex-founder @thetbhapp (acq by facebook)
Roger Jin @rogershijin
683 Followers 2K Following post-training @NousResearch. past: apple mle, google student researcher, mit math & cs
Distributed State @DistStateAndMe
3K Followers 2K Following cult leader/ exit liquidity at https://t.co/rH6EoYy4aK / basilica/ grail || Summoner of Divine Computation || Bittensor Maxi
Teknium (e/λ) @Teknium1
50K Followers 5K Following Cofounder and Head of Post Training @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE
Nous Research @NousResearch
81K Followers 73 Following The AI Accelerator Company https://t.co/vrD0aDIGDQ
Wanchao Liang @wanchao_
1K Followers 225 Following building @thinkymachines ex-PyTorch @ Meta. Author of PyTorch DTensor and TorchTitan. Opinions are my own
Yu Bai @yubai01
6K Followers 2K Following Research @OpenAI. Trained models for GPT5 Thinking / Mini; Contributor to gpt-oss, o3-mini, o1. Previously @SFResearch, PhD @Stanford.
Zeyuan Allen-Zhu, Sc.... @ZeyuanAllenZhu
20K Followers 451 Following physics of language models @ Meta (FAIR, not GenAI) 🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS 🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
Clément Farabet @clmt
19K Followers 2K Following AI @ Google DeepMind (AI Studio, Gemma, Gemini). Ex NVIDIA (self-driving cars, https://t.co/QtrCBg3wx0), Twitter (founded Cortex), MadBits (founded+sold) 🇺🇸🇫🇷
Chulin Xie @ChulinXie
1K Followers 916 Following CS PhD student at UIUC; IBM PhD fellow; prev. intern @GoogleAI @MSFTResearch @NvidiaAI
rohan anil @_arohan_
25K Followers 2K Following
Jeremy Bernstein @jxbz
6K Followers 606 Following 🧪 @thinkymachines ✍️ anon feedback @ https://t.co/RIhBhjMRdD
Druv Pai @druv_pai
439 Followers 322 Following PhD @berkeley_ai | using theory to improve practice for deep learning | ex @NexusflowX @GoogleResearch
Seth Hain @sethHain
381 Followers 684 Following Contemplations on computation, cognition, chords, and care.... Expressly individual insights
Naoto Usuyama @naotous
1K Followers 702 Following Principal Researcher @Microsoft | AI for Health 🧬🔬 🎾 | Kamakura/Tokyo 🇯🇵 → Seattle 🏞️
Shane Waxler @shanewaxler
134 Followers 266 Following Building Cosmos AI at @HeyEpic. Views are my own
Lain @not_so_lain
2K Followers 1K Following @huggingface fellow | Software engineer @ChonkieAI & MLE @ mermory PyTorchModelHubMixin hero