evolvingstuff @evolvingstuff
I post about machine learning and occasionally some other stuff. Joined December 2009-
Tweets4K
-
Followers3K
-
Following2K
-
Likes12K
This one paper might kill the LLM agent hype. NVIDIA just published a blueprint for agentic AI powered by Small Language Models. And it makes a scary amount of sense. Here’s the full breakdown:
How to build a thriving open source community by writing code like bacteria do 🦠. Bacterial code (genomes) are: - small (each line of code costs energy) - modular (organized into groups of swappable operons) - self-contained (easily "copy paste-able" via horizontal gene…
DeepSWE is a new state-of-the-art open-source software engineering model trained entirely using reinforcement learning, based on Qwen3-32B. together.ai/blog/deepswe Fantastic work from @togethercompute @Agentica_‼
DeepSWE is a new state-of-the-art open-source software engineering model trained entirely using reinforcement learning, based on Qwen3-32B. together.ai/blog/deepswe Fantastic work from @togethercompute @Agentica_‼ https://t.co/mLAbi2HD2Z
Text-to-LoRA: Instant Transformer Adaption arxiv.org/abs/2506.06105 Generative models can produce text, images, video. They should also be able to generate models! Here, we trained a Hypernetwork to generate new task-specific LoRAs by simply describing the task as a text prompt.
Text-to-LoRA: Instant Transformer Adaption arxiv.org/abs/2506.06105 Generative models can produce text, images, video. They should also be able to generate models! Here, we trained a Hypernetwork to generate new task-specific LoRAs by simply describing the task as a text prompt.
🚨 NEW: We made Claude, Gemini, o3 battle each other for world domination. We taught them Diplomacy—the strategy game where winning requires alliances, negotiation, and betrayal. Here's what happened: DeepSeek turned warmongering tyrant. Claude couldn't lie—everyone…
This is 🤯 Figure 02 autonomously sorting and scanning packages, including deformable ones. The speed and dexterity are amazing.
A major mistake I made in my undergrad is that I focused way too much on mathematical lens of computing - computability, decidability, asymptotic complexity etc. And too little on physical lens - energy/heat of state change, data locality, parallelism, computer architecture. The…
4 advanced attention mechanisms you should know: • Slim attention — 8× less memory, 5× faster generation by storing only K from KV pairs and recomputing V. • XAttention — 13.5× speedup on long sequences via "looking" at the sum of values along diagonal lines in the attention…
damn,.... this is so incredibly cool use case for discrete diffusion model
TL;DR: we are excited to release a powerful new open-weight language model with reasoning in the coming months, and we want to talk to devs about how to make it maximally useful: openai.com/open-model-fee… we are excited to make this a very, very good model! __ we are planning to…
Inspired by the success of LLMs, today on the blog we discuss how neural activity in the human brain aligns linearly with the internal contextual embeddings of speech and language within LLMs as they process everyday conversations. Learn more →goo.gle/4iiUoNj
How does the depth of a transformer affect reasoning capabilities? New preprint by myself and @Ashish_S_AI shows that a little depth goes a long way to increase transformers’ expressive power We take this as encouraging for further research on looped transformers!🧵
This is fun because LLMs can condition on free-form side information, and make predictions about anything. This turns qualitative knowledge into quantitative predictions. Here we condition Llama 3 on two datapoints, plus text. Changing the text changes the meaning of the data.
Diffusion language models are SO FAST!! A new startup, Inception Labs, has released Mercury Coder, "the first commercial-scale diffusion large language model" It's 5-10x faster than current gen LLMs, providing high-quality responses at low costs. And you can try it now!
Let me update my current belief: This is a certainty and a question of months.
Let me update my current belief: This is a certainty and a question of months.
uh it might be over... they put r1 in a loop for 15minutes and it generated: "better than the optimized kernels developed by skilled engineers in some cases"
uh it might be over... they put r1 in a loop for 15minutes and it generated: "better than the optimized kernels developed by skilled engineers in some cases" https://t.co/b2qxqqBKMZ
The future of robotics is RL with synthetic data. GRPO could teach robots to learn like humans do. But implementing it for robotics is non-trivial. Here's where this breakthrough technology remains trapped:
New open source reasoning model! Huginn-3.5B reasons implicitly in latent space 🧠 Unlike O1 and R1, latent reasoning doesn’t need special chain-of-thought training data, and doesn't produce extra CoT tokens at test time. We trained on 800B tokens 👇

Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Jeremy Howard @jeremyphoward
259K Followers 6K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Prev: professor @ UQ; Stanford fellow; @kaggle president; @fastmail/@enlitic/etc founder https://t.co/16UBFTX7mo
Delip Rao e/σ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Miles Brundage @Miles_Brundage
62K Followers 12K Following AI policy researcher, wife guy in training, fan of cute animals and sci-fi, Substack writer, stealth-ish non-profit co-founder
Thomas Wolf @Thom_Wolf
94K Followers 6K Following Co-founder at @HuggingFace - open-source and open-science
Dmytro Mishkin 🇺�... @ducha_aiki
24K Followers 703 Following Marrying classical CV and Deep Learning. I do things, which work, rather than being novel, but not working.
Richard Socher @RichardSocher
112K Followers 1K Following CEO @youdotcom MP @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMind
Sara Hooker @sarahookr
49K Followers 9K Following I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
Leo Boytsov @srchvrs
9K Followers 2K Following Machine learning scientist and engineer speaking πtorch & C++. Past @LTIatCMU, @awscloud. Opinions sampled from MY OWN 100T param LM.
Christian Szegedy @ChrSzegedy
41K Followers 3K Following #deeplearning, #ai research scientist. Opinions are mine.
Nathan Benaich @nathanbenaich
61K Followers 34K Following solo member of investment staff @airstreet @airstreetpress @stateofaireport @raais
Ida Momennejad @criticalneuro
15K Followers 2K Following Principal Researcher @MSFTResearch. I study memory & planning in brains. I build & evaluate AI.
Jascha Sohl-Dickstein @jaschasd
24K Followers 706 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.
云创兽Ai @Crocer54910
1 Followers 108 Following 💸 market heroine all in on vastly stock investing! thrilled to connect. DM me for stock screeners! 🎯 #NYSE #Markets
Maddie Marlowe @MaitresseM
91K Followers 2K Following Screenwriter/Filmmaker • interdisciplinary swer • decentralCINE • @Coders_Room • 🫀machine • Power Xchange 📌 current: off-grid writing 6-EP mixed-genre series
Hawking Zhang @wydszqd
10 Followers 548 Following
Cooper Larson @LarsonCoop48209
85 Followers 4K Following
Ivan Shkvarun @IvanShkvarun
400 Followers 2K Following CEO @_SocialLinks_ | OSINT, AI & Digital Risk Visionary | Building trust in the age of agents | Speaker | Founder | #AI #OSINT #DataTrust
Theathirs @Theathirs31t6a
62 Followers 3K Following
Xorca @Xorca9087
77 Followers 2K Following
Ardooferq @Ardooferq9389
72 Followers 2K Following
Vasanth Raghu @naironics
59 Followers 5K Following
Tanish Anand @TanishAnan66928
0 Followers 71 Following
ReginaGrant @rNG2Z54j3EV5pC1
74 Followers 2K Following
Qinglin Zhu @qinglin_zhu1
28 Followers 1K Following
Karl Weinmeister @kweinmeister
2K Followers 4K Following Cloud Engineering @ Google. AI/ML/Data, Blue Devil & Longhorn, wanna-be at home improvement. Opinions are my own.
TobeyWhitman @T7u1V25112xE6
31 Followers 877 Following
Kevin Sosa @dulqur
15 Followers 80 Following
Vamshi Thallapally @VamshiThallapa1
16 Followers 658 Following
Oliver Hennhöfer @OHennhoefer
390 Followers 5K Following statistics/ml, uncertainty quantification, anomaly detection, finance.
Ken Ngala @KenNgala2
25 Followers 225 Following Deep Learning Practitioner | Fastai Enthusiast | AI for Good | Kaggle Contributor | Always Learning
adhernem @adhernem12
284 Followers 4K Following
Connor T. Jerzak @JerzakConnor
504 Followers 753 Following @UTAustin "Nullius in verba" Discussion→https://t.co/81Fe6eR7Ys Jobs→https://t.co/hsHKsrtcsR
Arthur Schonbach @ArthurSchonbach
0 Followers 13 Following
Sababa @rubusursinus
23 Followers 357 Following
Morteza Zabihi @MortezaZabihi_
11 Followers 436 Following Associate Director of the MGB NeuroAI Center | Instructor at Harvard Medical School
Fajar | Data Analyst @muhfajarags
15 Followers 25 Following 💡 Practical Data Analyst 🔢 Agentic AI Enthusiast
Ashis Kumar Panda @ashiskumarpanda
190 Followers 924 Following 📌Data Scientist @EpsilonMktg . Simplifying tough data science concepts. Lifelong learner .
francois.victor @FVictor_bioinfo
17 Followers 818 Following
Kris @kmbroga
1 Followers 254 Following
Vladimir Frants @vavevol
1 Followers 63 Following
FaySmith @3uo2cbQzVENXP
68 Followers 7K Following
K6 @UpdateLiveware
1K Followers 970 Following ML PhD student | Neuromancer-in-training. Reformed Shrimp Uplifter. Shrike Cultist. Subspace Emissary.
Stef @stefano_kerope
131 Followers 304 Following Solo founder turning coffee into code & AI. Currently building an AI-powered trading companion. Follow my journey building in public.
🎱 BitcoinBananaBY @BitcoinBananaBY
710 Followers 2K Following GME x BBBY x CYDY to Uranus DD for ML, Retail, Biotech Tweets, Likes or Reweets are only personal opinions, not financial advice nor am I a financial advisor.
Atli Kosson @AtliKosson
241 Followers 480 Following PhD student at @EPFL🇨🇭 working on improved understanding of deep neural networks and their optimization. Previously did NN training @Tesla_AI @CerebrasSystems
Joel Martinez @joelmartinez
2K Followers 2K Following Principal Software Engineering Manager at @microsoft (via @xamarinhq), working on @msft4startups. Founded @onetug. #eldermillenial 🇺🇸 🇩🇴
Dmt Elf ∞/21M @DmtElf117
492 Followers 2K Following Truth and freedom for all sentient beings. #Bitcoin is the only peaceful weapon/tool of hope for defending ourselves from tyranny. [email protected]
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
AK @_akhaliq
425K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Sebastian Raschka @rasbt
354K Followers 1K Following ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
Jim Fan @DrJimFan
325K Followers 3K Following NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
Google DeepMind @GoogleDeepMind
1.2M Followers 279 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
Soumith Chintala @soumithchintala
250K Followers 1K Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.
Jürgen Schmidhuber @SchmidhuberAI
163K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
elvis @omarsar0
263K Followers 664 Following Building with AI agents @dair_ai • Prev: Meta AI, Galactica LLM, Elastic, PaperswithCode, PhD • I share insights on how to build with AI Agents ↓
Alfredo Canziani @alfcnz
117K Followers 296 Following Musician, math lover, cook, dancer, 🏳️🌈, and an ass prof of Computer Science at New York University
Lucas Beyer (bl16) @giffmana
108K Followers 519 Following Researcher (now: Meta. ex: OpenAI, DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian. Anon feedback: https://t.co/xe2XUqkKit ✗DMs → email
(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
Jeremy Howard @jeremyphoward
259K Followers 6K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Prev: professor @ UQ; Stanford fellow; @kaggle president; @fastmail/@enlitic/etc founder https://t.co/16UBFTX7mo
Sander Dieleman @sedielem
63K Followers 2K Following Research Scientist at Google DeepMind (WaveNet, Imagen, Veo). I tweet about deep learning (research + software), music, generative models (personal account).
Delip Rao e/σ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Miles Brundage @Miles_Brundage
62K Followers 12K Following AI policy researcher, wife guy in training, fan of cute animals and sci-fi, Substack writer, stealth-ish non-profit co-founder
Yam Peleg @Yampeleg
38K Followers 2K Following The only AI researcher they sent a missile for 🇮🇱 | Co-host @thursdai_pod • AI news every Thursday
Zandria Eriksson @ZandriaEriksson
27 Followers 46 Following I talk about Stoic philosophy for modern women, building unshakeable confidence from within, and mind-body-lifestyle transformation.
Saurabh Kumar @drummatick
20K Followers 342 Following Building @kodomamo_JP Presently focusing on LLM Finetuning and Scaling
Femke Plantinga @femke_plantinga
9K Followers 600 Following learn with me about AI. growth @weaviate_io
Jorvon Moss_Odd_Jayy @Odd_Jayy
22K Followers 680 Following Thoughts and Opinions are my own https://t.co/ygdxgGpBBH https://t.co/cQ8BlV2AQ4 https://t.co/vIKWFMgee4 https://t.co/cleOHvdIVl
Francisco Fonseca @_Francis_co_Art
115K Followers 1K Following 29 years old Illustrator and Street Artist From Porto, Portugal Online Shop and Domestika Course 👇🏼
David Duvenaud @DavidDuvenaud
31K Followers 4K Following Machine learning prof @UofT. Former team lead at Anthropic. Working on generative models, inference, & latent structure.
ThePrimeagen @ThePrimeagen
297K Followers 1K Following skill issues: 🟩⬛️⬛️⬛️⬛️⬛️(69/420) https://t.co/qWJnB6p4EP https://t.co/IwY3FTx1ZE https://t.co/TYJ6aSpwYs
Ken Ngala @KenNgala2
25 Followers 225 Following Deep Learning Practitioner | Fastai Enthusiast | AI for Good | Kaggle Contributor | Always Learning
Ben Clavié @bclavie
6K Followers 1K Following regressing linearly on a daily basis. wife guy who does retrieval. research @mixedbreadai, prev answerdotai
Julia McCoy @JuliaEMcCoy
29K Followers 11K Following AGI + future of humanity. Founder, @FirstMoversAI, where we help business owners and marketers adapt to the AI age. YouTuber, https://t.co/XIBUWGRoV9
The Humanoid Hub @TheHumanoidHub
64K Followers 737 Following Humanoid Robots: Technology, Business, and Social Dynamics
Zhengyao Jiang @zhengyaojiang
4K Followers 417 Following Cofounder & CEO @WecoAI. Automating hill climbing with AI-Driven Exploration (AIDE). PhD in Machine Learning @UCL_DARK. (Zheng=j-uhng, j as in job; yao=y-aoww)
Daniel Han @danielhanchen
28K Followers 2K Following Building @UnslothAI. Finetune train LLMs faster. LLMs bug hunter. OSS package https://t.co/aRyAAgKOR7. YC S24. Prev ML at NVIDIA. Hyperlearn used by NASA.
Aryeh Kontorovich @aryehazan
10K Followers 604 Following probability, statistics, metric spaces, Markov chains, freedom (social & academic), Israel, Jew stuff. opinions represent my employer & all other groups I'm in
Oliver Hennhöfer @OHennhoefer
390 Followers 5K Following statistics/ml, uncertainty quantification, anomaly detection, finance.
Mariya I. Vasileva @mariyaivasileva
19K Followers 2K Following Research @Meta Superintelligence Labs •🦙 multimodal safety • ex @AWS • 🎓 @IllinoisCDS (PhD), @Caltech • @WiMLWorkshop, @CVFADworkshop, @ResistanceAI • 🇧🇬
Sergey Demyanov @sdemyanov
183 Followers 832 Following Founder & CEO of Beagle. Previously: ML manager @ Snap. 1x exit. PhD in Machine Learning.
Behnam Neyshabur @bneyshabur
29K Followers 857 Following Research @AnthropicAI (Co-lead Discovery team) 💼 Past: Gemini @GoogleDeepMind (Co-led Blueshift team) 🧠 LLM Reasoning / AI Scientist 🎒Traveling & Backpacking
Michael Tschannen @mtschannen
3K Followers 674 Following Research Scientist @GoogleDeepMind. Representation learning for multimodal understanding and generation. Personal account.
Min Choi @minchoi
316K Followers 1K Following AI Educator. 𝕏 about AI, solutions and interesting things. Showing how to leverage AI in practical ways for you and your business. Opinions are my own.
Aryan Pandey @AryanPa66861306
4K Followers 3K Following Machine Learning(RL+CV+Robotics+NLP) || DevOps || Open source
Saeed Salehi (ssnio.b... @ssn_io
396 Followers 309 Following @ml_tuberlin PhD student @TUBerlin 🖥️🔮alumnus of @bccn_berlin 🧠 and @BTU_CS ⚡️
Barlow Adams @BarlowAdams
22K Followers 9K Following Pie enthusiast. Historically preserved beard site. Waffle House Poet Laureate. Best Small Fictions, Best of the Net, Wigleaf Top 50. Rejected by Tiger Beat
Frank Manzano @loved_orleer
11K Followers 1K Following
Mark Tenenholtz @marktenenholtz
137K Followers 624 Following Head of AI @PredeloHQ. Building reliable agents. XGBoost peddler, transformer purveyor.
Jonathan Gorard @getjonwithit
40K Followers 17 Following Applied mathematician, computational physicist @Princeton Previously @Cambridge_Uni Making the universe computable.
K6 @UpdateLiveware
1K Followers 970 Following ML PhD student | Neuromancer-in-training. Reformed Shrimp Uplifter. Shrike Cultist. Subspace Emissary.
Defender @DefenderOfBasic
15K Followers 3K Following Memetics science writer / open memetics evangelist. Basically trying to do culture engineering transparently. Does that make sense?
Georgia Gkioxari @georgiagkioxari
11K Followers 453 Following Assistant professor in Computing + Mathematical Sciences @Caltech 🏛️ ∙ Computer vision enthusiast 🤖 ∙ Part-time at @metaai 👩🏻💻∙ From 🇬🇷
Timothy Nguyen @IAmTimNguyen
12K Followers 449 Following Machine learning researcher at @GoogleDeepMind & mathematician. Host of The Cartesian Cafe podcast. All opinions are my own.
Stef @stefano_kerope
131 Followers 304 Following Solo founder turning coffee into code & AI. Currently building an AI-powered trading companion. Follow my journey building in public.
JLarky @JLarky
10K Followers 943 Following Opinions are not my own. As soon as I say something I become new me who hates anything old me did. CEO of HTMX