sergio sainz @sergiosp13
engineer, deep learning enthusiast, climber, and slow runner
Joined September 2010
Tweets 249
Followers 38
Following 329
Likes 4K
1/ Google Research unveils new paper: "Titans: Learning to Memorize at Test Time" It introduces human-like memory structures to overcome the limits of Transformers, with one "SURPRISING" feature. Here's why this is huge for AI. 🧵👇
This page of common pytorch mistakes is pretty invaluable uvadlc-notebooks.readthedocs.io/en/latest/tuto…
Everything you know about the world is a belief about the statistics of your sensory input and how they depend on your output. There is nothing more to it, and understanding knowledge in this sense is one key to creating AI.
🔥llm.c update: Our single file of 2,000 ~clean lines of C/CUDA code now trains GPT-2 (124M) on GPU at speeds ~matching PyTorch (fp32, no flash attention) github.com/karpathy/llm.c… On my A100 I'm seeing 78ms/iter for llm.c and 80ms/iter for PyTorch. Keeping in mind this is fp32,…
Today we’re releasing Code Llama 70B: a new, more performant version of our LLM for code generation — available under the same license as previous Code Llama models. Download the models ➡️ bit.ly/3Oil6bQ • CodeLlama-70B • CodeLlama-70B-Python • CodeLlama-70B-Instruct
Do you want to adapt an LLM on your own data and domain? 🤔 Learn how to finetune a 7B parameter model on a typical consumer GPU (NVIDIA T4 16GB) with LoRA and tools from the PyTorch and Hugging Face ecosystem in our latest post. Details 👉 hubs.la/Q02g0x1N0
Several people have told me over drinks that these puzzles are being used for ML tech interviews 🤣. github.com/srush/Tensor-P… github.com/srush/GPU-Puzz…
One of the best tutorial-style repos since @karpathy's minGPT! GPT-Fast: a minimalistic, PyTorch-only decoding implementation loaded with best practices: int8/int4 quantization, speculative decoding, Tensor parallelism, etc. Boosts the "clock speed" of LLM OS by 10x with no model…
The UAE Minister of AI @OmarSAlolama points to a historical precedent of premature technology regulation motivated by fear: the ban of the printing press in 1515 by Sultan Selim I led to the decline of the Ottoman Empire. “We overregulated a technology, which was the printing…
OpenAI Developer Day Announcement and Implications for Open-Source AI development OpenAI's developer day was filled with a bunch of announcements. Most notable amongst them were a 3x price cut on GPT-4-turbo, Custom GPTs and Assistants APIs. Sadly and predictably, there was no…
Here is an incredible GPT-4 prompt for engineers. Use it to speed up any code by identifying inefficiencies and rectifying them: --- <prompt_explanation> You are a world expert in making code run faster. You use any resource you can to do so. Given some code, first, explain…
Artificial lifeforms are super fascinating to watch. These self-organizing, self-replicating, “lifeforms” emerged from a continuous time cellular automata system called Flow-Lenia. Lenia is a family of CAs generalizing Conway’s Game of Life to continuous space, time and states.
Beijing Fall 2023
@elonmusk Sama playbook: 1) scrape the open internet 2) train a large model to compress it 3) sell a slow drip back to users via API 4) make it illegal for other players to do the same The internet collectively getting mugged and being forced to buy back its assets from the bandits
Introducing Neural Developmental Programs (NDPs)🧬🧠Instead of neural networks with fixed architectures, we allow neural networks to grow through a dynamic self-organizing process, inspired by how biological nervous systems develop👇 PDF: arxiv.org/abs/2307.08197
A key way you can make your LLM/RAG chatbot more “advanced” over complex data sources is to add a router - dynamically decide which data to query / which parameters to use using LLMs. It’s a key step towards LLM workflow automation. It sounds simple, but there’s some trickiness:…
Today we’re releasing Code Llama, a large language model built on top of Llama 2, fine-tuned for coding & state-of-the-art for publicly available coding tools. Keeping with our open approach, Code Llama is publicly-available now for both research & commercial use. More ⬇️
"How is LLaMa.cpp possible?" great post by @finbarrtimbers finbarr.ca/how-is-llama-c… llama.cpp surprised many people (myself included) with how quickly you can run large LLMs on small computers, e.g. 7B runs @ ~16 tok/s on a MacBook. Wait don't you need supercomputers to work…
Built a notebook that makes it dumb simple to fine-tune LLaMA 2. Just load in a dataset, and run it! colab.research.google.com/drive/1Zmaceu6…
The Japanese multiplication method makes everybody feel "I wish they taught math like this in school." It's not just a cute visual tool: it illuminates how and why long multiplication works. Here is the full story.

Conette Yomina @ConetteY86393
1 Followers 136 Following
Louie Peters @_LouiePeters
8K Followers 8K Following Posts on AI, tech & investing (reflexresearch). CEO @towards_ai Making AI Accessible since 2019: Courses, Discord, Blogs, Books, B2B LLM Training & Consultancy
Katherine @broadwater_kath
274 Followers 3K Following
John McCone @McconeJohn
5K Followers 2K Following Founder: "Philosophy For The Future", a philosophy, technology and economics blog offering perspective and preparing you for the future. Basic Income supporter.
Furong Huang @furongh
9K Followers 2K Following Associate professor of @umdcs @umiacs @ml_umd at UMD. Researcher in #AI/#ML, AI #Alignment, #RLHF, #Trustworthy ML, #EthicalAI, AI #Democratization, AI for ALL.
Arianna @AsmiiNurlii
58 Followers 1K Following hope to top* study , travel and learning *☺️ reading improve skill *^ *^ still living freedoms 🤩
Jorge Escamilla @JorgeEscamilla
119 Followers 142 Following Making videos is my pastime more than it is my job.
Marisela🇲🇽 🛡... @marion_7212
14K Followers 14K Following I'm cheerful, optimistic, simple, and full of love; I detest injustice, lies, and hypocrisy. #Amlovers #RedAMLOVE
Dora Alicia Santiago @DoraAliciaSant4
8 Followers 180 Following If you can dream it, you can achieve it
yangzisheng @yangzisheng1
44 Followers 512 Following Sincere friends . A happy life, the best self .
dataScienceRetreat @dataScienceRet
5K Followers 5K Following Data Science Retreat turns talented tech professionals into effective data scientists with three months of intensive mentoring in Berlin. Portfolio-project focus
Derek McCormack @Toronto2Tokyo28
279 Followers 3K Following Ready to settle down. she's definitely the one and I'm ready to ask her the big question.
Erika M @ErickaMG84
152 Followers 436 Following
cecy ramos @AceciliaCecy
12 Followers 67 Following
Valerie Dragone @ValerieDragone
3 Followers 78 Following
عافك الخاطر @NNweeeeee
1K Followers 2K Following
Ambabad @ambabad
645 Followers 2K Following Stay calm and game on. Blockchain, Biohacking, Nootropics, Productivity, BI, AI, Gamification, Cryptocurrencies, Video Games, Star Trek and Star Wars!!
Emm Fernández @Fernandievski
137 Followers 214 Following Someday my quiff will be Steven-style and my mustache handlebar-style. Meanwhile, from time to time, I curate playlists for @stellitaradio 🎧 and write at @bloomartwriters (Medium) 🖊️
Ed sainz @sainzed
25 Followers 173 Following
Jorge Argüelles @jarguellesv
199 Followers 372 Following
bacacho_911 @bacacho_911
176 Followers 714 Following
Laura Yu @JewelzJewelz
78 Followers 81 Following
Vic V @portenez
140 Followers 640 Following Code, food, music and lolz make me :). Opinions are my own and not my employer's.
Alejandro Ortiz Tapia @ortiztapia
361 Followers 945 Following Engineer by day, occasional writer on my blog, NFT creator, and I also really love to run. 🏃🏻‍♂️ 💨
gabroe @gabroe
136 Followers 231 Following
Ignacio Villarruel @ivillarruel
241 Followers 873 Following
Ruben Pacheco @ChileISC
25 Followers 34 Following
Benjamín Hernández ... @tonklis
933 Followers 668 Following I like video games, boxing, and motorcycles.
Ashlynn Ortega @Lizzbeth9bbd
25 Followers 133 Following
Naveen Rao @NaveenGRao
32K Followers 879 Following VP GenAI @Databricks. Former CEO/cofounder MosaicML & Nervana/IntelAI. Neuro + CS. I like to build stuff that will eventually learn how to build other stuff.
Noam Brown @polynoamial
92K Followers 856 Following Researching reasoning @OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o3 / o1 / 🍓 reasoning models
Alexander Wei @alexwei_
24K Followers 194 Following Reasoning @OpenAI. Co-built CICERO @MetaAI | @Berkeley_AI PhD '23 | @Harvard '20
Horace He @cHHillee
42K Followers 536 Following @thinkymachines Formerly @PyTorch "My learning style is Horace twitter threads" - @typedfemale
❄️Andrew Zhao❄️... @_AndrewZhao
4K Followers 3K Following PhD @Tsinghua_Uni. Absolute Zero,ExpeL,Diver-CT Ex. intern@MSFTResearch,@ BIGAI. Interested in RL, Reasoning/Safety 4 LLMs, Agents. On industry job market 2026
Zhe Zeng @zhezeng0908
863 Followers 718 Following Assist. Prof. @CS_UVA | Faculty fellow @NYU_Courant | CS Ph.D @UCLA | Neurosymbolic AI, Probabilistic ML, Constraints, AI4Science | https://t.co/pZJZxyzrio
Polina Kirichenko @polkirichenko
4K Followers 1K Following Research Scientist at FAIR @AIatMeta & visiting researcher at Princeton @VisualAILab prev. PhD at New York University 🇺🇦
Meihua Dang @meihuadang
843 Followers 172 Following Ph.D. student @StanfordAILab | Previous M.S. student @UCLA StarAI Lab
Prof. Anima Anandkuma... @AnimaAnandkumar
34K Followers 2K Following Godmother of AI+Science, Bren Professor @caltech, Time100, Fmr Sr Director of #AI research @nvidia Fmr Principal Scientist @awscloud
Engineering at Meta @fb_engineering
282K Followers 198 Following Engineering at Meta is a technical news resource for engineers interested in how we solve large-scale technical challenges at Meta.
Kaichao You @KaichaoYou
4K Followers 134 Following phd student in tsinghua university, working on @vllm_project
Hugo Larochelle @hugo_larochelle
121K Followers 640 Following Mila Scientific Director. Ex @Google DeepMind & Twitter Cortex. Father of 4.
Sebastian Ruder @ ACL @seb_ruder
93K Followers 1K Following Research Scientist @AIatMeta • Ex @Cohere @GoogleDeepMind
Chip Huyen @chipro
120K Followers 613 Following AI Engineering: https://t.co/94dv4uTU1H Designing ML Sys: https://t.co/G81hL2dWmr Entanglements: https://t.co/W27aXeiySY @aisysbooks
Peking University @PKU1898
679K Followers 209 Following Peking University was established in 1898. This account is dedicated to #PekingUniversity’s global outreach and communications.
Peiyi Wang @sybilhyz
11K Followers 301 Following PhD @PKU1898; Researcher @deepseek_ai; Recent: DeepSeek-R1/CoderV2/Math/V1/V2/V3, Mathshepherd, FairEval, Speculative Decoding.
llamafile @llamafile
792 Followers 17 Following llamafile is the easiest and fastest way to run open LLMs on your own computer. An open source project from Mozilla. https://t.co/wkakXoAxgD
Hadi Salman @hadisalmanX
7K Followers 361 Following Research Scientist @OpenAI Previously: PhD @MIT @MSFTResearch @UberATG @SCSatCMU @AUB_Lebanon
Zico Kolter @zicokolter
24K Followers 688 Following Professor and Head of Machine Learning Department at @CarnegieMellon. Board member @OpenAI and @Qualcomm. Chief Technical Advisor @GraySwanAI.
Shuchao Bi @shuchaobi
13K Followers 690 Following Research @Meta Superintelligence Labs, RL/post-training/agents; Previously Research @OpenAI on multimodal and RL; Opinions are my own.
Yunyao Li @yunyao_li
5K Followers 717 Following Bring GenAI and Knowledge Graph to enterprise systems. | Director of ML @Adobe Experience Platform | Previously @Apple @IBMResearch. Tweets are all mine.
Alessandro Palmarini @abpalmarini
2K Followers 214 Following AI @ndea. Previously @sfiscience. Ex-@vainglory pro for @teamsecret & @Fnatic. Three time European Champion.
Stephen McAleer @McaleerStephen
12K Followers 999 Following Researching scalable AI safety at Anthropic
Rose Yu @yuqirose
9K Followers 581 Following Machine Learning Prof @UCSanDiego, Scholar @amazon, Previously @google, @Northeastern, @Caltech, @USC, #Physics-Guided #AI, MIT TR-35 Innovator.
Zeyuan Allen-Zhu, Sc.... @ZeyuanAllenZhu
21K Followers 460 Following physics of language models @ Meta (FAIR, not GenAI, not TBD) 🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS 🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
Alex Tong @AlexanderTong7
3K Followers 498 Following PI at Aithyra making models for cells and proteins.
Krisztina Sinkovics @KristaSinkovics
3K Followers 198 Following AI Researcher @Cambridge_Uni | Generative models, controllable generation | 7 years of industry ML experience
Matteo Gallici @MatteoGallici
322 Followers 95 Following PhD Student UPC Barcelona - Reinforcement Learning
EXO Labs @exolabs
37K Followers 2 Following AI on any device. 12 Days of EXO: https://t.co/VMrJ6Vi4h3 We're hiring: https://t.co/BzEO8ZCvBV
Qdrant @qdrant_engine
12K Followers 110 Following High-performance Rust-based vector search engine. https://t.co/362gvLXHcw
NWS Baltimore-Washing... @NWS_BaltWash
78K Followers 283 Following Official X Account for National Weather Service Baltimore/Washington. For NWS Posting Policy, click here: https://t.co/TsuJRpKxOT
Imbue @imbue_ai
7K Followers 15 Following We're rekindling the dream of personal computing by making reliable software creation accessible to all. Join us: https://t.co/9UF0rR6YUr
Dragonfly @dragonflydbio
3K Followers 100 Following Dragonfly is a drop-in Redis replacement, delivering 25X better performance at 80% lower cost. Host it yourself for free or pay per GB for Dragonfly Cloud.
Rohan Paul @rohanpaul_ai
96K Followers 8K Following Compiling in real-time, the race towards AGI. The Largest Show on X for AI. 🗞️ Get my daily AI analysis newsletter to your email 👉 https://t.co/6LBxO8215l
Aran Komatsuzaki @arankomatsuzaki
145K Followers 302 Following Looking for a cofounder. Sharing AI research. Early work on AI (GPT-J, LAION, scaling, MoE). Ex ML PhD (GT) & Google.
N M Anoop Krishnan @anoopnm007
1K Followers 851 Following Decoding materials through data- and physics-driven modeling | Civil and AI/ML @iitdelhi | Classical musician
Hyung Won Chung @hwchung27
38K Followers 301 Following AI Research Scientist @Meta Superintelligence Labs. Past: @OpenAI / @Google Brain / PhD @MIT
Liliang Ren @liliang_ren
4K Followers 581 Following Senior Researcher at Microsoft GenAI | UIUC CS PhD graduate | Efficient LLM | NLP | Former Intern @MSFTResearch @Azure @AmazonScience
clem 🤗 @ClementDelangue
156K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
Jack Morris @jxmnop
46K Followers 988 Following research @cornell // language models, information theory, science of AI