SoftMax @DataGod_v1
In God We trust all others must bring data and memes San Francisco, CA Joined June 2022-
Tweets561
-
Followers427
-
Following887
-
Likes614
(1/N) How close are we to enabling robots to solve the long-horizon, complex tasks that matter in everyday life? 🚨 We are thrilled to invite you to join the 1st BEHAVIOR Challenge @NeurIPS 2025, submission deadline: 11/15. 🏆 Prizes: 🥇 $1,000 🥈 $500 🥉 $300
New in-depth blog post - "Inside vLLM: Anatomy of a High-Throughput LLM Inference System". Probably the most in depth explanation of how LLM inference engines and vLLM in particular work! Took me a while to get this level of understanding of the codebase and then to write up…
GPT-OSS bug fixes + Flex Attention support is here! 1. Fixed float16 infinite losses (>65504 overflows) 2. SWA=128 Flex default uses 129 tokens (extra 1) 3. Fixed MXFP4 inference swiglu_limit=7.0 not set 4. Sink token moved to index 0 5. FA3 doesn't have attn sink dX Details:…
GPT-OSS bug fixes + Flex Attention support is here! 1. Fixed float16 infinite losses (>65504 overflows) 2. SWA=128 Flex default uses 129 tokens (extra 1) 3. Fixed MXFP4 inference swiglu_limit=7.0 not set 4. Sink token moved to index 0 5. FA3 doesn't have attn sink dX Details:… https://t.co/rUbvvjGW7W
A brilliant essay that really captures the world view that all the players in the game are operating under. 'AGI by 2027 is strikingly plausible' - all it needs is for the trend lines to hold just a little bit longer. Buckle in for wild few years as we find out.
A brilliant essay that really captures the world view that all the players in the game are operating under. 'AGI by 2027 is strikingly plausible' - all it needs is for the trend lines to hold just a little bit longer. Buckle in for wild few years as we find out.
Router wasn't learning at first, we debugged it step-by-step and showed you how despite perfect load balancing, routing can be completely useless. We root caused it and fixed the problem. Papers skip the methodology, but you can find all details in our part 3 of MoE 101 series…
Router wasn't learning at first, we debugged it step-by-step and showed you how despite perfect load balancing, routing can be completely useless. We root caused it and fixed the problem. Papers skip the methodology, but you can find all details in our part 3 of MoE 101 series…
Just wrote a post on my understanding of the statistics behind block sparse attention. My take is that it works by using the "learned similarity gap," which creates a simple SNR formula connecting retrieval quality with model architecture. Read more: guangxuanx.com/blog/block-spa…
i remade tiny-tpu to support both inference and training! we successfully tested our architecture on the classic XOR problem. here's what i learned throughout the process:👇
i remade tiny-tpu to support both inference and training! we successfully tested our architecture on the classic XOR problem. here's what i learned throughout the process:👇 https://t.co/upJoqqH4I0
The Illustrated GPT-OSS New post! A visual tour of the architecture, message formatting, and reasoning of the latest GPT. Link in 🧵
Just published my new article, "Marketplace": my first attempt at efficient GPU training without backprop 😄🎉 I've been considering eliminating backprop for a while. I had an idea, experimented for two weeks, and it worked! Here's how it works:
@MaximeRivest You might find this interesting: arxiv.org/abs/2505.18350
Fastest inference engine for LLMs! LMCache is an LLM serving engine that reduce Time to First Token (TTFT) and increase throughput, especially under long-context scenarios. 100% Open Source
puzzles repo: github.com/srush/Triton-P… answers repo (i recommend understanding from the first few answers and then not looking at latter ones): github.com/SiriusNEO/Trit…
this repo is a good place to start learning about triton kernels, it will take you a day or two to complete these puzzles once you get the hang of it.
Google has released a new open source model... That runs on just 0.5 GB of RAM. Yes. You can fine-tune it for free to make it better than the giant models at your tasks. Quick steps to fine-tune Gemma 3 270M below
I think I found a based Substack on low-level GPU programming by accident. He has some extensive articles on CUDA programming, building LLM inference engines, looking inside GPUs and much more. even the name is cool: "From Scratch". bro.
The perfect weekend article just dropped by @rasbt on gpt-oss architectures. It’s wild to see how fast these architectures have leveled up since GPT-2 in 2019. abs positional emb → RoPE GELU MLP → Swish + SwiGLU one dense feed-forward → MoE Multi-head attention → GQA …
this is the most comprehensive resource i found so far which gathers all the key things to know about Context Engineering. Absolutely beautiful work.
Latest TRL release brings major upgrades for multimodal alignment! We dive into 3 new techniques to improve VLM post-training in our NEW BLOG: 🌋 GRPO 🎞️ GSPO 🐙 MPO ➕ Extra: vLLM integration for online training w/ transformers backend link in the next one

EileenFred @3IWifb0DW7nU7
3 Followers 304 Following
SPAC_Tracker🇺🇸 @Florcoo883832
35 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
Rosie @Wikor882851
29 Followers 1K Following
Yasmin @869i3I8gHN4uxQo
10 Followers 894 Following
Myrtle Haley @MHaley32015
66 Followers 3K Following
Big Wisky (LTW) @JonnyCasino999
777 Followers 7K Following Geronimo was the leader of the last Native American fighting force to captivate the United States. He died a POW, but his soul was never captured. 🇺🇸🏴☠️
neo @stankneo
855 Followers 4K Following Cyberpunk Metamodernism. Aspiring hyperwrangler. Searching for lcm(∞-axia). CS ∪ CogSci ∪ Complex Systems.
DJ Goosen @dj_goosen
322 Followers 226 Following Building & advising companies to win with AI/ML + automation @LevelUpTechHQ. I 💙 LA.
Adaline Olson @AOlson69714
32 Followers 2K Following
Ycoujaf @Ycoujaf137
16 Followers 969 Following
Darshith V @DarshithV25205
75 Followers 191 Following Software Intern - Wabtec Corporation, https://t.co/TZjH2qWgI4 in Software Engineering - RV College, Bangalore
Lelook @Lelook2360
41 Followers 2K Following
Priorjaw @Priorjaw262942
32 Followers 1K Following
bun.bun.🐽 @ds_bun_
19K Followers 6K Following love #pugs, lead data scientist @datafying my tweet = data science, machine learning, ai, deep learning and pug as well.
Anibal Pfannerstill @AnibalPfan83392
79 Followers 3K Following
Srespear @Srespear746
43 Followers 2K Following
Allison Leannon @ALeannon53210
53 Followers 4K Following
Resmoon @ResmoonJnJawjV
28 Followers 594 Following
Thote @ThoteTIkuhj
32 Followers 863 Following
Thuesisto @Thuesisto3Z6
37 Followers 977 Following
👑 Goddess Hunny �... @kreamiebunny89
16 Followers 125 Following I was created to be your everything 🥰 🐷 tribute: $50
Doydee @Doydee4etsEC
22 Followers 745 Following
Thetha @Thetha954748
53 Followers 2K Following
CelestialWitch324 @Srajir2271
5 Followers 174 Following
AgnesHuxley @fH1gQRN4SZS62Kf
81 Followers 2K Following
Loatou @Loatouocx7
15 Followers 282 Following
Stosecr @Stosecrfm_S
34 Followers 4K Following
Thares @TharesHJC
54 Followers 4K Following
Smeighth @SmeighthnSBJzN
97 Followers 4K Following
Tratairsm @TratairsmR7IO9
45 Followers 4K Following
Marynel11 @Marryera61
68 Followers 809 Following
AIformedicine @ai4medicine4
548 Followers 7K Following
Smeatob @smeatob58589
72 Followers 7K Following
Will M @ipadicWillma
1 Followers 167 Following
Shirley @Dnoaner8xGmtV
28 Followers 3K Following
Trairt @TrairtbkIxju
29 Followers 2K Following
Gamethoughs @gamethough9070
70 Followers 7K Following A strong woman is one who is determined to do what others are determined not to do.
Shirley @VoretheuhIOf
38 Followers 3K Following
Tytatteigh @TytatteighdBP
20 Followers 2K Following
Crear @CrearZsinFi
41 Followers 4K Following
Shirley @SoasmarexbMrcj
29 Followers 3K Following
ちゃーめいこ @chameiko196307
47 Followers 4K Following 15. The sun washes your face, the morning breeze brushes your teeth, smile, and cheer yourself up.🍓🍓
FredaScripps @tQ0GnM2nZaPxR
55 Followers 7K Following
Bojan Tunguz @tunguz
252K Followers 8K Following ML ex Nvidia. Creator of @trainxgb. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
Kirk Borne @KirkDBorne
469K Followers 6K Following Advisor to startups. Freelancer. Founder of @LeadershipData. Global Speaker. Top influencer #BigData #DataScience #AI #IoT #ML #B2B. PhD Astrophysics @Caltech
Dan | Machine Learnin... @DanKornas
83K Followers 501 Following End-to-End ML Engineer. Building the best AI learning resource at https://t.co/lC2UKMtRjj. Youtube: https://t.co/pjpX8NvUn5
Charly Wargnier @DataChaz
136K Followers 45K Following Ex @Streamlit @Snowflake Maestro 🪄 • X about AI agents, LLMs, web apps, Python & SEO • My ❤️ is open source • DM for collabs 📩
abhishek @abhi1thakur
93K Followers 962 Following AI and ML, ex-Hugging Face, World's First 4x GM @kaggle, YouTube 100k+: https://t.co/BHnem8fTu5
Wall Street Apes @WallStreetApes
1.1M Followers 31K Following We Are The Resistance. Unfiltered Breaking News | Followed By @elonmusk 𝕏 @joerogan 🎙️ @DonaldjTrumpJr 🇺🇸 @dbongino ⚖️ @RealAlexJones 🪬 @JamesOKeefeIII 🗞️
Rona likes compilers @ronawang
31K Followers 615 Following compiler engineer (please hire me) // @mit math & cs
Stéphane Liem Nguyen @stephliemnguyen
31 Followers 100 Following PhD student in Machine Learning at @UNIGE_en
Daniel Han @danielhanchen
28K Followers 2K Following Building @UnslothAI. Finetune train LLMs faster. LLMs bug hunter. OSS package https://t.co/aRyAAgKOR7. YC S24. Prev ML at NVIDIA. Hyperlearn used by NASA.
Ruiqi Gao @RuiqiGao
9K Followers 779 Following Research scientist @GoogleDeepmind | Generative models. Veo3, Veo2, CAT3D, Imagen Video, etc. | Mom of Mochi.
Lilian Weng @lilianweng
163K Followers 166 Following Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log
Taco Cohen @TacoCohen
27K Followers 3K Following Post-trainologer at FAIR. Into codegen, RL, equivariance, generative models. Spent time at Qualcomm, Scyfer (acquired), UvA, Deepmind, OpenAI.
Lisan al Gaib @scaling01
21K Followers 655 Following lead them to paradise | intelligence is inherently about scaling | be kind to us AGI
Mr John C @Mister_John_C
28 Followers 560 Following
Merty @mertologico11
692 Followers 7K Following growth specialist @appgco | MVP in 10 Days • Fix My Design
Krishna Kumar @krishnakumar_nn
7 Followers 129 Following interested in AI, Education, Economics, Politics, Evolution, ...
Ashutosh Kumar @ashu_1069
678 Followers 893 Following safely auto-driving while stalking physics like Kohli on 99
DJ Goosen @dj_goosen
322 Followers 226 Following Building & advising companies to win with AI/ML + automation @LevelUpTechHQ. I 💙 LA.
Alfredo Canziani @alfcnz
117K Followers 296 Following Musician, math lover, cook, dancer, 🏳️🌈, and an ass prof of Computer Science at New York University
Jim Jimson 🦙 @jimjimson_
1K Followers 4K Following Teacher of Geoffrey Hinton. Founder of AI. Retired geneticist. Software developer. Mechatronics engineer. Man of action.
neo @stankneo
855 Followers 4K Following Cyberpunk Metamodernism. Aspiring hyperwrangler. Searching for lcm(∞-axia). CS ∪ CogSci ∪ Complex Systems.
François Fleuret @francoisfleuret
45K Followers 487 Following Research Scientist @meta (FAIR), Prof. @Unige_en, co-founder @nc_shape. I like reality.
Evan @StockMKTNewz
635K Followers 398 Following Free Stock Market News that is FAST, ACCURATE, CONSISTENT, and RELIABLE | Not Just Stock News | My Daily Stock Market Recap is the link in my bio ⬇️
Mario Souto @mariohsouto
247 Followers 356 Following Building AI for energy @ stealth startup / ex-AWS Energy
Jimmy Apples 🍎/acc @apples_jimmy
59K Followers 2K Following Wagmi. 2025. As featured in Bloomberg. As quoted by Nobel Prize winner Demis Hassabis. As mentioned on the Lex Fridman Podcast💺
jason liu @jxnlco
43K Followers 2K Following independent ai consultant, a16z scout, creator of instructor prev. @stitchfix @meta
Bindu Reddy @bindureddy
163K Followers 326 Following CEO of @abacusai, the world’s first AI super assistant and general-purpose agent, DeepAgent, for enterprises and professionals. ex-GM, AWS and Google
DIRTY DAN @captdirtydan
405 Followers 1K Following Full-time shitposter, part-time shipbuilding enjoyer.
Manzoor Strange @_realmanzoor
176 Followers 621 Following Full-stack dev dropping fire with JS, C++, Python, React & Next.js. Into AI, gaming & apps for good. Wanna build something sick? DM me!
Katsu @Katsuu_9
135 Followers 1K Following
dunce fundz (L4 RTRD ... @DunceFundz
524 Followers 1K Following nevergoon neversell... $L4 thisshithits! 💥 Bloodhounds finance & investing 🩸 GET 5 sol (.25 minimum) for FREE at the link below
nikhil tayal @Alloutnikhil
3K Followers 5K Following I love to build products and services that people want to use. Trying my best to be max useful to the humanity.
John Allan @JohnFAllan
593 Followers 2K Following e/acc + exec search partner + ai startup co-founder
robert mine @imrobertmine
1K Followers 859 Following
Alex Kehr @alexkehr
29K Followers 5K Following ceo, @superlocalmaps (acq by foursquare) • i like maps, design, and making apps (@machineofideas)
Inverse Gary Marcus @InverseMarcus
602 Followers 3K Following Professional Goal-Post mover. Parody account.
flowstate @k_flowstate
4K Followers 686 Following still looking for a C healer ❤️🩹 | ALX grad 🎓 | AI Aficionado 🤖
Citizen Lane @laneshetron
338 Followers 837 Following swe @aws / ☀️ founder / math & econ @columbia ♔ | prev: @wbd, @StockX
Chris Chambless will ... @Lumenbeing
193 Followers 274 Following coincidence theorist debunker, fact checker checker
Tim Ryan @tarproductions
1K Followers 438 Following diet & exercise. entrepreneur & building an app.
GosuCoder @GosuCoder
3K Followers 67 Following Programmer that Loves to Share Things on YouTube especially about AI and new technology! Building https://t.co/mEanyLBqsA
goutham kamath @goutham_kamath
186 Followers 140 Following co-Founder: @voxela_ai Past: @foghorn programmer, PhD AI; https://t.co/fjHFnNPTGX