Mechanical Dirk @mechanicaldirk
Principal Engineer at @allen_ai. Engineering Lead of the OLMo project. Seattle, WA. Joined August 2018.
1K Tweets 879 Followers 270 Following 365 Likes
Love you too Cody 😃
The goal of AI coding tools doesn't need to be to write the code for me. That part is easy. AI needs to save me from having to look up documentation every two lines.
thx to all the feedback from OSS community! our olmOCR lead @jakepoznanski shipped a new model fixing lotta issues + some more optimization for better throughput have fun converting PDFs!
Are we just alternating conferences between Vancouver and Vienna now? Because honestly, I'm down.
@Thom_Wolf 6m later "Nobel Prize is actually a poor measure of intelligence. In this paper we show that ..."
Product idea: Notion except every keystroke doesn't feel like I'm SSH'd into a server on Mars.
Our new ICML 2025 oral paper proposes a new unified theory of both Double Descent and Grokking, revealing that both of these deep learning phenomena can be understood as being caused by prime numbers in the network parameters 🤯🤯 🧵[1/8]
The bottleneck in AI isn't just compute - it's access to diverse, high-quality data, much of which is locked away due to privacy, legal, or competitive concerns. What if there was a way to train better models collaboratively, without actually sharing your data? Introducing… https://t.co/mmDhWBp9AZ
🚨 Just announced: OLMo, Molmo & Tülu are now LIVE on the Cirrascale Inference Platform! It’s official, Cirrascale is the first to offer commercial inference endpoints for @ai2’s OLMo, Molmo & Tülu models on our Inference Platform. Our Inference Platform provides a fully open,…
My latest post: The American DeepSeek Project Build fully open models in the US in the next two years to enable a flourishing, global scientific AI ecosystem to balance China's surge in open-source and an alternative to building products ontop of leading closed models.
This project is a perfect model of an OLMo contribution. Well scoped, practical, sound theoretical underpinnings, and @lambdaviking submitted the paper 24h before the deadline 😍. Integrated into the code here: github.com/allenai/OLMo-c…
The #1 question we get is, when will we have an OLMo 1B? We finally do!
In Singapore @iclr_conf - feel free to come by our OLMoE Oral! Meta recently switched from Dense to MoEs for Llama 4 but hasn't released many details on this yet --- We'll explore MoEs vs Dense & other MoE insights!
🔭 Science relies on shared artifacts collected for the common good. 🛰 So we asked: what's missing in open language modeling? 🪐 DataDecide 🌌 charts the cosmos of pretraining—across scales and corpora—at a resolution beyond any public suite of models that has come before.
Reinforcement learning has shown success in eliciting reflection from LLMs, but what if this capability actually manifests earlier in pre-training? We investigated this question and our results are surprising 👇 [1/4]
Today we're unveiling OLMoTrace, a tool that enables everyone to understand the outputs of LLMs by connecting to their training data. We do this on unprecedented scale and in real time: finding matching text between model outputs and 4 trillion training tokens within seconds. ✨
From ‘black box to glass box’: Ai2 (@allen_ai) links AI outputs to training data in breakthrough for transparency geekwire.com/2025/from-blac… via @geekwire
Biggest one yet! Scroll to the bottom of the blog post (allenai.org/blog/olmo2-32B) for a few fun training stories.
people are talking about whether scaling laws are broken or pretraining is saturating. so what does that even mean? consider the loss curves from our recent gemstones paper. as we add larger models, the convex hull doesn’t flatten out on this log-log plot. that's good!
Introducing olmOCR, our open-source tool to extract clean plain text from PDFs! Built for scale, olmOCR handles many document types with high throughput. Run it on your own GPU for free—at over 3000 token/s, equivalent to $190 per million pages, or 1/32 the cost of GPT-4o!

World Wide NDT Instit... @wwndtis
12 Followers 64 Following Take your career to the next level? With NDT + QA/QC & Fire and Safety Course And Get certified with Job Assistence. Join World wide NDT institute Now
degen_bobo 🤖💎 @agostino90
633 Followers 4K Following
Charlie Tang @tang_1c
199 Followers 607 Following AI Researcher, Quant, Founder, and Investor | Previously at D.E. Shaw, Apple Inc, PhD (deep learning) from Univ of Toronto.
Pratik Karki @ai_evals
13 Followers 196 Following Co-Founder at @anthromindinc | Ex-Google AI Engineer | Scalable Oversight For AI Systems
Mátyás Vincze @vinczematyas_
63 Followers 2K Following PhD Student focusing on Cooperative AI @UniTrento_DISI @FBK_research @MobS_FBK https://t.co/Zq2rXcF6nc
Brickroad @TryBrickroad
66K Followers 334 Following The front door to the world's data. A full-stack agentic data procurement and monetization network, built on Irys. Beta is out now.
Nachman Kaul-Seidman @nachmanks331
42 Followers 2K Following
Alice🎉 @soniamendezrome
1K Followers 957 Following Faith, family, freedom Patriot, America First! Identity verified🇺🇸 🇺🇸🇺🇸 🚫No DM🚫No porn🚫No cryptocurrency
Xuheng Li @xuhengli_
956 Followers 2K Following CS PhD candidate @UCLA, supervised by @QuanquanGu | RL, deep learning theory, diffusion model | Previously BSc @PKU1898 | Stargazer
Jon Saad-Falcon @JonSaadFalcon
1K Followers 641 Following AI PhD @hazyresearch @StanfordAILab | Previously @databricks @allen_ai @GeorgiaTech
Teknium (e/λ) @Teknium1
50K Followers 5K Following Cofounder and Head of Post Training @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE
Rishabh Adiga @RishabhAdiga01
53 Followers 283 Following MTS @datologyai | MSCS @UofIllinois | @iitmadras
Peter Chen @peterxichen
3K Followers 2K Following Covariant CEO and Co-Founder. Previously @OpenAI, @UCBerkeley PhD.
Ryan @youngre6310
0 Followers 3 Following
Hanna Hajishirzi @HannaHajishirzi
9K Followers 443 Following Sr. Director of AI at @allen_ai, Prof at @uw_cse, lead OLMo, Tulu
Freeman Lewin @Freeman_Lewin
743 Followers 1K Following Brick layer behind @TryBrickroad Building the future of data licensing.
Hazel Smith @HazelSmith62017
1 Followers 72 Following
Brendan Duke @brendanwduke
6 Followers 64 Following
Bram @BramVanroy
1K Followers 827 Following @ku_leuven @ccl_kuleuven: Creative #NLG 🖋️ @ivdnt: Dutch #NLProc and #LLMs Creator of Dutch LLMs 🤖 Fellow at @huggingface 🤗 Prev. @lt3ugent, @SignON
Corey Lynch @coreylynch
14K Followers 1K Following Director of AI at @figure_robot, building Helix 🧬
bob mannix @bobmannix
27 Followers 70 Following
Ashwinee Panda @PandaAshwinee
3K Followers 723 Following Postdoc of @tomgoldsteincs, PhD @princeton, @Cal alum, currently working on LLMs
Siddharth Betala @SiddharthBetal
229 Followers 3K Following ML @entalpic_ai || 24 || IIT Madras || Prev interns @ UofT, UW || Interested in NLP, Explainability and AI4Science
Otealrar @Otealrar104116
21 Followers 1K Following
Catherine Arnett @linguist_cat
797 Followers 572 Following NLP Researcher @AiEleuther. PhD @UCSanDiego Linguistics. Previously @pleiasfr @EdinburghUni. Interested in multilingual NLP, tokenizers, open science. She/her.
Patrick Da Silva @patrickqdasilva
43 Followers 28 Following Incoming PhD Student at The Ohio State University
Abraham Owodunni @AbrahamOwos
847 Followers 1K Following Exploring multilingual NLP & Speech. PhDing @osunlp. Organizer and member @masakhanenlp, @mrl2024_emnlp, ex: Researcher @Intronhealth
Tom Sheffer @TomSheffer17807
112 Followers 459 Following M.D candidate | Computer Science Master's candidate | @Google Research Software Engineer intern in Neuroscience.
Cody Blakeney @code_star
5K Followers 1K Following Data Dawg @datologyai | Formerly Data Research Lead @DbrxMosaicAI | Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | https://t.co/4G6Jf3at5w
Kevin Farhat @notkevinfarhat
216 Followers 298 Following Research @allen_ai prev @uwcse @truemediadotorg
Juan Gutiérrez @jgnav_
3 Followers 48 Following
Brian Sadler @sadleb
160 Followers 665 Following Coder, Traveler, Outdoorsman. Raised in NH, loving NYC.
Cosmin Negruseri @cosminnegruseri
3K Followers 3K Following founder of rag startup, ex Pinterest Search / Homefeed, https://t.co/0VwMvjB9Xh, Altiscale, Google Ads, Search, Google Code Jam organizer
Özgür Güler @ozgurgulerx
532 Followers 2K Following Algorithmic Resistance — Human Dignity by Design #Ataturk #komorebi
betterest @betterestli
23 Followers 553 Following MS student (2023-2026) 📖 ; Feel free to contact ✉️; sampling_params = {'temperature': 2.0, 'top_p': 1.0} 🤯; I'm a fool who needs a reasoning model🫠
Inna Lin @iwylin
915 Followers 1K Following PhD Student @uwcse @uwnlp | Visiting Researcher @AIatMeta
Varun @varunsaagar_ai
84 Followers 2K Following
Cody Blakeney @code_star
5K Followers 1K Following Data Dawg @datologyai | Formerly Data Research Lead @DbrxMosaicAI | Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | https://t.co/4G6Jf3at5w
Dirk Groeneveld @marvinalone
37 Followers 134 Following
Amanda Bertsch @abertsch72
2K Followers 856 Following PhD student @LTIatCMU / @SCSatCMU, researching long context + decoding | she/her | also @ abertsch on bsky or https://t.co/L4HBUh0R9f or by email (https://t.co/bsHqwIMFPL)
Tim Rocktäschel @_rockt
39K Followers 2K Following Director and Open-Endedness Team Lead @GoogleDeepMind, Professor of AI @AI_UCL, PI @UCL_DARK, Fellow @ELLISforEurope.
Jason Weston @jaseweston
13K Followers 705 Following @MetaAI+NYU. NLP from scratch(Pretrain+FT LLM) 2008, MemNets (pre-Transformer) 2015, DrQA(pre-RAG) 2017, BlenderBot(dialog pre-ChatGPT) 2018+,Self-Reward+ more!
Ashwinee Panda @PandaAshwinee
3K Followers 723 Following Postdoc of @tomgoldsteincs, PhD @princeton, @Cal alum, currently working on LLMs
Alon Albalak @AlbalakAlon
2K Followers 597 Following Open-endedness, Data-centric AI @LilaSciences Previously: RS @synth_labs, PhD @ucsbNLP, Internships @AIatMeta @MSFTResearch All puns are my own
Muru Zhang @zhang_muru
565 Followers 306 Following First-year PhD @nlp_usc | Student Researcher @GoogleDeepmind | bsms @uwcse | Prevs. @togethercompute @AWS
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Percy Liang @percyliang
84K Followers 417 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Pang Wei Koh @PangWeiKoh
4K Followers 920 Following Assistant professor at @uwcse and visiting research scientist at @allen_ai. Formerly @StanfordAILab @GoogleAI @Coursera. 🇸🇬
Cerebras @CerebrasSystems
35K Followers 255 Following The world's fastest AI inference and training. Try the latest open models at: https://t.co/jREGhLI2nj
hailey @hailsalt
25 Followers 27 Following
Michael M. Pieler @MichaelMPieler
375 Followers 2K Following
Dylan Patel @dylan522p
94K Followers 941 Following SemiAnalysis Boutique AI & Semiconductor Research and Consulting DMs are open for consulting, quotes, or to talk shop
Allen School @uwcse
11K Followers 3K Following The Paul G. Allen School of Computer Science & Engineering educates tomorrow's innovators while developing solutions to humanity's greatest challenges.
Antoine Bosselut @ABosselut
4K Followers 612 Following Helping machines make sense of the world. Asst Prof @ICepfl; Before: @stanfordnlp @allen_ai @uwnlp @MSFTResearch #NLProc #AI
Sewon Min @sewon__min
13K Followers 813 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Vaishaal Shankar @Vaishaal
2K Followers 362 Following Member of Technical Staff @ Anthropic Trying to find artificial intelligence. Opinions are my own.
Matt Jordan @rev_bucket
82 Followers 94 Following Playing with robustness in ML, @UTCompSci Serial walker/runner. My feet have more miles than my car
Mike Lewis @ml_perception
8K Followers 242 Following Llama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.
Pete Walsh @epwalsh
116 Followers 157 Following Research Engineer @allen_ai | Python | Rust | Neovim
Hannah Teufel @hannah_teufel
610 Followers 230 Following Research Engineer @GoogleDeepMind | prev @StabilityAI ,@Aleph__Alpha , @UCL_DARK | edu @UCL
Aaron Defazio @aaron_defazio
8K Followers 584 Following Research Scientist at Meta Superintelligence Labs working on optimization algorithms. Fundamental AI Research (FAIR) team
Jon Tow @jonbtow
195 Followers 244 Following
emozilla @theemozilla
7K Followers 1K Following catholic, ai researcher, co-founder/ceo of @NousResearch alignment: whatever the opposite of yudkowsky + bryan johnson is. blessed be God in all his designs.
Teknium (e/λ) @Teknium1
50K Followers 5K Following Cofounder and Head of Post Training @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE
Matthew Finlayson @mattf1n
1K Followers 905 Following PhD at @nlp_usc | Former predoc at @allen_ai on @ai2_aristo | Harvard 2021 CS & Linguistics
Adaptive ML @AdaptiveML
312 Followers 30 Following Evaluate, tune, and serve the best LLMs for your business. If you can measure it, reinforcement learning can optimize it.
François Fleuret @francoisfleuret
45K Followers 487 Following Research Scientist @meta (FAIR), Prof. @Unige_en, co-founder @nc_shape. I like reality.
Delip Rao e/σ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
AI at Meta @AIatMeta
712K Followers 288 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
Databricks Mosaic Res... @DbrxMosaicAI
41K Followers 120 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.
Adept @AdeptAILabs
31K Followers 19 Following Adept has built the most robust and reliable agent tech stack on the market.