Andrew Drozdov @mrdrozdov
Research Scientist @ Databricks mrdrozdov.github.io NYC Joined August 2010-
Tweets13K
-
Followers3K
-
Following2K
-
Likes13K
This applies to science and scientific writing. No amount of experiments will make up for a bad story. Don't go do all the work and then try to tell a mediocre story with the results. Michal Irani once told me that she writes the introductions to her papers long before she has…
This applies to science and scientific writing. No amount of experiments will make up for a bad story. Don't go do all the work and then try to tell a mediocre story with the results. Michal Irani once told me that she writes the introductions to her papers long before she has…
Hard to overstate the under-investment into information retrieval. So much energy behind open LLMs. Where are the open Web search engines? Like for the actual Web. I spent a couple of weeks training ColBERTv2 model in 2021. It's still a major one today. Inconceivable in LLMs.
Hard to overstate the under-investment into information retrieval. So much energy behind open LLMs. Where are the open Web search engines? Like for the actual Web. I spent a couple of weeks training ColBERTv2 model in 2021. It's still a major one today. Inconceivable in LLMs. https://t.co/F7FXDjRaC6
@orionweller Was able to get Hits@1 = 0.967 on limit-small using gte-small by cheating :) Rewriting every document into multiple short documents. Before rewriting, Hits@1 = 0.149. Fwiw, think LIMIT could be a nice test bed for doc rewriting! But my approach was a bit extreme.
Orion is the 🐐 of negative results. No capability is safe! This is quite fascinating work that gives insights about the limitations of dense embeddings.
Orion is the 🐐 of negative results. No capability is safe! This is quite fascinating work that gives insights about the limitations of dense embeddings.
Instructions/reasoning are now everywhere in retrieval - we want embeddings to do it all! 🚀 But... is it even possible? 🤔 Turns out, it's not possible for single-vector models 😱 theoretically and empirically! To make it obvious we OSS a simple eval SoTA models flop on! 🧵
Interested in building and benchmarking deep research systems? Excited to introduce DeepScholar-Bench, a live benchmark for generative research synthesis, from our team at Stanford and Berkeley! 🏆Live Leaderboard guestrin-lab.github.io/deepscholar-le… 📚 Paper: arxiv.org/abs/2508.20033 🛠️…
@jobergum You’re missing something
Once you train on the test data, it no longer the test data.
tired: The Responsive Web wired: The Responsive Context A lot of pieces of text need versions that make sense for short, medium, and long contexts.
Retrieval Capabilities of Large Language Models Scale with Pretraining FLOPs Databricks demonstrates that retrieval performance on zero-shot BEIR tasks predictably scales with LLM size, training duration, and estimated FLOPs. 📝arxiv.org/abs/2508.17400
GEPA has landed in DSPy 3.0!! 🛠️🧰 I am SUPER EXCITED to publish a new video sharing my experience using GEPA to optimize a Listwise Reranker! 🚀 The main takeaway I hope to share is how to monitor your GEPA optimization run to know if you are on the right track, or need to…
Thrilled that this work was accepted to #EMNLP2025! It presents the first large scale analysis of users collaborating with LLMs in-the-wild! Also thrilled that this kind of NLP+HCI work received generous support (and feedback) from reviewers and landed in the top 15% of papers!
Thrilled that this work was accepted to #EMNLP2025! It presents the first large scale analysis of users collaborating with LLMs in-the-wild! Also thrilled that this kind of NLP+HCI work received generous support (and feedback) from reviewers and landed in the top 15% of papers! https://t.co/gHSGOTwExw
we already have self-driving cars. why not self-folding shirts??
we already have self-driving cars. why not self-folding shirts??
a shirt with a built-in MCP server that exposes the "fold" API
My team (agent bricks) is hiring! We are building the future of agentic workflows on enterprise data at scale—I’m talking AI to process billions of invoices, call transcripts, patient notes, 10Ks & M&A deals, nursing license exam questions, synthetic model training data, movie…
My team (agent bricks) is hiring! We are building the future of agentic workflows on enterprise data at scale—I’m talking AI to process billions of invoices, call transcripts, patient notes, 10Ks & M&A deals, nursing license exam questions, synthetic model training data, movie…
Feels like for router-based system eval should incorporate at least a few settings: 1. Oracle: Explore all model settings, and pick the best for each input. 2. Manual Router: For each query, pick the model you think should be used. 3. DIY Router 4. Default Router

(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
Jacob Andreas @jacobandreas
20K Followers 951 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Jim Fan @DrJimFan
325K Followers 3K Following NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Tal Linzen @tallinzen
18K Followers 898 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAI, inventor of the word "bertology"
Najoung Kim 🫠 @najoungkim
3K Followers 521 Following At @BULinguistics, previously @GoogleDeepmind. Human & machine CogSci 🤖🔠🐱
Jason Wei @_jasonwei
98K Followers 634 Following ai researcher @meta superintelligence labs, past: openai, google 🧠
Akari Asai @AkariAsai
18K Followers 867 Following Incoming Assistant Professor @SCSatCMU & research scientist @allen_ai. akariasai @ 🦋
Shaily @shaily99
7K Followers 2K Following PhD @LTIatCMU. Prev: @allen_ai @GoogleAI @MSFTResearch. #NLProc. Often ranting about research.
Nathan Schneider @complingy
5K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.social
Sewon Min @sewon__min
13K Followers 815 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Ofir Press @OfirPress
15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Naomi Saphra @nsaphra
10K Followers 1K Following Waiting on a robot body. All opinions are universal and held by both employers and family. Now a dedicated grok hate account. Accepting ML/NLP PhD students.
Sebastian Gehrmann @sebgehr
6K Followers 2K Following Head of Responsible AI, CTO office, @Bloomberg. (he/him) Formerly LLMs @ Google Brain / Harvard. views my own
Greg Durrett @gregd_nlp
8K Followers 894 Following Associate professor at NYU (Courant CS + Center for Data Science) | advisor for @bespokelabsai | large language models and NLP | he/him
rishi @RishiBommasani
6K Followers 2K Following Societal/economic impacts of AI; AI policy & governance @StanfordHAI Stanford CS PhD w/ @percyliang @jurafsky Cornell CS undergrad w/ @clairecardie
Vishnu Dev Shankar Sh... @Panditsharmaji0
1 Followers 139 Following insta. @vishusharma0915 fb. Pandit pandit pandit ji
S G @SG_CIL
17 Followers 5K Following
Ryan Steubs @ryan_steubs
15 Followers 504 Following Director of AI Alignment @Kwaai | Game Theory, Applied RL, AI Safety | Author @scalingalignment | Building AI that aligns with human values
Zrasir @Zrasir974
43 Followers 861 Following
DIENG Cheikh Ibra @dcheikhibra
59 Followers 2K Following Data & AI @ ENSAE 🤖 | From Dakar to Paris to the world 🌍 | Founder mindset ⚡ | (finance • media • sport • NLP • Crypto ) | Legacy. Growth. Impact.
PollyHelina @d4FbXP918ZHVt
2 Followers 117 Following
The Cosmic Foundry @CosmicFoundry
53 Followers 738 Following A Small Indie Game Dev Team based in South Africa.
Xinyi Wang (Cindy) @cindyxinyiwang
1K Followers 318 Following Research Scientist at Google DeepMind previously: PhD candidate at Language Technologies Institute at CMU
xiaolong dong @Dxl87277Dong
9 Followers 421 Following
. @bidulestruc
297 Followers 6K Following
Tushar Gupta @_tushargupta_
211 Followers 1K Following Parallel AI @p0 prev: Video Ranking/ML Engineer @Google Search, @GoldmanSachs, @IITDelhi. Tweets about tech, economic policy, and India.
SJ @_Shubham_Jha
300 Followers 5K Following I'm an Engineer, so to save time let's just assume I'm always right
Amin Nematollahi @am_neema
2 Followers 81 Following
huduga @zaph0id
655 Followers 4K Following Finding the cadence of life. Hoarder of books, stories and experiences, entrepreneur.
Arsenio Bellingham @l2_norm
76 Followers 1K Following I did data entry for 45 years. Now I’m retired, my new hobby is sitting down.
Fred Bliss @fblissjr
580 Followers 2K Following ◎ fractional cto & ai (👉 DM for inquiries) 🔎 (prev: founder @ Aptitive, acquired 2021; applied ai @ vantage discovery) 📈 20+ years experience midmarket ent
Irene Chen @irenetrampoline
9K Followers 915 Following ML for healthcare and equity. Assistant Professor @UCBerkeley and @UCSF. Prev @Harvard, @MIT, @MSFTResearch
Kat @Kat_Build
34 Followers 812 Following Market Researcher @Amazon AGI SF Lab. Quant for people. Love iced coffee and my quiet Player Piano paranoia.
Zahra Abbasiantaeb @z_abbasiantaeb
143 Followers 345 Following PhD candidate at @uva_amsterdam, Conversational Search and Information Retrieval
Preetham Mysore @a0preetham
2 Followers 83 Following
ibm @i18nbigmouse
0 Followers 2K Following
Ravi @r_mulp
145 Followers 2K Following
Sudhir Gajre @SudhirGajre
106 Followers 276 Following Head of GenAI, working on GenAI enterprise adoption frameworks
Tomer Wolfson @TomerWolfson
226 Followers 264 Following Postdoctoral Fellow, University of Pennsylvania @Penn @upennnlp
Sunil @Sunil678353
6 Followers 437 Following
Saber Darabi @SADarabi
315 Followers 7K Following
light928 @light928
4 Followers 375 Following
Joe Mayo @JoeMayo
16K Followers 7K Following Author and Independent Consultant Recent books: - Programming the Microsoft Bot Framework/MSPress - C# Cookbook/O'Reilly Agents, AI, Generative AI, MCP, RAG
sourabh baligade @SB_1_4
0 Followers 65 Following
Victor Hugo @VictorHugo45995
0 Followers 7K Following
Weawker @Weawker39746
79 Followers 1K Following
Yoshinari Fujinuma @akkikiki
1K Followers 2K Following Member of Technical Staff@Cantina; CS PhD @CUBoulder; Ex-Senior Applied Scientist@AWS AI Labs; Ex-SWE@Amazon JP; 🇹🇭→🇯🇵→🇫🇷→🇯🇵→🇹🇭→🇯🇵→🇺🇸; JA/EN
Fangcong Yin @fangcong_y10593
272 Followers 665 Following CS PhD Student @UTAustin studying NLP. Prev: @CornellCIS
Diersau @Diersau21829
21 Followers 1K Following
V Sriram @VSriram23
125 Followers 4K Following
(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
Yann LeCun @ylecun
949K Followers 764 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
AK @_akhaliq
425K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Jacob Andreas @jacobandreas
20K Followers 951 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Jim Fan @DrJimFan
325K Followers 3K Following NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
Yoav Artzi @yoavartzi
17K Followers 183 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaC and @COLM_conf
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Percy Liang @percyliang
84K Followers 417 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Tal Linzen @tallinzen
18K Followers 898 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAI, inventor of the word "bertology"
Najoung Kim 🫠 @najoungkim
3K Followers 521 Following At @BULinguistics, previously @GoogleDeepmind. Human & machine CogSci 🤖🔠🐱
Graham Neubig @gneubig
40K Followers 708 Following Associate professor @LTIatCMU. Co-founder/chief scientist @allhands_ai. I mostly work on modeling language.
Jason Wei @_jasonwei
98K Followers 634 Following ai researcher @meta superintelligence labs, past: openai, google 🧠
Christopher Manning @chrmanning
151K Followers 228 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋
Akari Asai @AkariAsai
18K Followers 867 Following Incoming Assistant Professor @SCSatCMU & research scientist @allen_ai. akariasai @ 🦋
Shaily @shaily99
7K Followers 2K Following PhD @LTIatCMU. Prev: @allen_ai @GoogleAI @MSFTResearch. #NLProc. Often ranting about research.
Nathan Schneider @complingy
5K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.social
Meng Jiang @Meng_CS
1K Followers 526 Following Frank M. Freimann Collegiate Professor at Notre Dame CSE | Data Mining | NLP | AI
Noah Ziems @NoahZiems
1K Followers 1K Following Visiting Researcher @MIT_CSAIL. PhD student @NotreDame advised by @Meng_CS. Creator of Arbor RL library for @DSPyOSS
Xinyi Wang (Cindy) @cindyxinyiwang
1K Followers 318 Following Research Scientist at Google DeepMind previously: PhD candidate at Language Technologies Institute at CMU
Neha Narula @neha
49K Followers 2K Following I work on scaling systems and platforms for the internet. Director of Digital Currency @medialab, BoD @blocks. PhD from @MIT_CSAIL, formerly @digg, @google.
karl the fog @KarlTheFog
344K Followers 243 Following All that is sunny does not glitter, not all those in the fog are lost.
Glenn Gabe @glenngabe
79K Followers 7K Following SEO Consultant at G-Squared Interactive focused on Google algorithm update recovery, technical SEO audits, and SEO training. Podcast: "SEO From The Front Lines"
Gavin Guo @Zhen4good
563 Followers 466 Following Embodiment @MSL Previously @Apple Siri @MITIBMLab @MIT_CSAIL @BerkeleyPhysics Opinions Are My Own
Ernest Ryu @ErnestRyu
6K Followers 346 Following Professor of Mathematics at UCLA. Interested in deep learning and optimization.
Stephanie Milani @steph_milani
4K Followers 322 Following Incoming Faculty Fellow @NYU_Courant, then Assistant Professor @JHUCompSci. Human-centered reinforcement learning & AI agents
Alex Graveley @alexgraveley
38K Followers 1K Following Co-creator of GitHub Copilot, Dropbox Paper, AI Tinkerers, Hackpad, MobileCoin, Minion AI, etc. Working on @PerplexityComet. Survivor 🎗️
Tomer Wolfson @TomerWolfson
226 Followers 264 Following Postdoctoral Fellow, University of Pennsylvania @Penn @upennnlp
Ari Morcos @arimorcos
7K Followers 2K Following CEO and Co-founder @datologyai working to make it easy for anyone to make the most of their data. Former: RS @AIatMeta (FAIR), RS @DeepMind, PhD @PiN_Harvard.
Maxime Rivest 🧙... @MaximeRivest
4K Followers 780 Following Easy LLM context for all! ✨pip install attachments Inspired by: ggplot2, DSPy, claudette, dplyr, OpenWebUI! Follow for: API design, AI, and Data 🐍CC📜🛠 maker
Jan Luca Scheerer @jlscheerer
9 Followers 35 Following
Fangcong Yin @fangcong_y10593
272 Followers 665 Following CS PhD Student @UTAustin studying NLP. Prev: @CornellCIS
Fuxiao Liu @FuxiaoL
741 Followers 639 Following Research Scientist @Nvidia | CS PhD @UMDCSI, working on LLM, Multimodal Stuff
Laurens van der Maate... @lvdmaaten
4K Followers 2K Following Member of Technical Staff at Anthropic. Ex-Meta. t-SNE. Llama 3. DenseNet. Web-scale weakly supervised vision. CrypTen.
μ @michalwols
840 Followers 2K Following
Ed H. Chi @edchi
13K Followers 4K Following Research VP @ GoogleDeepMind. ex-Lead for LaMDA/Bard. Now focused on personalized reasoning & Astra universal personalized assistants. ACM Fellow.
Lichang Chen @LichangChen2
772 Followers 660 Following Context Engineer & Agents | ex GenAI & Science Unit Intern @GoogleDeepmind | PhD’25 @umdcs and BS @ZJU_China
Lakshya A Agrawal @LakshyAAAgrawal
2K Followers 2K Following AI PhD @ UC Berkeley | GEPA Creator (https://t.co/EdPqvzj7k4) | Created https://t.co/YxPZsXZJeS | Past: AI4Code Research Fellow @MSFTResearch | Hobbyist Saxophonist
Dhravya Shah @DhravyaShah
38K Followers 3K Following 20. Chief builder, Solo Founder, CEO @SupermemoryAI. "extraordinary" @O1Visa. Lifelong learner and serial shipper. contributing to AGI with memory
supermemory @supermemoryai
9K Followers 17 Following Context engine for your LLMs, Personalized for your users. 15k+ total ⭐ on Github, 6 OSS projects. Join the community - https://t.co/ttj0wU4e8z
Prashanth Rao @tech_optimist
2K Followers 2K Following AI engineer working with graphs & LLMs @kuzudb. Blogging about AI @ https://t.co/gLektr01zQ and https://t.co/QT8Lnl6WPB
Shengjia Zhao @shengjia_zhao
52K Followers 230 Following Chief Scientist @ Meta MSL. Formerly MTS @ OpenAI, PhD @ Stanford. I train models. All opinions my own.
Ao Qu @ao_qu18465
63 Followers 151 Following PhD at @MIT @mitidss @medialab | Building self-evolving AI & multisensory agents
Suzanna Sia @suzyahyah
344 Followers 90 Following Embracing existential crisis as technological progress 1st Author @ NeurIPS,AAAI,EMNLPx2,NAACL,EACL. Ideas cheap, execution lacking human and compute cycles
Oleg Zendel @OlegZendel
348 Followers 1K Following Research Fellow @ADMScentre, previously PhD in Comp Sci. @RMITComputing
Yushi Hu @huyushi98
3K Followers 1K Following 🎓PhD student @uwnlp | Prev. @AIatMeta @allen_ai @GoogleAI @UChicago | Building multimodal intelligence
Mandeep Rathee @rathee_mandeep
62 Followers 154 Following PhD @l3s_luh Research Center, Hannover Germany
Iryna Gurevych @IGurevych
1K Followers 57 Following #NLProc professor @CS_TUDarmstadt @TUDarmstadt @mbzuai @INSAITinstitute | Co-Founder @hessian_AI | @ELLISforEurope | @ATHENECenter | @emergen_CITY | @Leopoldina
Ziyi Yang @yzy_ai
447 Followers 213 Following Research Scientist @DbrxMosaicAI | @Stanford Ph.D. | post-train lead for Phi-3/4
Fangzheng Tian @DanielTian97
47 Followers 102 Following A PhD student at University of Glasgow. Working on Information Retrieval and Natural Language Processing.
Luis Ceze @luisceze
4K Followers 2K Following computer architect. marveled by biology. professor @uwcse. ceo @OctoAICloud. venture partner @madronaventures.
Javier Sanz-Cruzado @JavierSanzCruza
307 Followers 298 Following Postdoctoral research associate at @ir_glasgow, at the University of Glasgow
Krisztian Balog @krisztianbalog
2K Followers 280 Following Professor of computer science @UniStavanger, leading @iai_group & Staff research scientist @GoogleDeepMind. Current focus: https://t.co/5JiH909fhx
Anja Reusch @anja_reu
45 Followers 60 Following Postdoc @ Technion, working on Interpretability in Information Retrieval 🔎 and NLProc 💬
Joel Mackenzie @joelmmackenzie
368 Followers 411 Following Lecturer at the University of Queensland. Information Retrieval Research. https://t.co/Aquuj1pPup @UQSchoolEECS @ielabgroup
Richard McCreadie @richardm_
831 Followers 214 Following Lecturer (Univ. of Glasgow): Specialized in real-time search, event detection, summarization, social media, crowdsourcing, big streaming data
Charlie Marsh @charliermarsh
28K Followers 830 Following Building @astral_sh: Ruff, uv, and other high-performance Python tools. Prev: Staff engineer @SpringDiscovery, @KhanAcademy, BSE @PrincetonCS.
Quentin Anthony @QuentinAnthon15
4K Followers 268 Following I make models more efficient. Google Scholar: https://t.co/kzVsAKPLgX