Shashank Gupta @shashank_bits
Researcher at Ai2 || Work on NLP, LLMs, Reasoning, Agents, AI4Code || Prev: Microsoft AI, Univ. of Illinois (UIUC), Max Planck, IIT-Bombay || @shashanknlp 🟦sky shashankgupta.info Seattle, WA Joined December 2010
Tweets: 604 · Followers: 472 · Following: 1K · Likes: 6K
As part of Asta, our initiative to accelerate science with trustworthy AI agents, we built AstaBench—the first comprehensive benchmark to compare them. ⚖️
Introducing Asta—our bold initiative to accelerate science with trustworthy, capable agents, benchmarks, & developer resources that bring clarity to the landscape of scientific AI + agents. 🧵
Test-time scaling w/ GRPO boosts accuracy, but also adds “filler tokens” increasing length w/o real progress. We present Group Filtered Policy Optimization (GFPO):🧵 1️⃣ Sample more per prompt 2️⃣ Rank by token efficiency (reward ÷ length) 3️⃣ Train on top-k 4️⃣ 🚀 Cut 80% of…
Thinking Less at test-time requires Sampling More at training-time! GFPO, a new, cool, and simple Policy Opt algorithm, is coming to your RL Gym tonite, led by @VaishShrivas and our MSR group: Group Filtered PO (GFPO) trades off training-time with test-time compute, in order…
🚨 We're hiring a #ResearchScientist in #AI for Scientific Discovery at Ai2! Are you passionate about intelligent agents, data-driven discovery, and AI systems that accelerate science? Join us in shaping the future of research. 🧬🧠 Apply now: job-boards.greenhouse.io/thealleninstit…
Introducing SciArena, a platform for benchmarking models across scientific literature tasks. Inspired by Chatbot Arena, SciArena applies a crowdsourced LLM evaluation approach to the scientific domain. 🧵
Today we’re releasing a prototype of Genesys, an autonomous multi-agent LLM discovery system that aims to discover new types of language model architectures. We found Genesys can discover novel architectures competitive with the industry-standard transformer. 🧵
✨New edition of our community-building workshop series!✨ Tomorrow at @CVPR, we invite speakers to share their stories, values, and approaches for navigating a crowded and evolving field, especially for early-career researchers. Cheeky title🤭: How to Stand Out in the…
Excited to announce AlphaEvolve A powerful AI coding agent developed by our team in @GoogleDeepMind that is able to discover impactful new algorithms for important problems in Maths and Computing by combining the creativity of large language models with automated evaluators.
We've just released HealthBench — a new eval for AI systems for health. Developed with 262 physicians who have practiced in 60 countries.
Scientific discovery with LLMs has so much potential yet is underexplored. Our new benchmark **LLM-SRBench** enables rigorous evaluations of equation discovery with LLMs! 🧠Key takeaway: Even SOTA discovery models with strong LLM backbones still fail to discover mathematical…
Excited to release R2E-Gym - 🔥 8.1K executable environments using synthetic data - 🧠 Hybrid verifiers for enhanced inference-time scaling - 📈 51% success-rate on the SWE-Bench Verified - 🤗 Open Source Data + Models + Trajectories 1/
Announcing OLMo 2 32B: the first fully open model to beat GPT 3.5 & GPT-4o mini on a suite of popular, multi-skill benchmarks. Comparable to best open-weight models, but a fraction of training compute. When you have a good recipe, ✨ magical things happen when you scale it up!
Here is Tülu 3 405B 🐫 our open-source post-training model that surpasses the performance of DeepSeek-V3! The last member of the Tülu 3 family demonstrates that our recipe, which includes Reinforcement Learning from Verifiable Rewards (RLVR) scales to 405B - with performance on…
Excited to share a sneak peek into what we've been building at Yutori! What you see below is our trained model and internal prototype — multiple agents running in parallel in the background, completing tasks of varying complexity, relevant information and cues to step in being…
Interested in knowing more about LLMs agents and in contributing to this topic?🚀 📢We're thrilled to announce REALM: The first Workshop for Research on Agent Language Models 🤖 #ACL2025NLP in Vienna 🎻 We have an exciting lineup of speakers 🗓️ Submit your work by *March 1st*
Can AI really help with literature reviews? 🧐 Meet Ai2 ScholarQA, an experimental solution that allows you to ask questions that require multiple scientific papers to answer. It gives more in-depth, detailed, and contextual answers with table comparisons, expandable sections…
Excited to share that LLM-SR will appear at ICLR'25 🥳🎉🇸🇬
How expensive are the best SWE-Bench agents? Do reasoning models outperform language models? Can we trust agent evaluations? 📢 Announcing HAL, a Holistic Agent Leaderboard for evaluating AI agents, with 11 benchmarks, 90+ agents, and many more to come.

Luca Soldaini 🎀 @soldni
11K Followers 1K Following I like tokens! I lead the OLMo data team at @allen_ai w/ @kylelostat. Open source is fun 🤖☕️🍕🏳️🌈 Opinions are sampled from my own stochastic parrot
Swaroop Mishra @Swarooprm7
13K Followers 813 Following MTS @Microsoft AI, Prev: RS @GoogleDeepMind (Gemini). Opinions my own.
Sarah Wiegreffe @sarahwiegreffe
5K Followers 1K Following Research in NLP (mostly LM interpretability & explainability). Assistant prof @umdcs @clipumd Formerly @allen_ai @uwnlp @icatgt @gtcomputing Views my own.
Faeze Brahman @faeze_brh
2K Followers 1K Following Research Scientist @allen_ai | Prev. Postdoc @allen_ai @uw | Ph.D. from UCSC | Former Intern @MSFTResearch , @allen_ai | Researcher in #NLProc, #ML #AI
Yao Fu @Francis_YAO_
20K Followers 2K Following Research Scientist at @GoogleDeepMind I study complex, multimodal, interactive reasoning. Opinions are my own
Matthew Finlayson @mattf1n
1K Followers 905 Following PhD at @nlp_usc | Former predoc at @allen_ai on @ai2_aristo | Harvard 2021 CS & Linguistics
Nouha Dziri @nouhadziri
5K Followers 692 Following Research Scientist @allen_ai, PhD in NLP 🤖 UofA. Ex @GoogleDeepMind @MSFTResearch @MilaQuebec 🚨🚨 NEW BLOG about LLMs reasoning: https://t.co/Ox0iOaqY7e
Weijia Shi @WeijiaShi2
9K Followers 1K Following PhD student @uwnlp @allen_ai | Prev @MetaAI @CS_UCLA | 🏠 https://t.co/Q6Mzg8ow2j
Valentina Pyatkin @valentina__py
3K Followers 1K Following Postdoc at the Allen Institute for AI @allen_ai and @uwnlp
Yizhong Wang @yizhongwyz
6K Followers 1K Following Incoming assistant professor @UTCompSci, RS @BytedanceTalk, PhD from @uwcse, formerly @allen_ai @AIatMeta @MSFTResearch
Harsh Trivedi @harsh3vedi
665 Followers 909 Following 🤖 Building AI agents & interactive environments: 🌍 AppWorld (https://t.co/dIawTLcI7a) #NLProc PhD @stonybrooku. Prev: @allen_ai @CILVRatNYU. On 🦋 same handle.
Shaily @shaily99
7K Followers 2K Following PhD @LTIatCMU. Prev: @allen_ai @GoogleAI @MSFTResearch. #NLProc. Often ranting about research.
ELONMUSKTESLA @elonneurlink
33 Followers 797 Following Live life to the fullest, keep things simple, truthful & filter the noise. I am a long term investor MAGA🚀🇺🇸
Tegan Jegede @jegede_tegan
198 Followers 6K Following Tegan = print( “👨🏾💻passionate engineer ,AI/ML enthusiast , Real Madrid ,arsenal ⚽️ and GSW🏀: ”)
Yqeaxer @Yqeaxer55607
23 Followers 979 Following
Rahel Jhirad @RahelJhirad
2K Followers 7K Following Founder, Imaginator ai knowledge discovery 2D navigation TS ML DL recsys econ math incentives mech design finance networks bridges boundaries, Time, 3d type
Wenceslas @n5809079088339
69 Followers 2K Following self-töt developer; It’s a small world and coincidences abound
Florent Pelsy @PelsyPelsy
17 Followers 363 Following
4am @adososerious
28 Followers 586 Following
Valentina Tardelli @ValentinaT32922
92 Followers 6K Following
New. Robato @NewRobato19707
109 Followers 1K Following
Oleg Zendel @OlegZendel
348 Followers 1K Following Research Fellow @ADMScentre, previously PhD in Comp Sci. @RMITComputing
Gbàdàmósí ... @Muizz_999
527 Followers 5K Following 📍#Partaker of the Inheritance of the saints in Light📍
Dnousawt @DnousawtI8mwX9
105 Followers 1K Following International marriage.Match one to one until you meet the perfect fit.Welcome to inquire via private message.
Matt Ramage @ramagetime
13K Followers 13K Following Optimizing Web & AI Ops for Businesses. Book a call today. 👇
Dara @dara_tourt
13 Followers 8K Following
WebAgentlab @webagentlab
450 Followers 1K Following WebAgentLab is building an open-source community focused on Web Agent and the broader GUI Agent field.
Alex Wettig @_awettig
2K Followers 584 Following PhD @Princeton trying to make sense of language models and their training data; trying to train agents @cursor_ai
Aniruddha Mukherjee @annimukh
536 Followers 2K Following papers @ IEEE, ACM | RL at IISc Bangalore | IIT-M BSc & KIIT
EileenWebster @il4Qh2fLqqJg6Tq
73 Followers 7K Following
Thijs Bergkamp @ThijsBergkamp
83 Followers 7K Following
Surendrabikram Thapa ... @therealthapa
538 Followers 2K Following Faculty @virginia_tech | Figuring out life! All tweets are personal. Repost is not endorsement.
Anirudh Thatipelli @AThatipelli
535 Followers 5K Following PhD-CS @UCFCRCV, MS-CS @UCR_CSE, Former Applied Science Intern at @amazon
Ghazal Khalighinejad @ghazalkhn
307 Followers 428 Following CS PhD @duke_nlp, Intern @Google | Foundation Models & AI4Science | Guest Researcher @SimonsFdn @PolymathicAI prev @AdobeResearch
Raven @1h25sl93nDqCxJ
97 Followers 7K Following
Yakmaz @yYakmazz
328 Followers 6K Following
Agnes @Agnes44523
12 Followers 641 Following "Success is not final, failure is not fatal: It is the courage to continue that counts."
Pratyay Banerjee (ন... @Neilzblaze007
295 Followers 7K Following I live in the shadows, but I watch everything.
Wanru Zhao @Renee42581826
2K Followers 3K Following PhD Student @Cambridge_Uni; Visiting @VectorInst; Intern @MSFTResearch | Prev: @AWS AI Lab | Do not go gentle into that good night 🧗 | https://t.co/MOPcMcPqcc
DeepBrain AI @DeepBrain_ai
22K Followers 22K Following Deepbrain AI services AI technologies such as video and speech synthesis, live chatbots, and more required to create AI Humans. https://t.co/l6BCYy0n8l
Siddharth Betala @SiddharthBetal
229 Followers 3K Following ML @entalpic_ai || 24 || IIT Madras || Prev interns @ UofT, UW || Interested in NLP, Explainability and AI4Science
Luca Weihs @LucaWeihs
1K Followers 273 Following Co-Founder @ Vercept; Prev: research scientist @allen_ai; stat PhD @UW; math undergrad @UCBerkeley.
Shiolez @ShiolezdPZHoHF
10 Followers 641 Following
Shayekh Islam @shayekhbinislam
61 Followers 244 Following Learning about intelligence, one principle at a time
Zhili Feng @zhilifeng
588 Followers 907 Following Research @OpenAI | prev: ML PhD @mldcmu | Research intern: @MSRNE, @Amazon, @BoschGlobal
Diksha Shrivastava @Diksha1713
270 Followers 1K Following Theoretical AI @Lossfunk | Causal Discovery for AI Safety
arion das @ArionDas
831 Followers 7K Following gen ai intern @Techolution_com || research @ aiisc, usc || author @naacl || reviewer @aclmeeting, aia @COLM_conf
Vijay Murari Tiyyala @VijayTiyyala
181 Followers 1K Following @JHUCompSci Research Assistant @jhuclsp @mdredze Research interests: Alignment LLMs, Model Editing, Interpretability. (విజయ్ మురారి)
Yann LeCun @ylecun
949K Followers 764 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
William Wang @WilliamWangNLP
19K Followers 759 Following CEO & Founder, @AlphaDesignAI. We make https://t.co/1LfDYicsF2 I'm also Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS.
AK @_akhaliq
425K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Ai2 @allen_ai
73K Followers 409 Following Breakthrough AI to solve the world's biggest problems. › Join us: https://t.co/MjUpZpKPXJ › Newsletter: https://t.co/k9gGznstwj
Jason Wei @_jasonwei
98K Followers 636 Following ai researcher @meta superintelligence labs, past: openai, google 🧠
Yi Tay @YiTayML
46K Followers 81 Following research scientist @googledeepmind ✨♊, model co-lead/captain of gemini deepthink imo gold medal 🥇, opinions are my own.
Yoav Artzi @yoavartzi
17K Followers 183 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaC and @COLM_conf
Percy Liang @percyliang
84K Followers 417 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
rishi @RishiBommasani
6K Followers 2K Following Societal/economic impacts of AI; AI policy & governance @StanfordHAI Stanford CS PhD w/ @percyliang @jurafsky Cornell CS undergrad w/ @clairecardie
François Chollet @fchollet
572K Followers 813 Following Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.
Luca Soldaini 🎀 @soldni
11K Followers 1K Following I like tokens! I lead the OLMo data team at @allen_ai w/ @kylelostat. Open source is fun 🤖☕️🍕🏳️🌈 Opinions are sampled from my own stochastic parrot
Mark Dredze @mdredze
6K Followers 783 Following John C Malone Professor at @JohnsHopkins @JHUCompSci @jhuclsp @jhumceh; Part time @techatbloomberg (tweets my own) @mdredze.bsky.social🦋
Graham Neubig @gneubig
40K Followers 708 Following Associate professor @LTIatCMU. Co-founder/chief scientist @allhands_ai. I mostly work on modeling language.
Pan Lu @lupantech
6K Followers 1K Following Postdoc @Stanford | PhD @CS_UCLA @uclanlp | Amazon/Bloomberg/Qualcomm Fellows | Ex @Tsinghua_Uni @Microsoft @allen_ai | ML/NLP: AI4Math, AI4Science, LLM, Agents
Naman Goyal @NamanGoyal21
2K Followers 620 Following Research @thinkymachines, previously pretraining LLAMA at GenAI Meta
Alexander Kirillov @_alex_kirillov_
8K Followers 364 Following Multimodality @thinkymachines. Previously: post-training MM lead @openai, research Scientist @facebookai Projects: GPT-4o, Advance Voice Mode, SegmentAnything.
Aditya Grover @adityagrover_
12K Followers 507 Following Co-founder and CTO @_inception_ai. AI Prof @UCLA. Denoising intelligence.
Scaled Cognition @ScaledCognition
467 Followers 13 Following The first AI system designed and trained for agentic applications. Register for early access: https://t.co/26WXpyCnde
Alex Wettig @_awettig
2K Followers 584 Following PhD @Princeton trying to make sense of language models and their training data; trying to train agents @cursor_ai
Jiao Sun @sunjiao123sun_
12K Followers 572 Following Senior Research Scientist at Google DeepMind | NLP PhD @ USC, Amazon ML Fellow | ex-{Google Brain, Alexa AI} nlper, IIIS Tsinghua-Ren
Sayash Kapoor @sayashk
10K Followers 2K Following CS PhD candidate @PrincetonCITP. I tweet about AI agents, AI evals, AI for science. AI as Normal Technology: https://t.co/5amOkqKDf2 Book: https://t.co/DabpkhNrcM
Jitendra MALIK @JitendraMalikCV
5K Followers 1 Following
Kiana Ehsani @ehsanik
4K Followers 596 Following Co-Founder @ Vercept, Ph.D. @uwcse, Interested in computer vision, Agents and AI, Climber on the weekends.
Matt Deitke @mattdeitke
13K Followers 299 Following AI Researcher @ Meta Superintelligence Lab Ph.D. dropout at @uwcse
Shital Shah @sytelus
13K Followers 11K Following Mostly research and code. If universe is an optimizer, what is its loss function? All opinions are my own.
Alexandre Lacoste @alex_lacoste_
1K Followers 456 Following MegaSenior Research Scientist at ServiceNow Research, Former Google. WebAgents, Remote Sensing, Climate Change, Opinions are my own
Yanni Shawn @iMean AI @Yanni_Shawn
357 Followers 96 Following Helping Web Agents become true allies for humans. Product https://t.co/nbPkpUuJk8 Research https://t.co/vv0IxpZtBV
swyx 🇸🇬 @swyx
125K Followers 3K Following achieve ambition with intentionality, intensity, & integrity - @smol_ai - @dxtipshq - @sveltesociety - @aidotengineer - @coding_career - @latentspacepod
Skild AI @SkildAI
7K Followers 0 Following Building general purpose robotic intelligence. Apply https://t.co/UKh2kQYkAV
Luca Weihs @LucaWeihs
1K Followers 273 Following Co-Founder @ Vercept; Prev: research scientist @allen_ai; stat PhD @UW; math undergrad @UCBerkeley.
Orby AI - A Uniphore ... @OrbyAI
348 Followers 59 Following Orby is fundamentally transforming the way enterprise teams perform, giving you the power to delegate tedious tasks to automation.
Peng Qi @qi2peng2
4K Followers 386 Following Research Lead @OrbyAI. Previously: @AWS AI, $JD AI, PhD @stanfordnlp, UG @Tsinghua_Uni. He/him. Opinions my own.
OSU NLP Group @osunlp
2K Followers 137 Following Natural Language Processing Group at The Ohio State University directed by @ysu_nlp @hhsun1 @shocheen
Boyuan Zheng@ICML @boyuan__zheng
773 Followers 803 Following Phd student at @osunlp | Research Intern at AI2 PRIOR @allen_ai | Previous: MS @jhuclsp; Intern @Amazon
DeepSeek @deepseek_ai
973K Followers 0 Following Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Harsh Jhamtani @harsh_jhamtani
558 Followers 404 Following Researcher @Microsoft | NLP / ML PhD from @LTIatCMU | Previously at @UCSanDiego @allen_ai @AdobeResearch @facebookai @iitroorkee | Opinions are my own.
Pavel Izmailov @Pavel_Izmailov
8K Followers 1K Following Researcher @AnthropicAI 🤖 Incoming Assistant Professor @nyuniversity 🏙️ Previously @OpenAI #StopWar 🇺🇦
Shirley Wu @ShirleyYXWu
3K Followers 295 Following CS PhD candidate @Stanford working w/ @jure & @james_y_zou on LLM agents and alignment | Prev USTC, Intern @MSFTResearch, @NUSingapore
Gagan Bansal @bansalg_
3K Followers 496 Following AI research @msftresearch | Built AutoGen @pyautogen | Previously @uwcse, @iitdelhi
Hailey Schoelkopf @haileysch__
5K Followers 1K Following hillclimbing towards generality @anthropicai | prev @AiEleuther | views my own
Ani Kembhavi @anikembhavi
3K Followers 317 Following Director @wayve_ai. Former Director @allen_ai. Molmo, VisualProg, ProcThor, Poliformer, BiDAF, Unified-IO, Objaverse, SPOC, SATLAS.
Erik Schluntz @ErikSchluntz
5K Followers 277 Following Member of Technical Staff at Anthropic Co-founder at @CobaltRobotics Co-founder at Posmetrics (acquired) GoogleX, @SpaceX, @Harvard EE '15, Forbes 30u30 '18
Yifei Wang @yifeiwang77
2K Followers 2K Following Postdoc @MIT_CSAIL. Self-supervised learning. Foundation Models. AI Safety. Prior BS+BA+PhD @PKU1898.
Alex Albert @alexalbert__
97K Followers 637 Following Claude Relations @AnthropicAI. Opinions are my own!
Xiaochuan Li @xiaochuanlee
83 Followers 351 Following Incoming Ph.D. student @LTIatCMU, working with @XiongChenyan. Previously @Tsinghua_Uni Now Intern @Alibaba_Qwen
Alon Albalak @AlbalakAlon
2K Followers 597 Following Open-endedness, Data-centric AI @LilaSciences Previously: RS @synth_labs, PhD @ucsbNLP, Internships @AIatMeta @MSFTResearch All puns are my own
Sihao Chen @soshsihao
953 Followers 541 Following Researcher @ Microsoft #OAR. Learning AI models from experience. Previously: @upennnlp @cogcomp @GoogleAI. Opinions my own.
Evgenii Nikishin @nikishin_evg
4K Followers 938 Following Researcher @OpenAI working on RL & Reasoning. Past: PhD @Mila_Quebec, Intern @GoogleDeepMind #StopWar 🇺🇦
Zaid Khan @codezakh
553 Followers 862 Following @uncnlp with @mohitban47 working on grounded reasoning + multimodal agents // currently @allen_ai formerly @neclabsamerica // bs+ms CompE @northeastern