Zaid Khan @codezakh
@uncnlp with @mohitban47 working on grounded reasoning + multimodal agents // currently @allen_ai formerly @neclabsamerica // bs+ms CompE @northeastern zaidkhan.me Boston, USA Joined June 2023-
Tweets443
-
Followers553
-
Following862
-
Likes1K
🚨 Excited to share new work on LLMs and loopholes, accepted to #EMNLP2025 main! When models are faced with conflicting goals and ambiguous instructions that would let them exploit a loophole, many of the strongest models (Qwen, GPT4o, Claude, Gemini) do. This is a new risk and…
🎉 Excited to share that our Video-Skill-CoT paper has been accepted to #EMNLP2025 Findings! Video-Skill-CoT is a domain-adaptive video reasoning framework that automatically constructs skill-aware Chain-of-Thought (CoT) supervisions. It builds a shared skill taxonomy from…
🎉 Excited to share that our Video-Skill-CoT paper has been accepted to #EMNLP2025 Findings! Video-Skill-CoT is a domain-adaptive video reasoning framework that automatically constructs skill-aware Chain-of-Thought (CoT) supervisions. It builds a shared skill taxonomy from…
🎉Excited to share that our MEXA paper is accepted to #EMNLP2025 Findings! 🚀MEXA is a general, training-free multimodal reasoning framework that dynamically selects and aggregates experts/skills for deep, free-form reasoning, and is flexible & extensible to new…
🎉Excited to share that our MEXA paper is accepted to #EMNLP2025 Findings! 🚀MEXA is a general, training-free multimodal reasoning framework that dynamically selects and aggregates experts/skills for deep, free-form reasoning, and is flexible & extensible to new…
🎉 RACCooN got accepted at #EMNLP2025 Main! 🚀 Our MLLM+Video Diffusion (Video-to-Paragraph-to-Video, V2P2V) framework enables effortless video editing w/ auto-generated descriptions, multi-granular pooling & mask planning. RACCooN Achieves +9.4%p human eval & 49.7%↓ FVD,…
🎉 RACCooN got accepted at #EMNLP2025 Main! 🚀 Our MLLM+Video Diffusion (Video-to-Paragraph-to-Video, V2P2V) framework enables effortless video editing w/ auto-generated descriptions, multi-granular pooling & mask planning. RACCooN Achieves +9.4%p human eval & 49.7%↓ FVD,…
Excited to share that MAgICoRe has been accepted to #EMNLP2025 main! 🎉 Our work identifies 3 key challenges in LLM refinement for reasoning: 1) Over-correction on easy problems 2) Fail to localize and fix its own errors 3) Too few refinement iterations for harder problems…
Excited to share that MAgICoRe has been accepted to #EMNLP2025 main! 🎉 Our work identifies 3 key challenges in LLM refinement for reasoning: 1) Over-correction on easy problems 2) Fail to localize and fix its own errors 3) Too few refinement iterations for harder problems…
🎉Our Video-RTS paper has been accepted at #EMNLP2025 Main!! We propose a novel video reasoning approach that combines data-efficient reinforcement learning (GRPO) with video-adaptive test-time scaling, improving reasoning performance while maintaining efficiency on multiple…
🎉Our Video-RTS paper has been accepted at #EMNLP2025 Main!! We propose a novel video reasoning approach that combines data-efficient reinforcement learning (GRPO) with video-adaptive test-time scaling, improving reasoning performance while maintaining efficiency on multiple…
📢 Introducing RotBench, which tests whether SoTA MLLMs (e.g., GPT-5, GPT-4o, o3, Gemini-2.5-pro) can identify the rotation of input images (0°, 90°, 180°, and 270°). Even frontier MLLMs struggle at this spatial reasoning task that humans solve with >98% Acc. ➡️ Models struggle…
🤔 Can we bridge MLLMs and diffusion models more natively and efficiently, by having MLLMs produce patch-level CLIP latents already aligned with their visual encoders, while fully preserving MLLM's visual reasoning capabilities? Introducing Bifrost-1: 🌈 > High-Fidelity…
🚀 I'm recruiting PhD students to join my lab (jaehong31.github.io) at NTU Singapore (@NTUsg), starting Spring 2026. If you're passionate about doing cutting-edge and high-impact research in multimodal AI, Trustworthy AI, continual learning, or video generation/reasoning,…
🚀 I'm recruiting PhD students to join my lab (jaehong31.github.io) at NTU Singapore (@NTUsg), starting Spring 2026. If you're passionate about doing cutting-edge and high-impact research in multimodal AI, Trustworthy AI, continual learning, or video generation/reasoning,…
🚀 We introduce GrAInS, a gradient-based attribution method for inference-time steering (of both LLMs & VLMs). ✅ Works for both LLMs (+13.2% on TruthfulQA) & VLMs (+8.1% win rate on SPA-VL). ✅ Preserves core abilities (<1% drop on MMLU/MMMU). LLMs & VLMs often fail because…
🇦🇹 I’m on my way to #ACL2025 to help present two papers (🧵s below) ➡️ MAT-Steer (07/30 at 11am), our method for steering LLMs w/ multiple attributes (e.g. truthfulness, bias reduction, and toxicity mitigation) simultaneously. ➡️ LAQuer (07/28 at 11am), a new task/framework for…
🇦🇹 I’m on my way to #ACL2025 to help present two papers (🧵s below) ➡️ MAT-Steer (07/30 at 11am), our method for steering LLMs w/ multiple attributes (e.g. truthfulness, bias reduction, and toxicity mitigation) simultaneously. ➡️ LAQuer (07/28 at 11am), a new task/framework for…
🎉 Our paper, GenerationPrograms, which proposes a modular framework for attributable text generation, has been accepted to @COLM_conf! GenerationPrograms produces a program that executes to text, providing an auditable trace of how the text was generated and major gains on…
🎉 Our paper, GenerationPrograms, which proposes a modular framework for attributable text generation, has been accepted to @COLM_conf! GenerationPrograms produces a program that executes to text, providing an auditable trace of how the text was generated and major gains on…
🥳 Gap year update: I'll be joining @allen_ai/@UW for 1 year (Sep2025-Jul2026 -> @JHUCompSci) & looking forward to working with amazing folks there, incl. @RanjayKrishna, @HannaHajishirzi, Ali Farhadi. 🚨 I’ll also be recruiting PhD students for my group at @JHUCompSci for Fall…
🥳 Gap year update: I'll be joining @allen_ai/@UW for 1 year (Sep2025-Jul2026 -> @JHUCompSci) & looking forward to working with amazing folks there, incl. @RanjayKrishna, @HannaHajishirzi, Ali Farhadi. 🚨 I’ll also be recruiting PhD students for my group at @JHUCompSci for Fall…
The MUGen workshop at #ICML2025 is happening now! Stop by for talks on adversarial ML, unlearning as rational belief revision, failure modes in unlearning, robust LLM unlearning, and the bright vs. dark side of forgetting in generative AI!
The MUGen workshop at #ICML2025 is happening now! Stop by for talks on adversarial ML, unlearning as rational belief revision, failure modes in unlearning, robust LLM unlearning, and the bright vs. dark side of forgetting in generative AI!
📢📢📢 Releasing OpenThinker3-1.5B, the top-performing SFT-only model at the 1B scale! 🚀 OpenThinker3-1.5B is a smaller version of our previous 7B model, trained on the same OpenThoughts3-1.2M dataset.
Overdue job update -- I am now: - A Visiting Scientist at @schmidtsciences, supporting AI safety and interpretability - A Visiting Researcher at the Stanford NLP Group, working with @ChrisGPotts I am so grateful I get to keep working in this fascinating and essential area, and…
I’ll be at #ICML2025 this week to present ScPO: 📌 Wednesday, July 16th, 11:00 AM-1:30 PM 📍East Exhibition Hall A-B, E-2404 Stop by or reach out to chat about improving reasoning in LLMs, self-training, or just tips about being on the job market next cycle! 😃
I’ll be at #ICML2025 this week to present ScPO: 📌 Wednesday, July 16th, 11:00 AM-1:30 PM 📍East Exhibition Hall A-B, E-2404 Stop by or reach out to chat about improving reasoning in LLMs, self-training, or just tips about being on the job market next cycle! 😃
🥳 Excited to share our work -- Retrieval-Augmented Generation with Conflicting Evidence -- on addressing conflict in RAG due to ambiguity, misinformation, and noisy/irrelevant evidence has been accepted to @COLM_conf #COLM2025! Our new benchmark RAMDocs proves challenging for…
🥳 Excited to share our work -- Retrieval-Augmented Generation with Conflicting Evidence -- on addressing conflict in RAG due to ambiguity, misinformation, and noisy/irrelevant evidence has been accepted to @COLM_conf #COLM2025! Our new benchmark RAMDocs proves challenging for…
🚨Introducing Video-RTS: Resource-Efficient RL for Video Reasoning with Adaptive Video TTS! While RL-based video reasoning with LLMs has advanced, the reliance on large-scale SFT with extensive video data and long CoT annotations remains a major bottleneck. Video-RTS tackles…

GemmaSalome @nhZ6592cg145I6
1 Followers 86 Following
DeborahConrad @5nSaGu8q4J86F5
2 Followers 218 Following
Connor Treacy @theconnortreacy
8K Followers 12K Following
SweetLaceyCupcake @Tlisa0471146
13 Followers 1K Following "Compassion is the heart of healing, and science is the tool."
Xorwor @Xorwor0920
1 Followers 188 Following Focused on investing in U.S. stocks, happy to discuss stock market trends.
Maarwuf @Maarwuf998
16 Followers 1K Following
Eleanor @7kOonhzEU239P
22 Followers 1K Following
Sylvia @0O1YDBmZ34CWzW
9 Followers 832 Following
Thea @RlulG981xyRt5
10 Followers 541 Following No one can make you feel inferior without your consent.
Tianyi Niu @niu_tianyi
29 Followers 173 Following MS Computer Science at @UNC, @unccs | Research Assistant @ MURGe-Lab w/ @mohitban47. Previously BS Comp. Sci. & BA Linguistics at @UNC.
AI Native Foundation @AINativeF
4K Followers 4K Following Non-profit Org., Empowering Humanity with Ethical AI, Latest insights about AI Native. 🤝 Community: https://t.co/b1mRBfQYi5
Anchal @aaanchh
3 Followers 109 Following Research papers | half-baked theories | fully caffeinated.
Bridget @afU6MdF3DQJQf
31 Followers 1K Following
Teawpkor @Teawpkor6559
34 Followers 1K Following
Noah Ziems @NoahZiems
1K Followers 1K Following Visiting Researcher @MIT_CSAIL. PhD student @NotreDame advised by @Meng_CS. Creator of Arbor RL library for @DSPyOSS
sunil kumar @__sunil_kumar_
2K Followers 560 Following ml research and eng @groundlightai ex. @meta @harveymudd
Nikolai Rozanov @ai_nikolai
116 Followers 365 Following CS PhD in LLM Agents & Reasoning @ImperialCollege || ex tech-founder || LLMs, Agent AI, NLP, RL. #NLProc
Igor Kan @1gor_kan
5K Followers 7K Following ⊣|─ math, physics, stats, ai @UofT ─⊗─ building @lesenheit @styles_lab ─▭─ https://t.co/EuZcbXZ0wZ _/ _ philosophy, ancient near east lit.,history ⏚ :wq && exit
Jenna Russell @jennajrussell
308 Followers 240 Following CS PhD Student @umdcs @ClipUmd, undergrad @CornellCIS
Joshua Ong @joshuaongg21
77 Followers 164 Following Visiting Researcher @EdinburghNLP | PhD Student @imperialcollege LLM Reasoning | Autoformalisation | Neurosymbolic AI
Mehul Damani @MehulDamani2
585 Followers 401 Following PhD-ing @mit @MIT_CSAIL | language models, reinforcement learning
Teiener @Teiener92254
32 Followers 963 Following
Youngmin Oh @OhYoungmin41460
1 Followers 210 Following
Gautier Hamon @hamongautier
280 Followers 686 Following PhD student at INRIA Flowers team @flowersInria. MVA master
Yukyung Lee @yukyunglee_
159 Followers 159 Following Postdoc at Boston University 🇺🇸 | PhD at Korea University 🇰🇷 | #nlproc | Prev: intern at NAVER, HUFS
Cary Smitham @cary_smith59495
70 Followers 4K Following
Ekdeep Singh @EkdeepL
2K Followers 1K Following Member of Technical Staff @GoodfireAI; Previously: Postdoc / PhD at Center for Brain Science, Harvard and University of Michigan
William Yijiang Li @Williamiumli
119 Followers 380 Following Ph.D. student @UCSanDiego, M.S. in CS @JohnsHopkins
Hokin Deng @DengHokin
481 Followers 430 Following prev neuroscientist @Harvard @JohnsHopkins | philosopher @GrowAiLikeChild | Founding member of technical staff @MyolabAI
renAI (Human-Centric ... @renAI_Lab101
38 Followers 114 Following A group of researchers at @HKUST working on fun stuff that matters, with 6 main+finding papers to be presented at ACL’25. Follow for latest updates :)
Ausalju @Ausalju601945
34 Followers 2K Following
Maud @m_meade93
225 Followers 3K Following
meh @theXmaverick
26 Followers 259 Following highly prone to sink in Deep Learning Rabbit holes. | XAI | Dissecting LLMs @iiscbangalore | Senior Engg. undergrad @iitjodhpur
Vonbau @Vonbau7643523
44 Followers 2K Following
Huy Le @huile1611
71 Followers 2K Following Working on generalizing and optimizing foundation multimodal models 👀✍️🤖🌍 @Mila_Quebec & @UMontrealDIRO
Eunkyu Eunice Park @uunicee_
129 Followers 390 Following current Ph.D. Candidate at @SeoulNatlUni @SNUVL current Visiting Researcher at @cmuhcii prev @columbia @CUSEAS
souvik @batikbabu
13 Followers 751 Following
Lifan Yuan @lifan__yuan
2K Followers 137 Following PhD student @uiuc_nlp @GoogleDeepMind. Prev: @TsinghuaNLP
Jiaqi Liu @JiaqiLiu835914
25 Followers 165 Following CS PhD student @UNC @unccs |VLM, RL, Agent, Embodied Intelligence
Tim Dettmers @Tim_Dettmers
38K Followers 991 Following Creator of bitsandbytes.Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
Tianqi Chen @tqchenml
18K Followers 1K Following AssistProf @CarnegieMellon. Distinguished Eng @NVIDIA. Creator of @XGBoostProject, @ApacheTVM. Member https://t.co/QYyfjQNp4p, @TheASF. Views are on my own
Hallvard Holte @HallvardHolte
1K Followers 5K Following MA Classics Oxford '17 🏛️ MSc Business Analytics NHH '21 📈
Jiawei Zhao @jiawzhao
3K Followers 242 Following Research Scientist at Meta FAIR @AIatMeta, PhD @Caltech, GaLore, DeepConf
Dynamics Lab @DynamicsLab_AI
5K Followers 166 Following An applied research company building at the frontier of AI
Samuel Schmidgall @SRSchmidgall
3K Followers 483 Following Research Scientist @GoogleDeepmind // PhD @JohnsHopkins
Tianyi Niu @niu_tianyi
29 Followers 173 Following MS Computer Science at @UNC, @unccs | Research Assistant @ MURGe-Lab w/ @mohitban47. Previously BS Comp. Sci. & BA Linguistics at @UNC.
Prophet Arena @ProphetArena
2K Followers 14 Following The AI benchmark for predictive intelligence, advancing collective foresight via human–AI collaboration, from SIGMA Lab @UChicagoCS @DSI_UChicago
Anuar @_startuphacker
3K Followers 720 Following CTO @ https://t.co/73ZP71gNDB - AI agent for documents | my journey from Kazakhstan's steppes to building AI B2B SaaS
Vaish Shrivastava @VaishShrivas
357 Followers 417 Following Reinforcement Learning @MSFTResearch 🧠 MS CS @Stanford @StanfordNLP @StanfordAILab BS CS @Caltech
❄️Andrew Zhao❄�... @_AndrewZhao
4K Followers 3K Following PhD @Tsinghua_Uni. Absolute Zero,ExpeL,Diver-CT Research Intern @MSFTResearch, Ex. @ BIGAI. Interested in RL, Reasoning/Safety 4 LLMs, Agents. On job market 26'
Tom McCoy @RTomMcCoy
4K Followers 581 Following Assistant professor @YaleLinguistics. Studying computational linguistics, cognitive science, and AI. He/him.
Jason Liu @JasonLiu106968
76 Followers 71 Following
Noah Ziems @NoahZiems
1K Followers 1K Following Visiting Researcher @MIT_CSAIL. PhD student @NotreDame advised by @Meng_CS. Creator of Arbor RL library for @DSPyOSS
Basu Dasgupta @thebdasgupta
453 Followers 86 Following Professor of Theoretical Physics at @TIFRScience X-noob, here for the cats
Binyuan Hui @huybery
34K Followers 649 Following 🥝 Building Qwen @Alibaba_Qwen. Focus on CodeLLM (Pre-training and Post-training) / Reasoning / Agent. Ideas my own.
p(doom) @prob_doom
163 Followers 1 Following
sunil kumar @__sunil_kumar_
2K Followers 560 Following ml research and eng @groundlightai ex. @meta @harveymudd
Georgia Channing @cgeorgiaw
680 Followers 208 Following AI4Science @ 🤗, PhD @OxfordTVG — sharing the world of science
Jon Chu // Khosla Ven... @heyjchu
8K Followers 499 Following Partner @khoslaventures, founder @ Koality (exited), OG @PalantirTech, @Opendoor, @Docker, ML @facebook
Jiayi Weng @Trinkle23897
3K Followers 142 Following MTS @openai, author of the entire post-training RL infra, core contributor of ChatGPT/GPT4/GPT4o etc. 30U30
Marek Rei @MarekRei
2K Followers 266 Following Researcher in #MachineLearning and #NLProc, working on representation learning and language. Lecturer at @imperialcollege, visiting researcher at @Cambridge_Uni
Nikolai Rozanov @ai_nikolai
116 Followers 365 Following CS PhD in LLM Agents & Reasoning @ImperialCollege || ex tech-founder || LLMs, Agent AI, NLP, RL. #NLProc
Pushmeet Kohli @pushmeet
17K Followers 90 Following Computer Scientist, Leading Science and Strategic Initiatives @ Google DeepMind.
Jared Moore @jaredlcm
222 Followers 301 Following @jaredlcm.bsky.social AI Researcher, Writer Stanford
Dan Austin @DanAiTuning
143 Followers 289 Following I get LLMs to do things whilst drinking large amounts of tea 🤓🍵
Igor Kan @1gor_kan
5K Followers 7K Following ⊣|─ math, physics, stats, ai @UofT ─⊗─ building @lesenheit @styles_lab ─▭─ https://t.co/EuZcbXZ0wZ _/ _ philosophy, ancient near east lit.,history ⏚ :wq && exit
Alex Kontorovich @AlexKontorovich
29K Followers 806 Following Mathematician (Distinguished Professor of #Math at @RutgersU). Here to learn about research, education, and community. Let’s build something together.
Stas Bekman @StasBekman
9K Followers 286 Following Toolmaker. Software creator, optimizer and harmonizer. Makes ML systems work and fly @ Snowflake.
Jenna Russell @jennajrussell
308 Followers 240 Following CS PhD Student @umdcs @ClipUmd, undergrad @CornellCIS
Gabriele Berton @gabriberton
6K Followers 1K Following Postdoc @Amazon working on VLM - ex @CarnegieMellon @PoliTOnews @IITalk
Lakshya A Agrawal @LakshyAAAgrawal
2K Followers 2K Following AI PhD @ UC Berkeley | GEPA Creator (https://t.co/EdPqvzj7k4) | Created https://t.co/YxPZsXZJeS | Past: AI4Code Research Fellow @MSFTResearch | Hobbyist Saxophonist
Paul Röttger @paul_rottger
2K Followers 547 Following Postdoc @MilaNLProc, researching LLM safety and societal impacts.
Omar Shaikh @oshaikh13
1K Followers 839 Following member of sociotechnical staff @Stanford - previously @GeorgiaTech
ℏεsam @Hesamation
36K Followers 576 Following ai engineer | rigorously overfitting on a learning curve
X. Dong @SimonXinDong
899 Followers 385 Following Research Scientist@NVIDIA . Making LLMs e.g., Hymba, Nemotron serials. Ex @Harvard @Meta @Tencent| Views and opinions are my own
Joshua Ong @joshuaongg21
77 Followers 164 Following Visiting Researcher @EdinburghNLP | PhD Student @imperialcollege LLM Reasoning | Autoformalisation | Neurosymbolic AI