Zaid Khan @codezakh

@uncnlp with @mohitban47 working on grounded reasoning + multimodal agents // currently @allen_ai formerly @neclabsamerica // bs+ms CompE @northeastern zaidkhan.me Boston, USA Joined June 2023

Tweets

443
Followers

553
Following

862
Likes

1K

Elias Stengel-Eskin @EliasEskin

a week ago

🚨 Excited to share new work on LLMs and loopholes, accepted to #EMNLP2025 main! When models are faced with conflicting goals and ambiguous instructions that would let them exploit a loophole, many of the strongest models (Qwen, GPT4o, Claude, Gemini) do. This is a new risk and…

3 29 98 7K 37

Download Image

Daeun Lee @danadaeun

2 weeks ago

🎉 Excited to share that our Video-Skill-CoT paper has been accepted to #EMNLP2025 Findings! Video-Skill-CoT is a domain-adaptive video reasoning framework that automatically constructs skill-aware Chain-of-Thought (CoT) supervisions. It builds a shared skill taxonomy from…

Daeun Lee @danadaeun

3 months ago

2 32 83 28K 29

Download Video

0 20 83 7K 15

Shoubin Yu @shoubin621

2 weeks ago

🎉Excited to share that our MEXA paper is accepted to #EMNLP2025 Findings! 🚀MEXA is a general, training-free multimodal reasoning framework that dynamically selects and aggregates experts/skills for deep, free-form reasoning, and is flexible & extensible to new…

Shoubin Yu @shoubin621

2 months ago

2 31 70 17K 24

Download Image

0 19 49 4K 5

Jaehong Yoon @jaeh0ng_yoon

2 weeks ago

🎉 RACCooN got accepted at #EMNLP2025 Main! 🚀 Our MLLM+Video Diffusion (Video-to-Paragraph-to-Video, V2P2V) framework enables effortless video editing w/ auto-generated descriptions, multi-granular pooling & mask planning. RACCooN Achieves +9.4%p human eval & 49.7%↓ FVD,…

Jaehong Yoon @jaeh0ng_yoon

a year ago

1 34 85 18K 21

Download Image

1 22 70 6K 14

Justin Chih-Yao Chen @cyjustinchen

2 weeks ago

Excited to share that MAgICoRe has been accepted to #EMNLP2025 main! 🎉 Our work identifies 3 key challenges in LLM refinement for reasoning: 1) Over-correction on easy problems 2) Fail to localize and fix its own errors 3) Too few refinement iterations for harder problems…

Justin Chih-Yao Chen @cyjustinchen

12 months ago

3 77 246 47K 200

Download Image

0 36 98 7K 25

Ziyang Wang @ZiyangW00

2 weeks ago

🎉Our Video-RTS paper has been accepted at #EMNLP2025 Main!! We propose a novel video reasoning approach that combines data-efficient reinforcement learning (GRPO) with video-adaptive test-time scaling, improving reasoning performance while maintaining efficiency on multiple…

Ziyang Wang @ZiyangW00

2 months ago

1 31 42 16K 8

Download Image

1 26 39 3K 2

Jaemin Cho @jmin__cho

2 weeks ago

📢 Introducing RotBench, which tests whether SoTA MLLMs (e.g., GPT-5, GPT-4o, o3, Gemini-2.5-pro) can identify the rotation of input images (0°, 90°, 180°, and 270°). Even frontier MLLMs struggle at this spatial reasoning task that humans solve with >98% Acc. ➡️ Models struggle…

2 37 85 10K 11

Download Image

AK @_akhaliq

4 weeks ago

Bifrost-1 Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

5 35 184 31K 85

Download Image

Han Lin @hanlin_hl

4 weeks ago

🤔 Can we bridge MLLMs and diffusion models more natively and efficiently, by having MLLMs produce patch-level CLIP latents already aligned with their visual encoders, while fully preserving MLLM's visual reasoning capabilities? Introducing Bifrost-1: 🌈 > High-Fidelity…

2 49 133 14K 57

Download Image

Jaehong Yoon @jaeh0ng_yoon

a month ago

🚀 I'm recruiting PhD students to join my lab (jaehong31.github.io) at NTU Singapore (@NTUsg), starting Spring 2026. If you're passionate about doing cutting-edge and high-impact research in multimodal AI, Trustworthy AI, continual learning, or video generation/reasoning,…

Jaehong Yoon @jaeh0ng_yoon

3 months ago

30 31 229 51K 28

Download Image

9 48 241 28K 93

Duy Nguyen @duynguyen772

a month ago

🚀 We introduce GrAInS, a gradient-based attribution method for inference-time steering (of both LLMs & VLMs). ✅ Works for both LLMs (+13.2% on TruthfulQA) & VLMs (+8.1% win rate on SPA-VL). ✅ Preserves core abilities (<1% drop on MMLU/MMMU). LLMs & VLMs often fail because…

2 32 69 12K 25

Download Image

Elias Stengel-Eskin @EliasEskin

a month ago

🇦🇹 I’m on my way to #ACL2025 to help present two papers (🧵s below) ➡️ MAT-Steer (07/30 at 11am), our method for steering LLMs w/ multiple attributes (e.g. truthfulness, bias reduction, and toxicity mitigation) simultaneously. ➡️ LAQuer (07/28 at 11am), a new task/framework for…

Elias Stengel-Eskin @EliasEskin

4 months ago

92 65 453 54K 52

Download Image

2 17 62 5K 3

David Wan @meetdavidwan

2 months ago

🎉 Our paper, GenerationPrograms, which proposes a modular framework for attributable text generation, has been accepted to @COLM_conf! GenerationPrograms produces a program that executes to text, providing an auditable trace of how the text was generated and major gains on…

David Wan @meetdavidwan

3 months ago

6 43 94 16K 31

Download Image

0 25 37 3K 6

Jaemin Cho @jmin__cho

a month ago

🥳 Gap year update: I'll be joining @allen_ai/@UW for 1 year (Sep2025-Jul2026 -> @JHUCompSci) & looking forward to working with amazing folks there, incl. @RanjayKrishna, @HannaHajishirzi, Ali Farhadi. 🚨 I’ll also be recruiting PhD students for my group at @JHUCompSci for Fall…

Jaemin Cho @jmin__cho

4 months ago

65 48 429 74K 36

Download Image

14 25 224 29K 29

Vaidehi Patil @vaidehi_patil_

2 months ago

The MUGen workshop at #ICML2025 is happening now! Stop by for talks on adversarial ML, unlearning as rational belief revision, failure modes in unlearning, robust LLM unlearning, and the bright vs. dark side of forgetting in generative AI!

Vaidehi Patil @vaidehi_patil_

5 months ago

2 19 84 20K 22

Download Image

1 9 29 3K 1

Sedrick Keh @sedrickkeh2

2 months ago

📢📢📢 Releasing OpenThinker3-1.5B, the top-performing SFT-only model at the 1B scale! 🚀 OpenThinker3-1.5B is a smaller version of our previous 7B model, trained on the same OpenThoughts3-1.2M dataset.

1 32 118 12K 31

Download Image

Peter Hase @peterbhase

2 months ago

Overdue job update -- I am now: - A Visiting Scientist at @schmidtsciences, supporting AI safety and interpretability - A Visiting Researcher at the Stanford NLP Group, working with @ChrisGPotts I am so grateful I get to keep working in this fascinating and essential area, and…

15 22 174 16K 20

Archiki Prasad @ArchikiPrasad

2 months ago

I’ll be at #ICML2025 this week to present ScPO: 📌 Wednesday, July 16th, 11:00 AM-1:30 PM 📍East Exhibition Hall A-B, E-2404 Stop by or reach out to chat about improving reasoning in LLMs, self-training, or just tips about being on the job market next cycle! 😃

Jason Weston @jaseweston

10 months ago

1 106 442 105K 335

Download Image

0 18 95 8K 19

Han Wang @HanWang98

2 months ago

🥳 Excited to share our work -- Retrieval-Augmented Generation with Conflicting Evidence -- on addressing conflict in RAG due to ambiguity, misinformation, and noisy/irrelevant evidence has been accepted to @COLM_conf #COLM2025! Our new benchmark RAMDocs proves challenging for…

Han Wang @HanWang98

5 months ago

2 36 69 28K 19

Download Image

0 22 37 3K 3

Ziyang Wang @ZiyangW00

2 months ago

🚨Introducing Video-RTS: Resource-Efficient RL for Video Reasoning with Adaptive Video TTS! While RL-based video reasoning with LLMs has advanced, the reliance on large-scale SFT with extensive video data and long CoT annotations remains a major bottleneck. Video-RTS tackles…