Excited to share ReasonRank—a listwise reranker that reasons before it ranks.
On complex queries, ReasonRank-32B is #1 on the BRIGHT reasoning-intensive leaderboard (Aug 9, 2025; +.6 vs. strong baselines).
Working with Wenhan, @sunweiwei12, etc.
Most AI agents are tested in a bubble. But real ML breakthroughs happen in communities.
We introduce CoMind, an research agent that learns from community knowledge.
📊 CoMind outperforms ~70% human teams in a CVPR 2025 workshop competition.
🧵👇
Can LLM solve PDEs? 🤯
We present CodePDE, a framework that uses LLMs to automatically generate solvers for PDE and outperforms human implementation! 🚀
CodePDE demonstrates the power of inference-time algorithms and scaling for PDE solving. More in 🧵:
#ML4PDE#AI4Science
Thrilled to introduce FrontierCO — a benchmark of real, challenging (some unsolved) combinatorial optimization problems. It’s built to push frontier AI beyond toy tasks and toward advancing the boundaries of human problem-solving!
Paper: arxiv.org/abs/2505.16952
Thrilled to introduce FrontierCO — a benchmark of real, challenging (some unsolved) combinatorial optimization problems. It’s built to push frontier AI beyond toy tasks and toward advancing the boundaries of human problem-solving!
Paper: arxiv.org/abs/2505.16952
🌟Get rid of the evaluation on synthetic toy problems and advance human intelligence like #AlphaEvolve!
🚀 Introducing FrontierCO — our new Machine Learning for Combinatorial Optimization benchmark featuring high-quality NP-hard instances from real-world applications and…
We invite you to explore our latest work on RAG.
We conceptualize RAG as a multi-agent collaboration task, aiming to align and unify the optimization objectives of its various modules with the ultimate goal of generating high-quality answers.
We invite you to explore our latest work on RAG.
We conceptualize RAG as a multi-agent collaboration task, aiming to align and unify the optimization objectives of its various modules with the ultimate goal of generating high-quality answers.
This new approach for RAG modeling and optimization, MMOA-RAG, treats RAG as a multi-agent collaboration task.
It uses MARL to simultaneously optimize multiple modules, aligning their objectives with the final goal of generating high-quality responses.
This new approach for RAG modeling and optimization, MMOA-RAG, treats RAG as a multi-agent collaboration task.
It uses MARL to simultaneously optimize multiple modules, aligning their objectives with the final goal of generating high-quality responses.
Improving RAG through Multi-Agent RL
This work treats RAG as a multi-agent cooperative task to improve answer generation quality.
It models RAG components like query rewriting, document selection, and answer generation as reinforcement learning agents working together toward…
I will be presenting our paper “MAIR: A Massive Benchmark for Evaluating Instructed Retrieval” at #EMNLP2024!
Date: Tuesday, Nov 12
Time: 14:00-15:30
Session 03: Resources and Evaluation 1
Paper: arxiv.org/abs/2410.10127
See you there!
I will be presenting our paper “MAIR: A Massive Benchmark for Evaluating Instructed Retrieval” at #EMNLP2024!
Date: Tuesday, Nov 12
Time: 14:00-15:30
Session 03: Resources and Evaluation 1
Paper: arxiv.org/abs/2410.10127
See you there!
3 Followers 5 FollowingI'm a Phd student at Gaoling School of Artificial Intelligence, Renmin University of China. My research interest includes AI search and LLM4ranking.
188 Followers 172 Following#NLProc PhD student at Seoul National University, on IR & RAG. Previously research Intern at @ Clova AI, Naver Corp, @ Exaone Lab, LG Research.
2K Followers 7K FollowingFounder, Imaginator ai
knowledge discovery 2D navigation TS ML DL recsys econ math incentives mech design finance networks bridges boundaries, Time, 3d type
15K Followers 6K FollowingI build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
1K Followers 1K FollowingLecturer of NLP @SheffieldNLP, previous Research Fellow @UCL, PhD @TerrierTeam Glasgow Uni. Research interests in Conversational AI, RAG and topics of NLP & IR.
48 Followers 293 FollowingPhD student at GSAI, Renmin University of China. Interested in IR, NLP, and LLM. Intern @LabTongyi96898. Previous @JinaAI_ @xiaohongshu
188 Followers 172 Following#NLProc PhD student at Seoul National University, on IR & RAG. Previously research Intern at @ Clova AI, Naver Corp, @ Exaone Lab, LG Research.
3 Followers 5 FollowingI'm a Phd student at Gaoling School of Artificial Intelligence, Renmin University of China. My research interest includes AI search and LLM4ranking.
6K Followers 1K FollowingCo-founder @allhands_ai, building OpenHands | PhD candidate @IllinoisCDS | BS @UMichCSE ('22) | Ex Intern @GoogleAI @Microsoft | Opinions are my own
15K Followers 6K FollowingI build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
48 Followers 293 FollowingPhD student at GSAI, Renmin University of China. Interested in IR, NLP, and LLM. Intern @LabTongyi96898. Previous @JinaAI_ @xiaohongshu
264K Followers 670 FollowingBuilding with AI agents @dair_ai • Prev: Meta AI, Galactica LLM, Elastic, PaperswithCode, PhD • I share insights on how to build with AI Agents ↓
2K Followers 3K FollowingPhD Student @Cambridge_Uni; Visiting @VectorInst; Intern @MSFTResearch
| Prev: @AWS AI Lab | Do not go gentle into that good night 🧗 | https://t.co/MOPcMcPqcc