Shijie Chen @ShijieChen98
PhD student @osunlp Ohio, USA Joined April 2018-
Tweets69
-
Followers217
-
Following217
-
Likes34
Computer Use: Modern Moravec's Paradox A new blog post arguing why computer-use agents may be the biggest opportunity and challenge for AGI. tinyurl.com/computer-use-a… Table of Contents > Moravec’s Paradox > Moravec's Paradox in 2025 > Computer use may be the biggest opportunity…
Remember “Son of Anton” from the Silicon Valley show(@SiliconHBO)? The experimental AI that “efficiently” orders 4,000 lbs of meat while looking for a cheap burger and “fixes” a bug by deleting all the code? It’s starting to look a lot like reality. Even 18 months ago, my own…
Remember “Son of Anton” from the Silicon Valley show(@SiliconHBO)? The experimental AI that “efficiently” orders 4,000 lbs of meat while looking for a cheap burger and “fixes” a bug by deleting all the code? It’s starting to look a lot like reality. Even 18 months ago, my own… https://t.co/XsrYkqEIw0
🔎Agentic search like Deep Research is fundamentally changing web search, but it also brings an evaluation crisis⚠️ Introducing Mind2Web 2: Evaluating Agentic Search with Agents-as-a-Judge - 130 tasks (each requiring avg. 100+ webpages) from 1,000+ hours of expert labor -…
📢 Introducing AutoSDT, a fully automatic pipeline that collects data-driven scientific coding tasks at scale! We use AutoSDT to collect AutoSDT-5K, enabling open co-scientist models that rival GPT-4o on ScienceAgentBench! Thread below ⬇️ (1/n)
📈 Scaling may be hitting a wall in the digital world, but it's only beginning in the biological world! We trained a foundation model on 214M images of ~1M species (50% of named species on Earth 🐨🐠🌻🦠) and found emergent properties capturing hidden regularities in nature. 🧵
🔬 Introducing ChemMCP, the first MCP-compatible toolkit for empowering AI models with advanced chemistry capabilities! In recent years, we’ve seen rising interest in tool-using AI agents across domains. Particularly in scientific domains like chemistry, LLMs alone still fall…
Checkout InsightAgent (ACL'25 main), our latest work on accelerating systematic reviews from taking months to just hours with interactive AI agents! While full automation is handy, human expertise is still a must in many high-stake domains. Different from the regular…
Checkout InsightAgent (ACL'25 main), our latest work on accelerating systematic reviews from taking months to just hours with interactive AI agents! While full automation is handy, human expertise is still a must in many high-stake domains. Different from the regular…
⁉️Can you really trust Computer-Use Agents (CUAs) to control your computer⁉️ Not yet, @AnthropicAI Opus 4 shows an alarming 48% Attack Success Rate against realistic internet injection❗️ Introducing RedTeamCUA: realistic, interactive, and controlled sandbox environments for…
🔧What if your web agent could abstract its experience into programmatic skills—and improve itself autonomously? 🌟 Introducing SkillWeaver: a framework to enable self-improvement through autonomous exploration and constructing an ever-growing library of programmatic skills. 🧠…
LLMs exhibit the Reversal Curse, a basic generalization failure where they struggle to learn reversible factual associations (e.g., "A is B" -> "B is A"). But why? Our new work uncovers that it's a symptom of the long-standing binding problem in AI, and shows that a model design…
🚀 Excited to co-organize the Workshop on Computer Use Agents (CUA) at #ICML2025 in Vancouver! This workshop takes a comprehensive look at computer use agents—covering learning algorithms, orchestration, interfaces, safety, benchmarking, applications, and more. We’re also…
🚀 Excited to co-organize the Workshop on Computer Use Agents (CUA) at #ICML2025 in Vancouver! This workshop takes a comprehensive look at computer use agents—covering learning algorithms, orchestration, interfaces, safety, benchmarking, applications, and more. We’re also…
🔥2025 is the year of agents, but are we there yet?🤔 🤯 "An Illusion of Progress? Assessing the Current State of Web Agents" –– our new study shows that frontier web agents may be far less competent (up to 59%) than previously reported! Why were benchmark numbers inflated? -…
Introducing ✨HippoRAG 2 ✨ 📣 📣 “From RAG to Memory: Non-Parametric Continual Learning for Large Language Models” HippoRAG 2 is a memory framework for LLMs that elevates our brain-inspired HippoRAG system to new levels of performance and robustness. 🔓 Unlocks Memory…
What's actually different between CLIP and DINOv2? CLIP knows what "Brazil" looks like: Rio's skyline, sidewalk patterns, and soccer jerseys. We mapped 24,576 visual features in vision models using sparse autoencoders, revealing surprising differences in what they understand.
🚀Our ScienceAgentBench is covered by @Nature News! With the help of @ShijieChen98 and @YifeiLiPKU, we sampled 20 tasks from ScienceAgentBench to conduct a head-to-head comparison of OpenAI o1 (2024-12-17) and DeepSeek R1. 🔹Performance: Given three attempts, R1 can solve 7 out…
🚀Our ScienceAgentBench is covered by @Nature News! With the help of @ShijieChen98 and @YifeiLiPKU, we sampled 20 tasks from ScienceAgentBench to conduct a head-to-head comparison of OpenAI o1 (2024-12-17) and DeepSeek R1. 🔹Performance: Given three attempts, R1 can solve 7 out…
🎉ScienceAgentBench is accepted at #ICLR2025! 🚀 Ready to step beyond ML R&D? Test your agents on real-world, data-driven R&D tasks across diverse scientific disciplines. 🔬 👇 Resources and previous posts below:
🎉ScienceAgentBench is accepted at #ICLR2025! 🚀 Ready to step beyond ML R&D? Test your agents on real-world, data-driven R&D tasks across diverse scientific disciplines. 🔬 👇 Resources and previous posts below:
Thrilled to announce that our work, In-context Re-ranking, is accepted to #ICLR2025! TL;DR: By simply aggregating attention weights, we turn LLMs into powerful and efficient re-rankers generating a single token. More details below 👇:
Thrilled to announce that our work, In-context Re-ranking, is accepted to #ICLR2025! TL;DR: By simply aggregating attention weights, we turn LLMs into powerful and efficient re-rankers generating a single token. More details below 👇:
🚀ScienceAgentBench evaluation is now containerized! Inspired by SWE-Bench, we leverage Docker for task isolation, enabling multi-threaded execution and slashing evaluation time to under 30 minutes. Plus, evaluate your agents with just one bash command! Great work done by…
With recent advancements like Claude 3.5 Computer Use and Gemini 2.0, the field of GUI Agents is rapidly evolving. 🚀 Excited to introduce GUI Agent Paper List, your go-to repo for the latest in GUI Agent research! 🌟 ✨ Key Features: - 170+ Papers grouped by environments,…
✈️Flying to #NeurIPS2024 tmr! Excited to reconnect with old friends and meet new ones. I co-authored 6 papers at NeurIPS👇. I'm on the faculty job market this year. My work focuses on advancing the reasoning abilities of LLMs across modalities and contexts. Ping me for a chat☕

Kenan Jiang @JiangKenan
49 Followers 304 Following CS PhD student @ 🦅Emory | Undergrad CS+Math @ 🐻Berkeley | Multi-agent LLM; Decision-Making & Planning
MarketPulse @m02apa1nb179229
0 Followers 345 Following 止损是生存,止盈是享受。没 有纪律,就别谈盈利。 👉 Join Telegram: https://t.co/Hbyjbq6QS5 👉 Join WhatsApp:https://t.co/CchhFQpCqp
Eric Villegas @evillegas90
203 Followers 3K Following
Zhehao Zhang @Zhehao_Zhang123
392 Followers 678 Following First-year PhD @osunlp; Prev. Visiting Research Intern @SALT_NLP Research Intern @amazon @adobe @MSFTResearch; NLP&ML #NLProc
wawawa @bianzoubianshuo
8 Followers 511 Following
StBash @StBashSA
362 Followers 1K Following 'Pressure, Vol.1' https://t.co/RbYAnym3XM… All around nice guy.
Sacramento King @SSacrament94313
71 Followers 2K Following
Wenyue Hua @HuaWenyue31539
1K Followers 637 Following senior researcher @ Microsoft Research, AI Frontiers Postdoc @ucsbNLP Ph.D. @RutgersCS KAUST AI Rising Star LLM-based agent, LLM reasoning
Wauqar @Wauqar7543766
28 Followers 1K Following
The 69 Controversies ... @69AIControversy
262 Followers 8K Following The 69 Controversies of AI Adoption | Spreading the Word on AI Adoption | From the author of The Last AI @The_Last_AI @s_m_sohn |5/25/25| https://t.co/eMyARc66RG
Patrick Drake @time8machine
17K Followers 6K Following Neurodivergent physics student with a keen interest in multisensory integration and emergent perception. Exploring research on a proposed ‘sixth sense’. Δ
henry castillo @henrycstllo
78 Followers 613 Following phd @tamu. prev: swe @stripe, bs @utaustin. i want to mechanistically understand models through the lens of training dynamics. 🇵🇪🏳️🌈
huifuhha @alsdnlbsc
0 Followers 244 Following
keesha @KeeshaBrown96
647 Followers 7K Following The wind is free to come and go, and we will meet when we are supposed to meet. If you decide to be brilliant, there is no mountain to block you, and no sea to
russell @russell1101103
69 Followers 4K Following
Ryan Finley @rnfinley
292 Followers 4K Following
Make Money Online @larrykearney1
12K Followers 4K Following Tips And Advice On How To Make Money Online! Replace Your Full Time Income..
Memss @SuperYzet
14 Followers 256 Following
Simrnjeet @simrnjeet5
55 Followers 2K Following ▫️Software Development Instructor ▫️Full Stack Developer
Ahmed El-farra @ahmedelfarraaa
32 Followers 1K Following Software Engineer | MS @ UIUC | Ex SWE - EA Sports Interests: LLMs, Distributed systems, HPC
Cuong Dang @ ACL24 @CaptainCuong
141 Followers 2K Following x-Research Resident @fpt_software, VietNam. Incoming CE PhD Student @virginia_tech. Working on Machine Learning, Fair AI, Explainable AI, Robust AI, NLP
杨洋 @shyangsh
27 Followers 534 Following
King Hong Chuang @KingHongChuang
34 Followers 2K Following
Therlosh @TherloshrXYO0
53 Followers 2K Following
Zheda Mai @Zheda_Mai
1 Followers 88 Following
Negoreslores @NegoresloreskX
26 Followers 1K Following
Catherine Li 🍵 @daikonland
395 Followers 1K Following Synthetic rare cats @AdvexAI ✨| ex-Waymo, ex-Twitter | Cheese, art, memes, and machine learning ✨✨| Views are my own | Random posting
Kevin Marquardt @kevin_marq56168
20 Followers 935 Following
Weijian Qi @weijian_qi
24 Followers 130 Following 2nd year master @osunlp used to train at @HandWavyLab
0xWulf @hexawulf
322 Followers 3K Following Sharing AI insights & study strategies 📚 | Computer Science @ IU International University 🐺 “Wulf” is my real name | Based in Taipei | 📧 [email protected]
Dewther @DewtherwSp
20 Followers 530 Following
xiaoboliang @xiaobolian66449
4 Followers 300 Following
Tanettech @ElvisTanghang
24 Followers 978 Following
Yougang Lyu @yougang_lyu
94 Followers 387 Following PhD student @irlab_amsterdam & @UvA_Amsterdam | Intern @Baidu_Inc | Working on language agents and alignment
Steven (Shaobo) Wang ... @ShaoboWang6
386 Followers 1K Following Ph.D Candidate @sjtu1896, Intern @yaledatascience and @Alibaba_Qwen. Exploring Data-Centric AI on Foundation Models.
Oghenetega Godwin @itztycon
388 Followers 8K Following A backend engineer• Software intern and Product marketer @dextroux (https://t.co/5daIyamICt)• Founder of @issocorp. Computer Science Student @uniben
Weijian Qi @weijian_qi
24 Followers 130 Following 2nd year master @osunlp used to train at @HandWavyLab
Zhehao Zhang @Zhehao_Zhang123
392 Followers 678 Following First-year PhD @osunlp; Prev. Visiting Research Intern @SALT_NLP Research Intern @amazon @adobe @MSFTResearch; NLP&ML #NLProc
机器之心 JIQIZHIX... @jiqizhixin
9K Followers 709 Following China's leading media & information provider for #AI & #MachineLearning
OpenAI Developers @OpenAIDevs
222K Followers 1 Following Updates for developers building with the OpenAI Platform and API • Service status: https://t.co/kZwnwdYqOS • Support: https://t.co/qCi6M5ESZU
Zhiting Hu @ZhitingHu
5K Followers 435 Following Assist. Prof. at UC San Diego; Artificial Intelligence, Machine Learning, Natural Language Processing
Stanford AI Lab @StanfordAILab
211K Followers 332 Following The Stanford Artificial Intelligence Laboratory (SAIL), a leading #AI lab since 1963. ⛵️🤖 Emmy-winning video: https://t.co/lV9smZTC1m
Zhuang Liu @liuzhuang1234
11K Followers 1K Following Assistant Professor @PrincetonCS. researcher in deep learning, vision, models. previously @MetaAI, @UCBerkeley, @Tsinghua_Uni
Barsee 🐶 @heyBarsee
275K Followers 862 Following Daily tweets on the latest AI and Tech developments to stay ahead of the curve | Founder of https://t.co/bpf7Dytcqj
Zhengyao Jiang @zhengyaojiang
4K Followers 417 Following Cofounder & CEO @WecoAI. Automating hill climbing with AI-Driven Exploration (AIDE). PhD in Machine Learning @UCL_DARK. (Zheng=j-uhng, j as in job; yao=y-aoww)
Jiao Sun @sunjiao123sun_
12K Followers 572 Following Senior Research Scientist at Google DeepMind \n\n NLP PhD @ USC, Amazon ML Fellow \n\n ex-{Google Brain, Alexa AI} nlper, IIIS Tsinghua-Ren
DeepSeek @deepseek_ai
973K Followers 0 Following Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Omar Khattab @lateinteraction
24K Followers 3K Following Asst professor @MIT EECS & CSAIL (@nlp_mit). Author of https://t.co/VgyLxl0oa1 and https://t.co/ZZaSzaRaZ7 (@DSPyOSS). Prev: CS PhD @StanfordNLP. Research @Databricks.
Eddie Vendrow @EdwardVendrow
392 Followers 424 Following PhD Student at @MIT_CSAIL making science happen faster. Previously at @StanfordSVL @GoogleAI @nvidia
Xinliang (Frederick) ... @FrederickXZhang
324 Followers 228 Following CS/AI PhD @UMichCSE (@launchnlp/@michigan_AI). Research in #NLProc. Intern @GoogleDeepMind, ex @Adobe & @Bloomberg. UG @OhioStateCSE (@osunlp). From @Hangzhou
Caiming Xiong @CaimingXiong
7K Followers 465 Following SVP, AI Research Lead at @Salesforce | ex-MetaMind (Opinions are personal.)
Huan Wang @huan__wang
2K Followers 2K Following Director @ Salesforce Research. Research Interest: Large Language Model, Action Agent, Reinforcement Learning, Time Series Analytics, Learning Theory.
Andrew White 🐦�... @andrewwhite01
27K Followers 2K Following Head of Sci/cofounder @FutureHouseSF. Prof of chem eng @UofR (on sabbatical). Automating science with AI and robots in biology. Corvid enthusiast
Alex Cheema - e/acc @alexocheema
37K Followers 2K Following Building @exolabs | prev @UniOfOxford We're hiring: https://t.co/UlkApFndnH
Kai-Wei Chang @kaiwei_chang
8K Followers 713 Following Associate Professor @UCLAengineering/@UCLA. Area: #NLProc/#ML/#AI https://t.co/zj1ssZj9ox
Manling Li @ManlingLi_
8K Followers 735 Following Assistant Professor@NU, Amazon Scholar, Postdoc@Stanford, PhD@UIUC #NLP #CV Language+Vision/EmbodiedAI, Reasoning, Planning, Compositionality, Trustworthiness
@emilymbender.bsky.so... @emilymbender
57K Followers 2K Following Prof, Linguistics, UW // Faculty Director, CLMS // she/her // @[email protected] & bsky // rep by @ianbonaparte
The AI Timeline @TheAITimeline
24K Followers 1 Following covering the latest AI & LLM research /// see "highlights" for all previous weekly threads /// building the best AI paper search engine @findmypapersai
Quanquan Gu @QuanquanGu
16K Followers 2K Following Professor @UCLA, Pretraining and Scaling at ByteDance Seed | Recent work: Build AGI | Opinions are my own
Yizhong Wang @yizhongwyz
6K Followers 1K Following Incoming assistant professor @UTCompSci, RS @BytedanceTalk, PhD from @uwcse, formerly @allen_ai @AIatMeta @MSFTResearch
Xin Eric Wang @xwang_lk
18K Followers 1K Following Professor @ UCSB (@ucsantabarbara). Head of Research @SimularAI. Interim Director @ucsbcrml. #Multimodal #Embodied #Agents. AI for Humanity in the long run.
Siru Ouyang @Siru_Ouyang
906 Followers 879 Following CS PhD candidate @IllinoisCDS. Alumni @sjtu1896.
Scott Yih @scottyih
2K Followers 844 Following Research Scientist at Meta Fundamental AI Research (FAIR)
Shunyu Yao @ShunyuYao12
19K Followers 1K Following @OpenAI Language agents (ReAct, Reflexion, Tree of Thoughts, SWE-agent, CoALA) for digital automation (WebShop, SWE-bench, tau-bench)
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Mimansa Jaiswal @MimansaJ
4K Followers 5K Following Currently RS @aiatmeta | LLMs/SLMs | Data, Evals and Agentic System Orchestration
CLS @ChengleiSi
5K Followers 3K Following PhDing @stanfordnlp | teaching language models to do research | real AGI is the friends we made along the way
Zitong Lu 路子童 @ZitongLu
767 Followers 379 Following Vision & NeuroAI; Postdoc w/ @Nancy_Kanwisher & @ev_fedorenko; PhD w/ @juliedgolomb; Author of NeuroRA, EEG2EEG & ReAlnet; 公众号:路同学
Nathan Lambert @natolambert
56K Followers 853 Following Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc Contact via email. Writes @interconnectsai Wrote The RLHF Book Mountain runner
Carlos E. Perez @IntuitMachine
39K Followers 5K Following Quaternion Process Theory, Artificial (Intuition, Fluency, Empathy), Patterns for (Generative, Reason, Agentic) AI, https://t.co/fhXw0zjxXp
Philipp Schmid @_philschmid
45K Followers 1K Following AI Developer Experience @GoogleDeepMind | prev: Tech Lead at @huggingface, AWS ML Hero 🤗 Sharing my own views and AI News 🧑🏻💻 https://t.co/7IosdlNz22
Bill Yuchen Lin @billyuchenlin
23K Followers 3K Following Building Grok @xAI. Affiliate Assistant Prof @UW; Focusing on Grok Code for Macrohard now. Ex: @allen_ai, Google AI, Meta FAIR.
Ruocheng Guo @rguo_asu
1K Followers 1K Following Staff Research Scientist @Intuit Ph.D. from DMML @SCAI_ASU LLM Agents Causal ML Ex-TikTok, CityUHK, MSR, Google X