We're collaborating with six commercial agents to include them in the next leaderboard update. Meanwhile, GPT-5 Pro, ChatGPT Agent, Gemini 2.5 Pro Deep Think, and Claude Opus 4.1 are all on the way.
And now, a future prediction challenge: Will @grok remain the best? Let’s see……
We're collaborating with six commercial agents to include them in the next leaderboard update. Meanwhile, GPT-5 Pro, ChatGPT Agent, Gemini 2.5 Pro Deep Think, and Claude Opus 4.1 are all on the way.
And now, a future prediction challenge: Will @grok remain the best? Let’s see……
Thank you so much!
YES, we find Grok 4 super cool, and it is really good at harder tiers (Deep Search and Super Agent). Btw, our case study (sec 4.5.1) shows that Grok 4’s search capability is much better!
Thank you so much!
YES, we find Grok 4 super cool, and it is really good at harder tiers (Deep Search and Super Agent). Btw, our case study (sec 4.5.1) shows that Grok 4’s search capability is much better!
Probably the 1ST data-centric LLM copilot, integrating various data-centric tools, supporting complex data processing and modeling of clinical problems🔥🔥
- Paper: Summarizes various data-related issues & corresponding tools, along with detailed case studies in the medical field
Probably the 1ST data-centric LLM copilot, integrating various data-centric tools, supporting complex data processing and modeling of clinical problems🔥🔥
- Paper: Summarizes various data-related issues & corresponding tools, along with detailed case studies in the medical field
364 Followers 200 FollowingHelping 1B ppl learn how to integrate AI into their life & business to save time and make money. Real tools, automations, & insights🚀
205 Followers 648 Followinghttps://t.co/9WWVqWhNXy Maintainer of ModelScope Paper Community and CompassHub, Contributor of HuggingFace, OpenCompass, OpenMMLab and InternLM
43K Followers 3K FollowingWe're in a race. It's not USA vs China but humans and AGIs vs ape power centralization.
@deepseek_ai stan #1, 2023–Deep Time
«C’est la guerre.» ®1
163K Followers 326 FollowingCEO of @abacusai, the world’s first AI super assistant and general-purpose agent, DeepAgent, for enterprises and professionals. ex-GM, AWS and Google
374 Followers 431 FollowingCofounder @ 𝚂𝙾𝙲𝙰𝙸𝙱𝙻𝙴𝚂 → now building Blueprint Hub (Radar)
Think “AI babysitter” for your CRM: spots leaks, installs fixes, no delays.
2K Followers 7K FollowingLLM Arch Assoc Director @Accenture Ph.D. @LTIatCMU. Past @GoogleAI Sharing insights about AI research, LLMs, multimodal AI, coding & tech. 🚀 Views are my own
8K Followers 198 FollowingAssistant Prof at Stanford CS, member of @stanfordnlp and statsml groups; Formerly at Microsoft / postdoc at Stanford CS / Stats.
454 Followers 656 FollowingMy actual focus : practical agentic workflows as in "useful for business"
Some older ones : strategy, neuro, geopol, MLAI, intelligence, etc.
3K Followers 2K FollowingBuilding personal superintelligence @OPPO, previously @AIWaves_inc. Former CS PhD student at ETHZ. Former researcher at ByteDance, Intern at MSRA and PYI at AI2
5K Followers 828 FollowingPostdoc @LTIatCMU. PhD from Ohio State @osunlp. Author of MMMU, MAmmoTH. Training & evaluating foundation models. Opinions are my own.