Incredibly proud of my Applied ML team at Scale AI 🏭
We built a multi-agent autorater that catches 82% of PhD-level reasoning/math/science errors vs. 23% for the most advanced LLMs. When the multi-LLM agents collaborated with expert humans, we improved correctness by 87%.
What…
Incredibly proud of my Applied ML team at Scale AI 🏭
We built a multi-agent autorater that catches 82% of PhD-level reasoning/math/science errors vs. 23% for the most advanced LLMs. When the multi-LLM agents collaborated with expert humans, we improved correctness by 87%.
What…
This is the most important detail from the announcement... shift from retrieval to synthesis. The real unlock isn't finding existing answers, but generating novel connections *across* documents.
This is the most important detail from the announcement... shift from retrieval to synthesis. The real unlock isn't finding existing answers, but generating novel connections *across* documents.
That 'big stuff coming soon!' announcement? Classic market freeze play. While everyone hits pause waiting for the magic, the real move is going all-in on what you actually control: your data, your workflows, your reliability game. Good strategy doesn't wait for permission.
That 'big stuff coming soon!' announcement? Classic market freeze play. While everyone hits pause waiting for the magic, the real move is going all-in on what you actually control: your data, your workflows, your reliability game. Good strategy doesn't wait for permission.
This is a big deal. seeing big model/data improvements actually make it to open-source... unknown how far would that continue from Meta, but will enjoy it while we can 😅
This is a big deal. seeing big model/data improvements actually make it to open-source... unknown how far would that continue from Meta, but will enjoy it while we can 😅
The new critical skill in AI isn't just spotting breakthroughs, but practicing 'dual-diligence'.
1. Technical Diligence (The Reddit Test): Can the claims be scaled? (e.g., skepticism around ASI-Arch's 20M -> 30B param jump).
2. Narrative Diligence (The Twitter Test): Why is it…
13 Followers 129 FollowingProduct Strategy @ Google | CBS MBA | Ex-McKinsey & PE (Blackstone LatAm) | Interested in LLM alignment, RLHF, and applied AI for ads measurement
14K Followers 15K FollowingAustin Powered. Co-founder of OpenStack & OpenInfra Foundation. General Manager of AI & Infrastructure for the Linux Foundation. open source for fun & profit.
4K Followers 2K FollowingResearch Scientist at @Meta Fundamental AI Research (FAIR), New York. Previously: Postdoc @Caltech, PhD @PrincetonCS, Undergrad @Tsinghua_Uni.
2K Followers 5K FollowingResearch Associate at ITRE (NC State). PhD from @UNC @DCRPCarolina. Works on AI/ML for transportation and energy planning. Board member @okfn_np.
59 Followers 100 FollowingMachine Learning Researcher @palantirtech. Solving the world's hardest problems, one epoch at a time. Posts are personal & do not reflect the views of Palantir.
19K Followers 1K Following@OpenAI Language agents (ReAct, Reflexion, Tree of Thoughts, SWE-agent, CoALA) for digital automation (WebShop, SWE-bench, tau-bench)
19K Followers 1K FollowingAgents @Meta MSL TBD Lab. previously posttraining research @OpenAI train LLMs to do things: deep research, chatgpt agent, etc. CS PhD @LTIatCMU
565K Followers 513 FollowingFounder of the world’s most read daily AI newsletter @therundownai. Sharing the latest developments in the world of artificial intelligence.
50K Followers 3K FollowingAI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
38K Followers 992 FollowingCreator of bitsandbytes.Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
42K Followers 109 Following• Center for AI Safety Director
• xAI and Scale AI advisor
• GELU/MMLU/MATH/HLE
• PhD in AI
• Analyzing AI models, companies, policies, and geopolitics
50K Followers 403 Following@AnthropicAI. Prev. @Google Brain/DeepMind, founding team @OpenAI. Computer scientist; inventor of the VAE, Adam optimizer, and other methods. ML PhD.
4.3M Followers 3 FollowingOpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6Lg202
106K Followers 2K FollowingCovering the latest in AI development • ML Eng since 2017 • Building @AlphaSignalAI into the #1 source of news for AI devs → At 250k readers.
45K Followers 64 FollowingStudent of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
121K Followers 639 FollowingMila Scientific Director. Ex @Google DeepMind & Twitter Cortex. Father of 4. // Directeur scientifique à Mila. Ex @Google DeepMind & Twitter Cortex. Père de 4.
63K Followers 2K FollowingResearch Scientist at Google DeepMind (WaveNet, Imagen, Veo). I tweet about deep learning (research + software), music, generative models (personal account).