My friends @KaivuHariharan @uzpg_ are building a startup! I think they are highly capable, have good taste, and are motivated by the right reasons. Let them cook 🍳
MATS 9.0 applications are open! Launch your career in AI alignment, governance, and security with our 12-week research program. MATS provides field-leading research mentorship, funding, Berkeley & London offices, housing, and talks/workshops with AI experts.
🚨 New report 🚨
What does the public think about **specific** AI policy proposals? We asked 300 working-class adults in CA, IL, and NY.
zenodo.org/records/165660…
A partner at a prominent law firm told me “AI is now doing work that used to be done by 1st to 3rd year associates. AI can generate a motion in an hour that might take an associate a week. And the work is better. Someone should tell the folks applying to law school right now.”
On IMO P6 (without going into too much detail about our setup), the model "knew" it didn't have a correct solution. The model knowing when it didn't know was one of the early signs of life that made us excited about the underlying research direction!
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
xAI launched Grok 4 without any documentation of their safety testing. This is reckless and breaks with industry best practices followed by other major AI labs.
If xAI is going to be a frontier AI developer, they should act like one. 🧵
Anthropic staff realized they could ask Claude to buy things that weren’t just food & drink.
After someone randomly decided to ask it to order a tungsten cube, Claude ended up with an inventory full of (as it put it) “specialty metal items” that it ended up selling at a loss.
We are excited to announce Trinity, an autoformalization system for verified superintelligence that we have developed at @morph_labs. We have used it to automatically formalize in Lean a classical result of de Bruijn that the abc conjecture is true almost always.…
21K Followers 3K Following
GMU econ PhD student, liberal, aspie, bi. I post interesting papers. Michael Kremer stan. I ❤️ optimal auction design. Spend more on drugs. Open borders now!
2K Followers 911 Following
Hiring: resume to [email protected]
to love math is to see the face of God
Morgan Prize, Rhodes Scholar
Math PhD@Stanford; Neuro@Oxford; Math+Physics@MIT
84 Followers 228 Following
Research @ MATS, CS @ Princeton, working on multi-agent safety, long-context language models, and efficient inference techniques.
110K Followers 3K Following
CPO @OpenAI, BoD @Cisco @nature_org, LTC @USArmyReserve
Prev: President @Planet, Head of Product @Instagram @Twitter
❤️ @elizabeth ultramarathons kids cats math
10K Followers 6 Following
Bringing AI to offensive security by autonomously finding and exploiting web vulnerabilities. Watch XBOW hack things: https://t.co/D5Mco1u8zM
10K Followers 235 Following
Interpretability/Finetuning @AnthropicAI
Previously: Staff ML Engineer @stripe, Wrote BMLPA by @OReillyMedia, Head of AI at @InsightFellows, ML @Zipcar
19K Followers 1K Following
Agents @Meta MSL TBD Lab. Previously posttraining research @OpenAI. Train LLMs to do things: deep research, chatgpt agent, etc. CS PhD @LTIatCMU
18K Followers 1K Following
Hanging out with Claude, improving its behavior, and building tools to support that @AnthropicAI 😁
prev: @open_phil @googlebrain @openai (@microcovid)
49K Followers 9K Following
I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
902 Followers 81 Following
Lead personality and model behavior research @OpenAI;
Previously built the object understanding system and foundation models for self-driving @Waymo