To push the open source frontier for RL + LLMs, we need scalable, modular environments with real-world complexity, beyond math benchmarks.
Today, we’re releasing *benchmax*.
An open-source framework to build, run, & scale useful RL envs for LLM fine-tuning, with integrations to…
Sharing some cool RL results @manans99 & I just witnessed in reasoning models for science! Our research question:
🤖 LLM performance limits agentic capabilities.
🎓 Humans learn new skills by practicing them.
🔬 Can LLMs learn to reason by solving graduate textbook problems?
my favorite part of this project was showing that we have general ways of spending much more than the limit of ~$20 per problem to continue to improve scores.
it’s incredible that we live in a world where you can trade compute for more intelligence
my favorite part of this project was showing that we have general ways of spending much more than the limit of ~$20 per problem to continue to improve scores.
it’s incredible that we live in a world where you can trade compute for more intelligence
Curious about what I’ve been working on over the last two years?
Join me next Thursday for the launch of Kumo.AI’s groundbreaking declarative ML platform that will transform the world of predictive AI as we know it. 🚀 More at info.kumo.ai/revolutionizin…#AI…
Still can't believe I had the opportunity to present our work on teprotumumab-related adverse events at the European Thyroid Association's conference in Milan last month! Grateful to my mentor @kosslermd for this incredible opportunity!
Kumo team at the @databricks summit! We're proud to partner with the Databricks team to bring ML to the enterprise Lakehouse. Excited to kick off the event with this stellar crew!
We are very excited to announce our annual Graph Learning Workshop on September 28th. You can learn more about the event here: snap.stanford.edu/graphlearning-…
We’d love for you to join us! Register here to reserve your spot:
eventbrite.com/e/stanford-gra…
I write constant notes to myself which mostly feel useless in the moment but sometimes I look back through them and it’s *amazing* to have a record of exactly how you felt at one particular moment in your life. This was... 2018?
Wow, the mental health effects due to the pandemic are extreme , across nearly every way of slicing the population (gender, age, race/ethnicity, income level, education, urban/rural). That right hand column: 👀
Please check in with your family/friends & suggest help if needed!
Wow, the mental health effects due to the pandemic are extreme , across nearly every way of slicing the population (gender, age, race/ethnicity, income level, education, urban/rural). That right hand column: 👀
Please check in with your family/friends & suggest help if needed!
1/ Covid (@UCSF) Chronicles, Day 150
Today, 150 days since I began my Covid tweets, I’m going to do something odd: write the speech that Trump should give. I have no faith he’ll do so, but it’s worth recognizing how little it would take to change course & save lives. Here goes:
This is horrifying. It appears the system is biased against less affluent schools. It factored in grades of past graduates.
This is how AI can be used to reinforce social and cultural disparities
This is horrifying. It appears the system is biased against less affluent schools. It factored in grades of past graduates.
This is how AI can be used to reinforce social and cultural disparities
@11kilobytes@xuenay I’ve been somewhat puzzled by the intensity of my own annoyance with AI hype. I think I’ve just discovered in this rant that, beneath irritation with badly-motivated, exaggerated claims, there’s deep sadness that the scientific opportunities are being lost. >
Hurts foreign students.
Hurts America's universities.
Hurts freedom.
Hurts America.
Hurts the World.
"Foreign college students could be forced to leave the U.S.... according to guidance released by Immigration and Customs Enforcement (ICE)."
axios.com/online-classes…
Arrogance has fueled a lot of Silicon Valley’s response to coronavirus.
It takes humility to consider that you might be wrong. It takes even more humility to understand how being wrong can be harmful 👇
stanforddaily.com/2020/04/11/hac…
Startup idea: a clothing store for college students called “It’s a Good Fit at This Time” where you can show a rejection letter from this week and get 20% off
Paper accepted to #cikm2019 on improving image classification using contextual information! Grateful for support from @GoogleAI and @sigir2019 to attend the conference in Beijing this November.
Preprint: ai.google/research/pubs/…
5K Followers 3K FollowingMaking the world a little better with software, materials, and community. Group Leader - AI & data infrastructure at @uchicago/@argonne. Opinions are mine.
994 Followers 658 Following🧠 AI Prod Mgmt🧑💻 Prev. first PM @ KumoAI. founded @LoopinHQ. ML growth @samsung 🤖 AI explorer 📣 $0.02 on PM, growth, and startups
475 Followers 631 Following@DukeU '18 → @NIH→ @Stanford MD '25 → @OpNotes Surgery Resident
In a world where you can choose to be anything, choose to be kind.
33 Followers 2K FollowingI usually take my dog for a walk 🐶 I enjoy golf ⛳ biking 🚴♀️ scuba diving 🤿 fitness 🏃 reading 📚 I love connecting with genuine people.
9K Followers 37 FollowingTeam member at something young.
Adjunct Prof @ McGill.
Member of Mila, Quebec AI Institute.
Stream of consciousness is my own.
23K Followers 1K FollowingComputer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.
5K Followers 0 FollowingVirtual seminar series featuring the latest advances in theoretical reinforcement learning. Seminars (approximately) every Tuesday at 6pm UTC.
2K Followers 3 FollowingWE'RE HIRING! (see website) Multimodal foundation models are the future of medical AI, and medical AI is the future of healthcare.
1K Followers 312 FollowingPhD Student in AI at @Mila_Quebec, making better OLMos at @allen_ai previously @MetaAI @ServiceNowRSRCH and softeng @UWaterloo
also @mnoukhov.bsky.social
644 Followers 0 FollowingHigh-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
50K Followers 0 Following🤖 highlighting changes to news on main page of @nytimes. By @j_e_d. Based on 💡 of @newsdiffs. 🔝 edits: https://t.co/tabi5vFptt
110K Followers 3K FollowingCPO @OpenAI, BoD @Cisco @nature_org, LTC @USArmyReserve
Prev: President @Planet, Head of Product @Instagram @Twitter
❤️ @elizabeth ultramarathons kids cats math
20K Followers 2K FollowingThis is the site where I talk about the attacks on science and immigration.
Science is on the other site.
Lab website: https://t.co/vrtbcqRyRn
4K Followers 416 FollowingOptimize cost & performance with AI platforms powered by our industry-leading SLMs: Arcee Conductor for model routing, & Arcee Orchestra for agentic workflows.