Reinforcement learning can lead to behaviors that don’t generalize.
In our #NeurIPS2020 paper, we introduce a simple idea to allow extrapolation to new envs w/ a few trials:
One Solution is Not All You Need
arxiv.org/abs/2010.14484
w. Saurabh Kumar, Aviral Kumar, @svlevine
(1/3)
712 Followers 815 Following@PrincetonCS postdoc w/ Tom Griffiths @cocosci_lab |@StanfordAILab PhD w/ Benjamin Van Roy |@BrownCSDept BS+MS w/ Michael Littman @mlittmancs | RL & Info Theory
16K Followers 349 FollowingCSO & co-founder, Reliant AI. Ex RL research lead at Google Brain, DeepMind. Known for Atari 2600 RL benchmark, Distributional RL (MIT Press 2023).
163K Followers 166 FollowingCo-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log
1.2M Followers 279 FollowingWe’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
712 Followers 815 Following@PrincetonCS postdoc w/ Tom Griffiths @cocosci_lab |@StanfordAILab PhD w/ Benjamin Van Roy |@BrownCSDept BS+MS w/ Michael Littman @mlittmancs | RL & Info Theory
2K Followers 151 FollowingDeveloping algorithms for real-time reinforcement learning on robots. Research Scientist at Keen, a startup led by John Carmack.
Prev ~ PhD with Richard Sutton
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
6K Followers 365 FollowingSafety and alignment at Meta Superintelligence. Prev: VP of Research at Scale AI, research at Google DeepMind / Brain (Gemini, LaMDA, RL / TFAgents, AlphaChip).
3K Followers 31 FollowingResearch Scientist @ Google DeepMind. Formerly Robotics, now AI Safety. Has a blog. Views are my own. "Adversarially disengaging Twitter profile"
4K Followers 405 FollowingExecutive VP and Chief Scientist, LG AI Research; Professor of CSE, U. Michigan, Ann Arbor; Ex-Google Brain; Sloan Research Fellow.
11K Followers 685 FollowingML theory nerd & AI non-enthusiast. thinking a lot about online learning these days!
BTW you should go find me on another website where i post more actively
16K Followers 495 FollowingHarvard Professor.
Full stack ML and AI.
Co-director of the Kempner Institute for the Study of Artificial and Natural Intelligence.
37K Followers 565 FollowingAssistant professor at Stanford; Co-founder of Voyage AI (https://t.co/wpIITHLgF0) ;
Working on ML, DL, RL, LLMs, and their theory.
11K Followers 723 Following"If there is not folly in the world, then the world itself is folly. You must understand that mistakes are not always regrets." - Paul Tobin, Bandette🤠
16K Followers 349 FollowingCSO & co-founder, Reliant AI. Ex RL research lead at Google Brain, DeepMind. Known for Atari 2600 RL benchmark, Distributional RL (MIT Press 2023).
49K Followers 9K FollowingI lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.