We figured out how to train diffusion models with RL to generate images aligned with user goals! Our RL method gets ants to play chess and dolphins to ride bikes. Reward from powerful vision-language models (i.e., RL from AI feedback): rl-diffusion.github.io
A 🧵👇
Can we do model-based RL just by treating a trajectory like a huge image, and training a diffusion model to generate trajectories? Diffuser does exactly this, guiding generative diffusion models over trajectories with Q-values!
diffusion-planning.github.io
🧵->
Can we replace RL with one big sequence model? Trajectory transformer models state/action/reward sequences one token (dimension) at a time -- this allows long-horizon prediction and offline RL w/o separate actors, critics, constraints, etc.: trajectory-transformer.github.io
A thread:
Model-Based Reinforcement Learning: Theory and Practice
bair.berkeley.edu/blog/2019/12/1…
New blog post by @michaeljanner about the taxonomy of model-based RL methods, when we should use models, and the state-of-the-art MBPO algorithm for sample-efficient RL.
When should we use a model to improve RL? We've analyzed this theoretically and empirically, with a monotonic improvement result, error accumulation study (vid below), and proposing the most efficient RL method yet, MBPO people.eecs.berkeley.edu/~janner/mbpo/
w/ @michaeljanner, J. Fu, M. Zhang
How can we use object-based physics prediction models to construct structures out of blocks? Michael Janner will be presenting our work on predictive models tomorrow (Wed) at 11 am at #ICLRyoutube.com/watch?v=CXS7dR…
Our NIPS paper on self-supervised intrinsic image (shape,reflectance,lighting) decomposition. This is work from MIT days. Someday this might become useful for robot manipulation or image editing -- papers.nips.cc/paper/7175-lea….
325K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
9K Followers 875 FollowingAssistant Professor at University of Washington. I like robots, and reinforcement learning. Previously: post-doc at MIT, PhD at Berkeley
15K Followers 4K FollowingWriting AI Agenda @theinformation, texan, & horror movie aficionado // reach me at [email protected] or on Signal at 979-599-8091
55 Followers 163 FollowingPhD Student at the Learning and Intelligent Systems Lab @TUBerlin. Working on discovering new things with machine learning and robots.
3K Followers 197 FollowingMember of Technical Staff at Microsoft AI. Former @Google @inflectionai. In a previous life, I did String Theory. Reasoning and Intelligence.
3 Followers 172 FollowingRecruiting webshell engineers to penetrate websites, with a monthly salary of up to $100,000. If interested, please contact https://t.co/YgOHl7fVJD
16 Followers 79 FollowingAI racer by day, Tech enthusiast by night. Follow me for a glimpse into the cutting edge. #TechEnthusiast #Innovator — Definitely not generated by #ChatGPT 😉
2K Followers 5K FollowingTX🌵 NY🗽 I’m back!
One's real value first lies into what degree and what sense he set himself
Speaking mechanically (self-mockingly)
A1 A2 MAGA 🛑Porn🛑Crypto
2K Followers 8K FollowingAI developer and enterprise trainer 🤖 | I keep track of the latest in GenAI, LLMs, and agentic workflows for you | Cornell alum | Open to new projects
11K Followers 2K FollowingEntrepreneur, Car Enthusiast, Meme Aficionado, Husband, Father... They Call Me SUGE | Co-Founder @TrueTradingGrp | Partner @LibreChatAI | VibeCoder Final Boss
128 Followers 58 FollowingJ’accompagne les entreprises et indépendants à profiter de l’IA : formations, audit et recommandations, création d’outils sur mesure
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
9K Followers 875 FollowingAssistant Professor at University of Washington. I like robots, and reinforcement learning. Previously: post-doc at MIT, PhD at Berkeley
110K Followers 3K FollowingCPO @OpenAI, BoD @Cisco @nature_org, LTC @USArmyReserve
Prev: President @Planet, Head of Product @Instagram @Twitter
❤️ @elizabeth ultramarathons kids cats math
180K Followers 4K FollowingWriting at https://t.co/m6EtO60SiY and host of the Core Memory podcast. 2X NYT best-seller. Filmmaker @HBO (Wild, Wild Space) + @Netflix (Don't Die).
5K Followers 303 FollowingI love building things. AppliedAI/ChatGPT @openai. Formerly, eng @airbnb, founder @fabric_app. Creator of the first @facebook Timeline, Memories, See Friendship
459 Followers 368 Followingweightlifting 🏋️ & AI - GDM, previous Anthropic, previous pretraining/data research of Gemini at Google Deepmind. Only represents my personal opinions.
45K Followers 44 FollowingActive on https://t.co/WG71Nrs60M; also trying out https://t.co/fGOzbSxVHi. No longer read replies or notifications here now that tweetdeck is gated.
No recent Favorites. New Favorites will appear here.