Search results for #ReinforcementLearning
PROF: Right answer, flawed reason? arxiv.org/pdf/2509.03403 Excited to share our work: PROF-PRocess cOnsistency Filter! Challenge: ORM is blind to flawed logic, and PRM suffers from reward hacking. Our method harmonizes the strengths of PRM & ORM. #LLM #ReinforcementLearning
From crowded labs to seamless automation: robots are learning to collaborate smarter! @SciRobotics highlights how @GoogleDeepMind is pushing the limits of #ReinforcementLearning in robotics. #AI #Robotics #SciOpen #TUP #Science
Agentic AI is moving from sci-fi to enterprise reality. Systems that learn, adapt, and act, powered by reinforcement learning and multimodal input, are reshaping how work gets done. Buckle up. #AgenticAI #AIagents #ReinforcementLearning #Multimodal #FutureTech
The synergy between RL and generative modeling continues to unveil new insights. With innovations like GSPO and the dynamics of generative diffusion, we're on the brink of optimizing models like never before! #AI #DeepLearning #ReinforcementLearning
Reinforcement Learning is how AI masters complex games: by making millions of attempts, failing, and learning from its mistakes. #ReinforcementLearning #AI #DeepMind
Released part 3 of my Pokémon AI battle series on Pokemon ShowDown. Confirmed that moving from DQN to PPO brings considerably more diversity to the agent's actions. youtu.be/wBz4sCLQwxo?si… #ppo #PokemonShowdown #reinforcementlearning
Biomni-R0: Revolutionizing Biomedical Research with Advanced Reinforcement Learning Models #ArtificialIntelligence #BiomedicalResearch #MachineLearning #HealthTech #ReinforcementLearning itinai.com/biomni-r0-revo… The Growing Role of AI in Biomedical Research Artificial intellig…
Day 17 of 40. I continued my reinforcement learning course. Learnt about: - Temporal difference - Deep Q-learning intuition (learning and acting) - Experience replay. #40daysofcode #ReinforcementLearning #artifici
TheMiddleMarket: Nvidia's CoreWeave Acquires OpenPipe OpenPipe is focused on training AI agents using reinforcement learning. #Artificialintelligence #Cloudservices #reinforcementlearning themiddlemarket.com/latest-news/nv… #manda #mergerautomation #merger @karl_popp ift.tt/5gtEdQk
Nvidia's CoreWeave Acquires OpenPipe OpenPipe is focused on training AI agents using reinforcement learning. #Artificialintelligence #Cloudservices #reinforcementlearning themiddlemarket.com/latest-news/nv…
Interesting research: Robix: A Unified Model for Robot Interaction, Reasoning and Planning Read more: huggingface.co/papers/2509.01… #LLM #ReinforcementLearning #MLResearch
Scientists have designed a #ReinforcementLearning-based framework that enables multiple robot arms to perform up to 40 tasks simultaneously without colliding in a crowded workspace. @GoogleDeepMind Learn more in Science #Robotics: scim.ag/3JIh7WF
MiniMax-M1 is not "just another" #LLM thrown into the fray. It is a foundation model with one fundamental distinguishing feature: it was trained from scratch using a process that integrates large-scale #reinforcementlearning (RL). datamasters.it/blog/minimax-m…
This new method helps soft, squishy robots move better in changing environments! It learns by trying different movements and adjusting based on what works best. This means the robot can learn to navigate tricky spots more efficiently than before. #Robotics #ReinforcementLearning…
A robot bicycle that thinks, plans, and moves like an athlete rai-inst.com/resources/blog… It combines parkour-like agility with the intelligence to perceive, plan, and navigate any terrain, however complex. #ReinforcementLearning #UltraMobileVehicle #UMV #JumpingBicycle #RAI_Institute
Day 16 of 40. Learnt about Markov decision processes, Markov processes, policy vs. plan, and Q-learning. I am currently getting a good foundation in reinforcement learning. Can't wait to begin writing code. #40daysofcode #Reinforcementlearning #ArtificialIntelligence
News #EspritIA: CoreWeave acquires OpenPipe, a startup specializing in #IA agents with #ReinforcementLearning, to strengthen its cloud capabilities for enterprises and #IA labs.

ReinforcementLearning @ReinforcementL
10 Followers 25 Following
Jonathan Balloch @JonathanBalloch
383 Followers 1K Following I mostly tweet about #ai, #robots, #science, @packers... Robotics PhD student @GeorgiaTech studying #reinforcementlearning and #AI Thought/opinions are mine
Technion - Reinforcem... @Technion_RL
765 Followers 15 Following Official account covering research performed in the various #ReinforcementLearning labs at the @TechnionLive
Daniel J. Mankowitz @DJ_Mankowitz
1K Followers 49 Following Co-founder & CTO @ Ethos Ex. Staff Research Scientist @Deepmind, AlphaDev, MuZero for Video Compression, AlphaCode #deeplearning #reinforcementlearning
Yu-Xiang Wang @yuxiangw_cs
3K Followers 350 Following Faculty @hdsiucsd, director of S2ML lab. Visitor @awscloud. Prev @ucsbcs @SCSatCMU. Researcher in #machinelearning, #reinforcementlearning, #differentialprivacy
Ofir Nachum @ofirnachum
5K Followers 355 Following Research at @OpenAI. Previously at @GoogleAI on the Brain Team. Doing work on #ReinforcementLearning and #MachineLearning
James @jmac_ai
785 Followers 581 Following Ask me about #ReinforcementLearning #AI research @SonyAI_global RL for games, robotics, and other real-world applications Views and tweets are my own.
Joseph Cox @JosephJohnCox
199 Followers 705 Following operations research, data science, infra @ Hadrian; #machinelearning #reinforcementlearning; category theorist, information geometry, sci-fi.
CogitAI @Cogitai
283 Followers 70 Following Invented the industry's first self-learning AI SaaS platform #AI #selflearningAI #continuallearning #reinforcementlearning
Jacqueline Isabelle F... @JackieForien
393 Followers 678 Following CEO @ML_fr_company #MSc #MachineLearning @UCL, co-org @ParisMLgroup, Meetup Chair @NeurIPSConf, French translator of "#ReinforcementLearning" R.Sutton & A.Barto
Seydina Ndiaye @seysoosey
1K Followers 712 Following #AI_Algorithms #eHealth #ArtificialIntelligence #MachineLearning #ReinforcementLearning
robertjneal @robertjneal
342 Followers 326 Following Cars Ruin Cities #Experimentation, #ReinforcementLearning @LaunchDarkly Formerly #philosophy, #CogSci, @twittereng, @CreditKarmaEng