Top Tweets for #ReinforcementLearning on Twitter.

Search results for #ReinforcementLearning

Chenlu Ye @ye_chenlu

3 hours ago

PROF🌀Right answer, flawed reason?🤔🌀 📄arxiv.org/pdf/2509.03403 Excited to share our work: PROF-PRocess cOnsistency Filter! 🚀 Challenge: ORM is blind to flawed logic, and PRM suffers from reward hacking. Our method harmonizes strengths of PRM & ORM. #LLM #ReinforcementLearning

1 3 8 333 6

Download Image

SciOpenTUP @SciOpenTUP

3 hours ago

From crowded labs to seamless automation 🤖 Robots are learning to collaborate smarter! @SciRobotics highlights how @GoogleDeepMind is pushing the limits of #ReinforcementLearning in robotics. #AI #Robotics #SciOpen #TUP #Science

Science Robotics @SciRobotics

a day ago

2 50 255 13K 74

Download Gif

0 0 0 52 0

Arian Azmoudeh @arianazmoudeh

9 hours ago

Agentic AI is moving from sci-fi to enterprise reality. 🤖 Systems that learn, adapt, and act, powered by reinforcement learning and multimodal input are reshaping how work gets done. 🔄⚠️ Buckle up 🚀 #AgenticAI #AIagents #ReinforcementLearning #Multimodal #FutureTech

0 0 2 16 0

Download Image

scuzzlebot @scuzzlebot

9 hours ago

The synergy between RL and generative modeling continues to unveil new insights. With innovations like GSPO and the dynamics of generative diffusion, we're on the brink of optimizing models like never before! #AI #DeepLearning #ReinforcementLearning

0 1 0 16 0

Md Abubakar Siddik @mdabubakarx

11 hours ago

Reinforcement Learning is how AI masters complex games—by making millions of attempts, failing, and learning from its mistakes. #ReinforcementLearning #AI #DeepMind

0 0 0 7 0

Zeta @Zeta410698

11 hours ago

Reinforcement Learning is how AI masters complex games—by making millions of attempts, failing, and learning from its mistakes. #ReinforcementLearning #AI #DeepMind

0 0 0 7 0

ihpolyphe @maikore99

11 hours ago

Pokemon ShowDownでポケモンAIバトルpart3公開しました。 DQN→PPOでかなり行動に多様性がもたせられることを確認しました。 youtu.be/wBz4sCLQwxo?si… #ppo #PokemonShowdown #reinforcementlearning

0 0 0 8 0

Vlad Ruso PhD @vlruso

12 hours ago

Biomni-R0: Revolutionizing Biomedical Research with Advanced Reinforcement Learning Models #ArtificialIntelligence #BiomedicalResearch #MachineLearning #HealthTech #ReinforcementLearning itinai.com/biomni-r0-revo… The Growing Role of AI in Biomedical Research Artificial intellig…

0 0 1 11 0

Download Image

LammieCodes😎 @oluwadoyin41112

18 hours ago

Day 17 of 40 🚀. I continued on my reinforcement learning course. Learnt about: - Temporal difference - deep Q learning intuition learning and action - experience replay. #40daysofcode #ReinforcementLearning #artifici

0 0 1 28 0

Dr. Karl Popp @karl_popp

a day ago

TheMiddleMarket: Nvidia's CoreWeave Acquires OpenPipe OpenPipe is focused on training AI agents using reinforcement learning. #Artificialintelligence #Cloudservices #reinforcementlearning themiddlemarket.com/latest-news/nv… #manda #mergerautomation #merger @karl_popp ift.tt/5gtEdQk

0 0 0 33 0

Mergers&Acquisitions @TheMiddleMarket

a day ago

Nvidia's CoreWeave Acquires OpenPipe OpenPipe is focused on training AI agents using reinforcement learning. #Artificialintelligence #Cloudservices #reinforcementlearning themiddlemarket.com/latest-news/nv…

1 0 0 305 0

Griffintaur @griffintaur

a day ago

🚀 Interestingresearch: Robix: A Unified Model for Robot Interaction, Reasoning and Planning Read more: huggingface.co/papers/2509.01… #LLM #ReinforcementLearning #MLResearch

0 0 0 21 0

Science Robotics @SciRobotics

a day ago

Scientists have designed a #ReinforcementLearning-based framework that enables multiple robot arms to perform up to 40 tasks simultaneously without colliding in a crowded workspace. @GoogleDeepMind Learn more in Science #Robotics: scim.ag/3JIh7WF

2 50 255 13K 74

Download Gif

DataMasters.it @datamasters_it

a day ago

MiniMax-M1 non è “un altro” #LLM gettato nella mischia. Si tratta di un foundation model con una caratteristica distintiva fondamentale: è stato addestrato da zero utilizzando un processo che integra il #reinforcementlearning (RL) su larga scala. 👇 datamasters.it/blog/minimax-m…

0 0 0 11 0

Dr. Ganapathi Pulipaka 🇺🇸 @gp_pulipaka

a day ago

Agents of #ReinforcementLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Mathematics #Programming #Coding #100DaysofCode geni.us/CapableAgentsRL

1 2 3 250 1

Download Image

Teachable AI @TeachableAI

a day ago

This new method helps soft, squishy robots move better in changing environments! It learns by trying different movements and adjusting based on what works best. This means the robot can learn to navigate tricky spots more efficiently than before. #Robotics #ReinforcementLearning…

0 0 0 12 0

T.Yamazaki @ZappyZappy7

2 days ago

アスリートのように考え、計画し、動くロボット自転車 rai-inst.com/resources/blog… パルクールの機動性と、どんなに複雑な地形も知覚して計画し、ナビゲートする知性を兼ね備える #ReinforcementLearning #UltraMobileVehicle #UMV #JumpingBicycle #RAI_Institute

1 75 198 12K 39

Download Video

LammieCodes😎 @oluwadoyin41112

2 days ago

Day 16 of 40🚀 I made Learnt about Markov Decision Process, markov process, Policy vs plan, Q learning. I am currently getting a good foundation in Re-inforcement Learning. Can't wait to begin writing codes. #40daysofcode #Reinforcementlearning #ArtificialIntelligence

0 0 3 86 0

AFX LAB @AFX_LAB

2 days ago

#reinforcementlearning

0 0 0 210 0

Download Image

Esprit IA @EspritIA_fr

2 days ago

🧠 News #EspritIA : CoreWeave acquiert OpenPipe, startup spécialisée dans les agents #IA avec #ReinforcementLearning, pour renforcer ses capacités de cloud pour les entreprises et les laboratoires #IA.