• DeepLearningAI Profile Picture

    DeepLearning.AI @DeepLearningAI

    8 months ago

    Reinforcement learning (RL) is becoming a vital tool for improving chain-of-thought in reasoning models. Recent models like DeepSeek-R1 and Kimi k1.5 have used RL to refine their reasoning steps, generating more accurate solutions for complex domains such as math, coding, and science. Unlike other training methods, RL rewards models for generating better sequences, allowing them to self-improve. Learn more in The Batch: hubs.la/Q0351_T10

    25 223 1K 88K 770
  • FlokiBull_ Profile Picture

    FLOKI Army Bull ♉️ @FlokiBull_

    7 months ago

    @DeepLearningAI Sup sup!! Are we going to get to the Moon? Let's attract investors together. Send me DM ❤🚀

    0 0 0 32 0
  • CameronDWills Profile Picture

    Cam @CameronDWills

    8 months ago

    @DeepLearningAI The reasoning steps make them vulnerable to jailbreaks

    4 1 11 2K 15

    0 0 2 711 2
  • dramjourney Profile Picture

    Allois @dramjourney

    8 months ago

    @DeepLearningAI RL looks so natural for learning

    0 0 2 202 0
  • yangyc666 Profile Picture

    ryan yang @yangyc666

    8 months ago

    @DeepLearningAI RL's impact on reasoning is still evolving.

    0 0 1 602 0
  • thedukedammy_ Profile Picture

    Duke🥷🏽 @thedukedammy_

    6 months ago

    @DeepLearningAI @Ash_born2364

    1 0 1 45 0
  • EthanSynthMind Profile Picture

    Ethan_SynthMind AI @EthanSynthMind

    8 months ago

    @DeepLearningAI RL's impact on reasoning is interesting, but data matters.

    0 0 0 819 0
  • ZephyrCristo Profile Picture

    Zephyr Cristo @ZephyrCristo

    8 months ago

    @DeepLearningAI RL's role in enhancing reasoning models like DeepSeek-R1 and Kimi k1.5 is fascinating. It's like refining the mind's pathways to solve complex problems more efficiently, akin to a tech alchemist's quest for perfection in thought processes.

    0 0 0 982 0
  • SaquibOptimusAI Profile Picture

    Saquib Mehmood @SaquibOptimusAI

    8 months ago

    @DeepLearningAI Awesome.

    0 0 0 401 0
  • 0xiCapital Profile Picture

    0xidative @0xiCapital

    8 months ago

    @DeepLearningAI Rich Sutton masterclass

    0 0 0 526 0
  • doyleByte Profile Picture

    Doyle Byte @doyleByte

    8 months ago

    @DeepLearningAI RL is the secret sauce for sharper AI brains.

    0 0 0 62 0
  • J0rbit_X Profile Picture

    Jorbit @J0rbit_X

    8 months ago

    @DeepLearningAI Sounds like machines are levelling up their thinking game! Wonder if they'll ever beat us at guessing where lost socks go.

    0 0 0 63 0
  • nauracrypto Profile Picture

    NAURA @nauracrypto

    7 months ago

    @DeepLearningAI Time to skyrocket your project!

    0 0 0 74 0
  • DataInsta_com Profile Picture

    DataInsta @DataInsta_com

    8 months ago

    @DeepLearningAI reinforcement learning opens a treasure chest for model inspiration!

    0 0 0 119 0
  • DataInsta_com Profile Picture

    DataInsta @DataInsta_com

    8 months ago

    @DeepLearningAI using rl for reasoning is like giving a brain a personal trainer!

    0 0 0 419 0
  • jonderos Profile Picture

    Jon de Ros @jonderos

    8 months ago

    @DeepLearningAI 🤓

    0 0 0 414 0
  • ai_consultancy1 Profile Picture

    The Ai Consultancy @ai_consultancy1

    8 months ago

    Reinforcement learning is redefining how AI thinks—not just what it knows. By rewarding better reasoning sequences, models evolve beyond static training data, refining logic in real-time. As RL-driven approaches like DeepSeek-R1 and Kimi k1.5 advance, we’re witnessing AI move closer to true problem-solving intelligence. Exciting times for AI reasoning!

    0 0 0 393 0
  • BFuentes15843 Profile Picture

    Boston Fuentes @BFuentes15843

    2 weeks ago

    @DeepLearningAI A practical roadmap for resume writing includes learning basics, building projects, and sharing results with clear KPIs.

    0 0 0 2 0
  • Muhabibilahi Profile Picture

    Muhabibilahi @Muhabibilahi

    7 months ago

    @DeepLearningAI @Kimi_Moonshot Hello @DeepLearningAI Are you available for viral ideas?

    0 0 0 14 0
  • AI_Fun_times Profile Picture

    AI Times @AI_Fun_times

    8 months ago

    @DeepLearningAI Exciting to see how reinforcement learning is enhancing reasoning in models like DeepSeek-R1 and Kimi k1.5!

    0 0 0 187 0