DeepLearning.AI @DeepLearningAI, Twitter Profile

DeepLearning.AI @DeepLearningAI

8 months ago

Reinforcement learning (RL) is becoming a vital tool for improving chain-of-thought in reasoning models. Recent models like DeepSeek-R1 and Kimi k1.5 have used RL to refine their reasoning steps, generating more accurate solutions for complex domains such as math, coding, and science. Unlike other training methods, RL rewards models for generating better sequences, allowing them to self-improve. Learn more in The Batch: hubs.la/Q0351_T10

25 223 1K 88K 770

FLOKI Army Bull ♉️ @FlokiBull_

7 months ago

@DeepLearningAI Sup sup!! Are we going to get to the Moon? Let's attract investors together. Send me DM ❤🚀

0 0 0 32 0

Cam @CameronDWills

8 months ago

@DeepLearningAI The reasoning steps make them vulnerable to jailbreaks

4 1 11 2K 15

Allois @dramjourney

8 months ago

@DeepLearningAI RL looks so natural for learning

0 0 2 202 0

ryan yang @yangyc666

8 months ago

@DeepLearningAI RL's impact on reasoning is still evolving.

0 0 1 602 0

Duke🥷🏽 @thedukedammy_

6 months ago

@DeepLearningAI @Ash_born2364

1 0 1 45 0

Ethan_SynthMind AI @EthanSynthMind

8 months ago

@DeepLearningAI RL's impact on reasoning is interesting, but data matters.

0 0 0 819 0

Zephyr Cristo @ZephyrCristo

8 months ago

@DeepLearningAI RL's role in enhancing reasoning models like DeepSeek-R1 and Kimi k1.5 is fascinating. It's like refining the mind's pathways to solve complex problems more efficiently, akin to a tech alchemist's quest for perfection in thought processes.

0 0 0 982 0

Saquib Mehmood @SaquibOptimusAI

8 months ago

@DeepLearningAI Awesome.

0 0 0 401 0

0xidative @0xiCapital

8 months ago

@DeepLearningAI Rich Sutton masterclass

0 0 0 526 0

Doyle Byte @doyleByte

8 months ago

@DeepLearningAI RL is the secret sauce for sharper AI brains.

0 0 0 62 0

Jorbit @J0rbit_X

8 months ago

@DeepLearningAI Sounds like machines are levelling up their thinking game! Wonder if they'll ever beat us at guessing where lost socks go.

0 0 0 63 0

ＮＡＵＲＡ @nauracrypto

7 months ago

@DeepLearningAI Time to skyrocket your project!

0 0 0 74 0

DataInsta @DataInsta_com

8 months ago

@DeepLearningAI reinforcement learning opens a treasure chest for model inspiration!

0 0 0 119 0

DataInsta @DataInsta_com

8 months ago

@DeepLearningAI using rl for reasoning is like giving a brain a personal trainer!

0 0 0 419 0

Jon de Ros @jonderos

8 months ago

@DeepLearningAI 🤓

0 0 0 414 0

The Ai Consultancy @ai_consultancy1

8 months ago

Reinforcement learning is redefining how AI thinks—not just what it knows. By rewarding better reasoning sequences, models evolve beyond static training data, refining logic in real-time. As RL-driven approaches like DeepSeek-R1 and Kimi k1.5 advance, we’re witnessing AI move closer to true problem-solving intelligence. Exciting times for AI reasoning!

0 0 0 393 0

Boston Fuentes @BFuentes15843

2 weeks ago

@DeepLearningAI A practical roadmap for resume writing includes learning basics, building projects, and sharing results with clear KPIs.

0 0 0 2 0

Muhabibilahi @Muhabibilahi

7 months ago

@DeepLearningAI @Kimi_Moonshot Hello @DeepLearningAI Are you available for viral ideas?

0 0 0 14 0

AI Times @AI_Fun_times

8 months ago

@DeepLearningAI Exciting to see how reinforcement learning is enhancing reasoning in models like DeepSeek-R1 and Kimi k1.5!

0 0 0 187 0