This week, in The Batch, Andrew Ng shares three big takeaways from DeepSeek’s big week. Plus: 🧠 How DeepSeek-R1 and Kimi k1.5 use reinforcement learning to improve chain-of-thought 🌐 OpenAI launches Operator, its first web agent 📊 A smarter way to fine-tune models with synthetic data Read The Batch: hubs.la/Q034M82R0
2
17
78
6K
16
@DeepLearningAI reinforcement learning truly changes the game! excited to see what's next.