Taehyeon Kim @kimtaehyeon610

Research Scientist (Team Lead) - @LG_AI_Research. Prev: @GoogleAI (NYC 🇺🇸), @Qualcomm AI, @dynamo_ai (YCW22). Agent/LLM inference/alignment. 🎧 taehyeon.oopy.io Seoul, Korea Joined November 2021

Tweets

177
Followers

555
Following

243
Likes

2K

Jeff Dean @JeffDean

2 weeks ago

AI efficiency is important. Today, Google is sharing a technical paper detailing our comprehensive methodology for measuring the environmental impact of Gemini inference. We estimate that the median Gemini Apps text prompt uses 0.24 watt-hours of energy (equivalent to watching an…

149 832 4K 718K 2K

Download Image

Vaish Shrivastava @VaishShrivas

3 weeks ago

Test-time scaling w/ GRPO boosts accuracy, but also adds “filler tokens” increasing length w/o real progress. We present Group Filtered Policy Optimization (GFPO):🧵 1️⃣ Sample more per prompt 2️⃣ Rank by token efficiency (reward ÷ length) 3️⃣ Train on top-k 4️⃣ 🚀 Cut 80% of…

4 48 331 58K 278

Download Image

Jack Morris @jxmnop

4 weeks ago

OpenAI hasn’t open-sourced a base model since GPT-2 in 2019. they recently released GPT-OSS, which is reasoning-only... or is it? turns out that underneath the surface, there is still a strong base model. so we extracted it. introducing gpt-oss-20b-base 🧵

159 470 6K 918K 4K

Download Image

Ming Yin @MingYin_0312

4 weeks ago

I implemented GRPO and DPO from scratch in vanilla Pytorch to unravel every piece of training details. Hope it could be helpful for those who care about the implementation details of the algorithms. 👉 github.com/mingyin0312/RL… #AI #RL #LLM

15 211 2K 102K 2K

Sam Altman @sama

a month ago

gpt-oss is out! we made an open model that performs at the level of o4-mini and runs on a high-end laptop (WTF!!) (and a smaller one that runs on a phone). super proud of the team; big triumph of technology.

2K 4K 46K 4.2M 8K

Sangmin Bae @raymin0223

a month ago

✨Huge thanks for interest in Mixture-of-Recursions! Codes are officially out! It's been a long journey exploring Early-exiting with Recursive Architecture. I'll soon post my 👨‍🎓PhD thesis on Adaptive Computation too! Code: github.com/raymin0223/mix… Paper: arxiv.org/abs/2507.10524

6 64 278 16K 168

Download Image

Yujin Kim @yujin301300

2 months ago

Introducing our new work: 🚀Mixture-of-Recursions! 🪄We propose a novel framework that dynamically allocates recursion depth per token. 🪄MoR is an efficient architecture with fewer params, reduced KV cache memory, and 2× greater throughput— maintaining comparable performance!

10 60 331 22K 220

Download Image

Alex Prompter @alex_prompter

2 months ago

R.I.P McKinsey. You don’t need a $300k consultant anymore. You can now run full competitive market analysis using Grok 4. Here are the exact 3 mega-prompts I use to replicate McKinsey-style insights for free:

858 5K 44K 13.7M 62K

Download Image

Dongmin Park @dongmin_park11

3 months ago

🚨New Paper Alert As a game company, @Krafton_AI is actively exploring how to apply LLM agents to video games. We present Orak—a foundational video gaming benchmark for LLM agents! Includes Pokémon, StarCraft II, Slay the Spire, Darkest Dungeon, Ace Attorney, and more in🧵

2 23 74 10K 21

Download Image

Johannes Oswald @oswaldjoh

3 months ago

Super happy and proud to share our novel scalable RNN model - the MesaNet! This work builds upon beautiful ideas of 𝗹𝗼𝗰𝗮𝗹𝗹𝘆 𝗼𝗽𝘁𝗶𝗺𝗮𝗹 𝘁𝗲𝘀𝘁-𝘁𝗶𝗺𝗲 𝘁𝗿𝗮𝗶𝗻𝗶𝗻𝗴 (TTT), and combines ideas of in-context learning, test-time training and mesa-optimization.

4 64 402 87K 334

Download Image

Carlos E. Perez @IntuitMachine

4 months ago

Shocker! Claude 4 system prompt was leaked, and it's a goldmine! The Claude system prompt incorporates several identifiable agentic AI patterns as described in "A Pattern Language For Agentic AI." Here's an analysis of the key patterns used: Run-Loop Prompting: Claude…

63 497 5K 1.2M 13K

Download Image

Rohan Paul @rohanpaul_ai

5 months ago

Small language models struggle with complex reasoning tasks where large models excel. This paper introduces the SMART framework, where a small model performs reasoning but selectively requests corrections from a large model only for steps identified as uncertain via a scoring…

4 31 179 11K 121

Download Image

Genspark @genspark_ai

5 months ago

Meet Genspark Super Agent - a fast & reliable general AI agent! Check it out: genspark.ai

61 142 747 316K 575

Download Video

Pieter Abbeel @pabbeel

6 months ago

Basics of Deep RL tutorial I am still very happy with, as good a day as any to re-post :) youtube.com/playlist?list=…

15 115 946 116K 741

Yi Ma @YiMaTweets

8 months ago

Academia should focus on discovering simplifying and unifying principles and mechanisms behind intelligence; and industry is obviously better equipped to manifest and scale up. That is the same as physics/mechanics to building big airplanes... But I do not believe the current…

Lucas Beyer (bl16) @giffmana

8 months ago

8 18 259 80K 75

5 16 156 31K 65

Stephanie Chan @scychan_brains

8 months ago

Devastatingly, we have lost a bright light in our field. Felix Hill was not only a deeply insightful thinker -- he was also a generous, thoughtful mentor to many researchers. He majorly changed my life, and I can't express how much I owe to him. Even now, Felix still has so much…

6 94 610 89K 561

John Nguyen @JohnNguyen

9 months ago

🥪New Paper! 🥪Introducing Byte Latent Transformer (BLT) - A tokenizer free model scales better than BPE based models with better inference efficiency and robustness. 🧵

12 64 446 89K 315

Download Image

Tanishq Mathew Abraham, Ph.D. @iScienceLuvr

9 months ago

A new tutorial on RL by Kevin Patrick Murphy, a Research Scientist at Google DeepMind who also wrote several comprehensive, well-regarded textbooks on ML/DL. This ought to be a good read 👀

18 271 2K 223K 3K

Download Image

Sebastien Bubeck @SebastienBubeck

9 months ago

This should be interesting! forum.openai.com/public/events/…

8 32 234 24K 109

Yu Su (hiring postdoc) @ysu_nlp

10 months ago

Sharing the slides of my talk at Princeton yesterday--"A holistic and critical look at language agents": ysu1989.github.io/resources/lang… LLM-based language agents are exciting, but it's also undeniably a quite chaotic space: are agents the next big thing, or are they just thin wrappers…