Cohere intends to acquire Perplexity immediately after Perplexity's own acquisitions of TikTok and Google Chrome close.
We will continue to monitor the progress of those deals closely so we can submit our term sheet upon completion.
Thoughts: LLMs provide powerful priors for RL, but several recent studies suggest the gains often come from simply narrowing the model's output distribution, which improves measured performance while exhausting the model's capacity for further exploration and improvement.
*Is it a blessing or a curse?* 🤔
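A quick numeric sketch of the "narrowing" point (my own illustration, not from any of the studies alluded to above): sharpening a softmax policy, here via temperature, raises the top probability while collapsing entropy, which is exactly the exploration budget RL relies on.

```python
# Minimal sketch: a sharper (lower-temperature) softmax policy has lower
# entropy, so it answers more confidently but explores less.
import math

def softmax(logits, temperature=1.0):
    exps = [math.exp(l / temperature) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    return -sum(p * math.log(p) for p in probs if p > 0)

logits = [2.0, 1.0, 0.5, 0.1]  # hypothetical next-token logits

for t in (1.0, 0.5, 0.1):  # lower temperature = narrower output distribution
    p = softmax(logits, temperature=t)
    print(f"T={t}: top prob={max(p):.3f}, entropy={entropy(p):.3f} nats")
```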
I want to applaud @OpenAI on releasing GPT-4.5. It's not a benchmark beater, and they released it anyway.
That takes some courage on their part, because they will get a lot of dumb criticism on eval scores.
(If you think it needs to top evals to be valuable, you are wrong.)
A Reddit grandfather uploaded a 27-year-old EXE of a Visual Basic game, and Claude one-shotted a Python recreation of the game in under 5 minutes!!
From the binary.
Excited to be with the team in NYC today rolling out the new Alexa+.
Across Amazon, we’re harnessing the transformative power of GenAI to reimagine the experiences we offer customers, and Alexa+ is the latest example.
She’s smarter, more capable, more personalized, and unlike…
Say hello to Alexa+. Need dinner plans? She'll book your favorite restaurant, grab an Uber, and text your sitter — all in one conversation. Want concert tickets? She'll scout for the best prices. Need to check if the garbage went out? She'll find that exact Ring clip in seconds.…
Let me add a bit of context to the latest DeepSeek code release, as I feel it was a bit bare-bones.
Mixture-of-Experts (MoE) is a simple extension of transformers that is rapidly establishing itself as the go-to architecture for mid-to-large LLMs (20B-600B parameters).
It…
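For readers new to the architecture, here is a minimal sketch of a top-k routed MoE feed-forward block in PyTorch; the sizes, the linear router, and k=2 routing are illustrative defaults of mine, not DeepSeek's actual configuration.

```python
# Sketch of a top-k MoE layer: a router scores experts per token, each token
# is processed by only its top-k experts, and outputs are gate-weighted.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    def __init__(self, d_model=512, d_ff=2048, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts)  # gating network
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                              # x: (tokens, d_model)
        gate = F.softmax(self.router(x), dim=-1)       # (tokens, n_experts)
        weights, idx = gate.topk(self.k, dim=-1)       # route to top-k experts
        weights = weights / weights.sum(-1, keepdim=True)
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            token_ids, slot = (idx == e).nonzero(as_tuple=True)
            if token_ids.numel():                      # tokens routed to expert e
                out[token_ids] += weights[token_ids, slot, None] * expert(x[token_ids])
        return out

tokens = torch.randn(16, 512)
print(MoELayer()(tokens).shape)  # torch.Size([16, 512])
```

The key property: each token activates only k of the n experts, so total parameter count scales with n while per-token compute scales only with k.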
🚀 Day 2 of #OpenSourceWeek: DeepEP
Excited to introduce DeepEP - the first open-source EP communication library for MoE model training and inference.
✅ Efficient and optimized all-to-all communication
✅ Both intranode and internode support with NVLink and RDMA
✅…
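DeepEP's own API isn't shown in the announcement, so as a conceptual stand-in, here is a plain torch.distributed sketch of the expert-parallel all-to-all pattern such a library accelerates; `ep_dispatch` and all shapes are my own illustrative choices, not DeepEP code.

```python
# Not DeepEP's API — a generic torch.distributed sketch of EP dispatch:
# each rank ships its tokens to the ranks hosting the experts those tokens
# were routed to, and receives the tokens destined for its local experts.
import torch
import torch.distributed as dist

def ep_dispatch(tokens, dest_rank, world_size):
    """tokens: (n, d) on this rank; dest_rank: (n,) target EP rank per token."""
    order = dest_rank.argsort()                    # bucket tokens by destination
    send = tokens[order].contiguous()
    in_splits = torch.bincount(dest_rank, minlength=world_size)
    out_splits = torch.empty_like(in_splits)
    dist.all_to_all_single(out_splits, in_splits)  # exchange bucket sizes first
    recv = tokens.new_empty(int(out_splits.sum()), tokens.shape[1])
    dist.all_to_all_single(recv, send,
                           output_split_sizes=out_splits.tolist(),
                           input_split_sizes=in_splits.tolist())
    return recv  # tokens this rank's local experts should process

if __name__ == "__main__":
    # launch with: torchrun --nproc_per_node=2 ep_dispatch.py
    # CPU demo backend; at scale you'd use nccl with CUDA tensors (NVLink/RDMA)
    dist.init_process_group("gloo")
    ws = dist.get_world_size()
    tokens, dest = torch.randn(8, 4), torch.randint(0, ws, (8,))
    print(dist.get_rank(), ep_dispatch(tokens, dest, ws).shape)
    dist.destroy_process_group()
```

The return trip (gathering expert outputs back to the token-owning ranks) is the same all-to-all with the splits reversed; DeepEP's contribution is making these exchanges fast over NVLink intranode and RDMA internode.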
🚀 Day 0: Warming up for #OpenSourceWeek!
We're a tiny team @deepseek_ai exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency.
These humble building blocks in our online service have been documented,…
After 6+ months in the making and burning over a year of GPU compute time, we're super excited to finally release the "Ultra-Scale Playbook"
Check it out here: hf.co/spaces/nanotro…
A free, open-source book to learn everything about 5D parallelism, ZeRO, fast CUDA kernels,…
imo the improvements on FrontierMath are even more impressive than ARC-AGI. Jump from 2% to 25%
Terence Tao said the dataset should "resist AIs for several years at least" and "These are extremely challenging. I think that in the near term basically the only way to solve them,…
@_philschmid @amazon Hmmm, it's actually not bad. Tried my standard Martian railgun test.
Prompt:
calculate how long a mass driver rail would need to be to accelerate people comfortably at max 2Gs on mars travelling along the slope of and launching from the top of mount olympus mons and what speed…
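For reference, a back-of-envelope check of what the prompt asks (my assumptions, not the model's answer): take the target speed to be Mars escape velocity (~5.03 km/s; the ~21 km summit altitude of Olympus Mons changes it only slightly), cap acceleration at 2 g, and apply constant-acceleration kinematics v² = 2aL.

```python
# Sanity-check numbers for the prompt: constant acceleration, ignoring the
# thin Martian atmosphere. Assumed target speed: Mars escape velocity.
g = 9.81            # m/s^2, Earth gravity defines the "2G" comfort limit
a = 2 * g           # 19.62 m/s^2
v = 5.03e3          # m/s, Mars escape velocity (assumption, see above)

L = v**2 / (2 * a)  # rail length from v^2 = 2*a*L
t = v / a           # time spent accelerating on the rail

print(f"rail length ~ {L/1e3:.0f} km, time on rail ~ {t:.0f} s")
# rail length ~ 645 km, time on rail ~ 256 s
```

That puts the rail at roughly 645 km with a bit over four minutes on the track, a useful yardstick for judging the model's answer.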
Excited to share what the team has been cooking up recently. A few more big things on the horizon! 👀
PS: Please excuse the error with bold numbers in the scores.
417 Followers · 176 Following · PhD in ML, now AI Research Lead in 🇱🇺. Here mostly AI, including sharing paper reviews. Chess, philosophy, and a travel pic may appear. Opinions are my own.
2K Followers · 839 Following · Assistant Professor at @BristolUni, PhD from @UCL, prev. intern at @TikTok & @Microsoft. ✨ Reinforcement Learning, Causality, World Models.
3K Followers · 3K Following · Fixing machine learning @ https://t.co/x06CbGClKL. There is no AGI without energy-based models. As seen on HN: https://t.co/WpbTAjLvPv
17K Followers · 929 Following · Co-founder and CTO of @CoreViewHQ. GenAI/LLM addicted, Apple MLX, Microsoft 365, Azure, Kubernetes, investor in innovation, and Mensa member.
3K Followers · 342 Following · I'm a software engineer building high-performance kernels and compilers at Anthropic! Previously at Facebook/Meta (PyTorch, HHVM, ReDex)
5K Followers · 2K Following · Assistant Prof @CIS_Penn and ML Researcher at @Apple (MLR) | ex-FAIRer | PhD @HKUniversity | Research on generative AI for multimodal. I can also speak Japanese.
4K Followers · 2K Following · Researcher at @MSFTResearch. Prev: PhD at @Mila_Quebec, intern at @Apple MLR and FAIR Labs @MetaAI, math undergraduate at @PKU1898.
49K Followers · 9K Following · I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
7K Followers · 805 Following · The IJCAI conference has been the premier international gathering of researchers & practitioners in AI since 1969. 🗓️ #IJCAI2025 ❗ 16-22 August 2025 ❗ Montreal 🇨🇦
3K Followers · 2K Following · Research Scientist at Meta. 10-yr Test-of-Time ACL 22, Best Demo ACL 25, Best Resource Paper ACL 24, Best Theme Paper ACL 24, Best Student Paper NAACL 15 🏳️🌈