Y Combinator @ycombinator, Twitter Profile

Y Combinator @ycombinator

a week ago

OpenAI recently released its first open-weights model since GPT-2, entering a field led by DeepSeek and Alibaba's Qwen. Ankit (@GuptaAnkitV) breaks down these top OSS models, including what sets them apart under the hood: mixture-of-experts, long-context training, and post-training techniques that shape reasoning and alignment, and how different design choices lead to surprisingly similar performance.

00:00 – OpenAI OSS Launch
01:00 – Comparing Open Source LLM Architectures
01:46 – GPT OSS Overview
02:37 – Under The Hood of GPT OSS
03:25 – Qwen-3 Architecture
04:17 – Qwen-3 Training
05:12 – Qwen-3 Post-Training
06:08 – Qwen-3 Reasoning & RL Innovations
06:52 – DeepSeek V3 Overview
07:40 – DeepSeek V3.1 Updates
08:39 – Attention Mechanism (MLA)
09:39 – Comparing Model Sizes
10:35 – Long Context Strategies
11:25 – Reflections on Methods
12:00 – Takeaways

49 78 390 205K 293

Y Combinator @ycombinator

a week ago

Tune in: youtu.be/raTbhtKZTZA

0 4 14 10K 10

Arlan @arlanrakh

a week ago

@ycombinator @GuptaAnkitV MY GOAT ANKIT

0 0 5 728 0

Z.ai @Zai_org

a week ago

@ycombinator @GuptaAnkitV Would love to see a breakdown of GLM-4.5 one day👀

0 0 4 877 0

Arindam Majumder 𝕏 @Arindam_1729

7 days ago

@ycombinator @GuptaAnkitV This one is super insightful. Bookmarked this for the weekend.

0 0 1 198 0

Himanshu Kumar @codewithimanshu

a week ago

@ycombinator @GuptaAnkitV Open weights foster broader access, but careful consideration of responsible development remains crucial. This shift could democratize AI, prompting exciting innovation.

0 0 1 290 0

Vignesh Shenoy @vgshenoy

6 days ago

@ycombinator @GuptaAnkitV if you watched the video (or not), here are a few charts to capture this OSS👌 breakdown

0 0 3 196 2

0xagonally @0xagonally

5 days ago

@ycombinator @GuptaAnkitV pathetic cunts. do please go F yourself

0 0 1 20 0

Joy Zamoyski - Koch @JoyZamoyskiKoch

5 days ago

@ycombinator @GuptaAnkitV Say it isn’t so!

Richard Grenell @RichardGrenell

5 days ago

@ycombinator @GuptaAnkitV Say it isn’t so!

55 563 2K 60K 14

0 0 1 9 0

tulga.eth @iTulga

7 days ago

@ycombinator @GuptaAnkitV @HeyGenLabs mongolian

0 0 0 139 0

Sajib @imsajib_

a week ago

@ycombinator @GuptaAnkitV Amazing share!

0 0 0 485 0

Simeon Markoski @simeon_markoski

6 days ago

@ycombinator @GuptaAnkitV This is really interesting, can’t wait to see how these models evolve!

0 0 0 104 0

Adrian Humphrey @humphrey4thewin

a week ago

@ycombinator @GuptaAnkitV This is actually pretty cool !

0 0 0 305 0

KeW31.btc 🟧 @kew31btc

5 days ago

@ycombinator @GuptaAnkitV Long context may become the new battleground

0 0 0 13 0

sanket patel @realsanketp

a week ago

@ycombinator @GuptaAnkitV Forgot Gemma 3n 270M

0 0 0 353 0

Roger R @Live2Create_me

a week ago

@ycombinator @GuptaAnkitV Sweet! OpenAI rocks. 👍

0 0 0 254 0

Dest | NFT🎒 @desti910

4 days ago

@ycombinator @GuptaAnkitV wild that such different training philosophies are all landing at roughly the same performance ceiling

0 0 0 14 0

Tenzin Dhonyoe @_tenZdhon_

a week ago

@ycombinator @GuptaAnkitV Amazing video thanks for sharing!

0 0 0 246 0

Karan🧋 @kmeanskaran

a week ago

@ycombinator @GuptaAnkitV great share

0 0 0 380 0

Ankur Roy @ankurdorroy

6 days ago

@ycombinator @GuptaAnkitV The open-weights shift is strategic: OpenAI realizes they can't win on inference costs alone. Giving developers the weights while keeping the training infrastructure advantage is smart positioning. DeepSeek and Qwen proved open can compete on quality.

0 0 0 74 0

Zero2Unicorn @zero2unicorn_

6 days ago

@ycombinator @GuptaAnkitV A nice post

0 0 1 24 0

Shani Singh 🚀 @shani_singh1

a week ago

@ycombinator @GuptaAnkitV model is pretty solid

0 0 1 541 0

Angel Luis Ortega Ar @damnatio7

a week ago

@ycombinator @GuptaAnkitV @HeyGenLabs Spanish

1 0 0 258 0

Saumojit Santra @Jit_2077

6 days ago

@ycombinator @GuptaAnkitV Future is very exciting!!!

0 0 0 27 0

JuniJuin Enero @peacenjunity

6 days ago

@ycombinator @GuptaAnkitV this is the first timestamped vid i've seen on X

0 0 0 30 0

Alston Antony | AI, WP, SEO & SaaS Expert @antonyalston

7 days ago

@ycombinator @GuptaAnkitV Been testing all three models this week. OpenAI's feels cleaner, but DeepSeek handles complex reasoning better in my experience.