Simon Mo @simon_mo_

@vllm_project Joined July 2018

Tweets

120
Followers

1K
Following

342
Likes

143

Simon Mo @simon_mo_

a month ago

It has been 1+ month of intense work! Now time to get some sleep 😴

Zhuohan Li @zhuohan123

a month ago

It has been 1+ month of intense work! Now time to get some sleep 😴

5 7 168 15K 24

2 2 29 2K 2

Simon Mo @simon_mo_

2 months ago

I didn't expect the first section "KV-cache hit rate is the single most important metric for a production-stage AI agent" but 🤯

Yichao 'Peak' Ji @peakji

2 months ago

I didn't expect the first section "KV-cache hit rate is the single most important metric for a production-stage AI agent" but 🤯

18 75 421 104K 398

0 0 10 767 2

Simon Mo @simon_mo_

2 months ago

Long time in the making and I'm beyond excited about the future of vLLM!

PyTorch @PyTorch

2 months ago

Long time in the making and I'm beyond excited about the future of vLLM!

4 67 314 21K 78

Download Image

0 0 18 1K 0

Announcing the first Codex open source fund grant recipients: ⬩vLLM - inference serving engine @vllm_project ⬩OWASP Nettacker - automated network pentesting @iotscan ⬩Pulumi - infrastructure as code in any language @PulumiCorp ⬩Dagster - cloud-native data pipelines @dagster…

36 147 943 95K 262

Simon Mo @simon_mo_

5 months ago

😲 super cool !!! Reminded me of Kevin's thesis "Structured Contexts For Large Language Models" and this is such a natural continuation of the idea.

Letta @Letta_AI

5 months ago

😲 super cool !!! Reminded me of Kevin's thesis "Structured Contexts For Large Language Models" and this is such a natural continuation of the idea.

11 35 154 56K 99

0 3 7 1K 0

vLLM @vllm_project

5 months ago

🙏 @deepseek_ai's highly performant inference engine is built on top of vLLM. Now they are open-sourcing the engine the right way: instead of a separate repo, they are bringing changes to the open source community so everyone can immediately benefit! github.com/deepseek-ai/op…

25 348 2K 202K 764

Simon Mo @simon_mo_

6 months ago

Having been at every single vLLM meetup, I won't miss this one :D Looking forward to meet all the vLLM users in Boston!

vLLM @vllm_project

7 months ago

Having been at every single vLLM meetup, I won't miss this one :D Looking forward to meet all the vLLM users in Boston!

2 5 24 6K 3

2 1 18 3K 1

Character.AI @character_ai

7 months ago

it's Catacter AI now 😼

19 13 389 29K 8

Robert Shaw @robertshaw21

7 months ago

Landed my first PR in @vllm_project 1 year ago today (github.com/vllm-project/v…) 38K LOC and 100+ PRs later and we are just getting started

0 3 34 5K 0

Roger Wang @rogerw0108

7 months ago

Robert and I started contributing to vLLM around the same time and today is my turn. Back then vLLM had only about 30 contributors. One year later, today the project has received contributions from 800+ community members! and we're just getting started github.com/vllm-project/v…

Robert Shaw @robertshaw21

7 months ago

0 3 34 5K 0

5 4 52 6K 5

vLLM @vllm_project

7 months ago

We landed the 1st batch of enhancements to the @deepseek_ai models, starting MLA and cutlass fp8 kernels. Compared to v0.7.0, we offer ~3x the generation throughput, ~10x the memory capacity for tokens, and horizontal context scalability with pipeline parallelism.

50 102 740 90K 321

Download Image

Costa Huang @vwxyzjn

7 months ago

Finally, I want to give a special thanks to the @vllm_project team (@KaichaoYou @woosuk_k @simon_mo_ @zhuohan123) for their invaluable support in debugging NCCL weight transfer issues. They made our 70 RLVR weight transfer 45x faster and 405B RLVR even possible! See…

0 6 28 7K 6

Simon Mo @simon_mo_

7 months ago

Our biggest milestone yet! I'm particularly excited how the vLLM contributor community organized from many organization to deliver a high quality V1 engine core. We are just getting started 🚀

vLLM @vllm_project

7 months ago

Our biggest milestone yet! I'm particularly excited how the vLLM contributor community organized from many organization to deliver a high quality V1 engine core. We are just getting started 🚀

15 95 645 84K 193

Download Image

2 0 22 1K 1

NovaSky @NovaSkyAI

8 months ago

1/6 🚀 Introducing Sky-T1-32B-Preview, our fully open-source reasoning model that matches o1-preview on popular reasoning and coding benchmarks — trained under $450! 📊Blog: novasky-ai.github.io/posts/sky-t1/ 🏋️‍♀️Model weights: huggingface.co/NovaSky-AI/Sky…