Announcing the first Codex open source fund grant recipients:
⬩vLLM - inference serving engine @vllm_project
⬩OWASP Nettacker - automated network pentesting @iotscan
⬩Pulumi - infrastructure as code in any language @PulumiCorp
⬩Dagster - cloud-native data pipelines @dagster…
🙏 @deepseek_ai's highly performant inference engine is built on top of vLLM. Now they are open-sourcing the engine the right way: instead of a separate repo, they are bringing changes to the open source community so everyone can immediately benefit!
github.com/deepseek-ai/op…
Robert and I started contributing to vLLM around the same time and today is my turn.
Back then vLLM had only about 30 contributors. One year later, today the project has received contributions from 800+ community members!
and we're just getting started
github.com/vllm-project/v…
Robert and I started contributing to vLLM around the same time and today is my turn.
Back then vLLM had only about 30 contributors. One year later, today the project has received contributions from 800+ community members!
and we're just getting started
github.com/vllm-project/v…
We landed the 1st batch of enhancements to the @deepseek_ai models, starting MLA and cutlass fp8 kernels. Compared to v0.7.0, we offer ~3x the generation throughput, ~10x the memory capacity for tokens, and horizontal context scalability with pipeline parallelism.
Finally, I want to give a special thanks to the @vllm_project team (@KaichaoYou@woosuk_k@simon_mo_@zhuohan123) for their invaluable support in debugging NCCL weight transfer issues.
They made our 70 RLVR weight transfer 45x faster and 405B RLVR even possible!
See…
Our biggest milestone yet! I'm particularly excited how the vLLM contributor community organized from many organization to deliver a high quality V1 engine core. We are just getting started 🚀
Our biggest milestone yet! I'm particularly excited how the vLLM contributor community organized from many organization to deliver a high quality V1 engine core. We are just getting started 🚀
1/6 🚀
Introducing Sky-T1-32B-Preview, our fully open-source reasoning model that matches o1-preview on popular reasoning and coding benchmarks — trained under $450!
📊Blog: novasky-ai.github.io/posts/sky-t1/
🏋️♀️Model weights: huggingface.co/NovaSky-AI/Sky…
28 Followers 569 FollowingML engineer at Spike Technologies. Previously, a machine learning master's student at @Cambridge_Uni. Excited about opportunities in deep-learning and LLMs.
985 Followers 3K FollowingRetail Investor. I do NOT provide financial advice. My messages are my personal opinions only. In my opinion, there is no such thing called DUMB MONEY.
51K Followers 288 Followingmarketing at @linera_io. running @torproject relays on @raspberry_pi and with @emeraldonion. free and open-source soft/hardware.
4.3M Followers 3 FollowingOpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6Lg202
228 Followers 226 FollowingCS PhD Student working on diverse AI generation advised by @profjoeyg and Sanjit Seshia. formerly @aiatmeta, @nuro, @columbia
3K Followers 856 FollowingFounding Team Thinking Machines. Previously @c.ai. I like zsh aliases, audiobooks, and running. This does not reflect the opinions of my future self.
92K Followers 207 FollowingLMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / @lmsysorg. We’re hiring: https://t.co/1OkfLq2n0I
5K Followers 1K FollowingCo-founder @allhands_ai, building OpenHands | PhD candidate @IllinoisCDS | BS @UMichCSE ('22) | Ex Intern @GoogleAI @Microsoft | Opinions are my own