Modular engineers are using Mojo with Nvidia Blackwell to make matrix multiplication faster than cuBLAS. In part 1 of our series, we explain what matmul is, why it’s fundamental to LLMs, and give a quick history from Ampere to Blackwell.
modular.com/blog/matrix-mu…
Cerebras Code: 20x faster than Claude, 1x the price
Today we are launching two monthly coding plans:
➡️Cerebras Code Pro: $50/m – for indie developers
➡️Cerebras Code Max: $200/m – for power users with 5x rate limits
Both plans get: Qwen3-Coder at 2,000 tokens/s, 131K context,…
developer.nvidia.com/blog/cutlass-p… marks the start of a short series of blogposts about CUTLASS 3.x and CuTe that we've been meaning to write for years. There are a few more parts to come still, hope you enjoy!
Have been thinking about this and it actually makes a lot of sense.
Imports are completely meaningless so I made a neovim plugin to automatically fold imports in every langauge I use using treesitter (works in C, Rust, C++, OCaml, (Type/Java)script, Zig, and Python so far)…
Have been thinking about this and it actually makes a lot of sense.
Imports are completely meaningless so I made a neovim plugin to automatically fold imports in every langauge I use using treesitter (works in C, Rust, C++, OCaml, (Type/Java)script, Zig, and Python so far)… https://t.co/fX9BpGtZ2i
Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms
Important paper explaining NCCL’s internal architecture and how it works. Personally it was very difficult to understand how NCCL works because of bad documentation and complex code. Hopefully…
You know the prompts. The issue is you’re generating too few tokens.
Generate more output.
It comes with prefix caching: that’s how you compound learning.
KV-Cache decays over time. Make sure to prefill with quality tokens, not TikTok shorts.
You know the prompts. The issue is you’re generating too few tokens.
Generate more output.
It comes with prefix caching: that’s how you compound learning.
KV-Cache decays over time. Make sure to prefill with quality tokens, not TikTok shorts.
12K Followers 3K Followingresearch @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
584 Followers 456 Followingbitter lesson pilled |
cold showers |
in omnibus excelsior |
polyglot |
carpe diem |
do hard things |
the laws of physics are the only limit
2K Followers 581 FollowingAssistant Prof @CornellECE and cofounder/Chief Science Officer at @mako_dev_ai. At the intersection of machine learning and hardware. Father. Muslim.
3K Followers 90 Followingcreator of @electronjs, check https://t.co/ZDJujd4Nql for the open source things I built.
currently sponsored to write a CUDA backend for MLX.
2K Followers 362 Following18, dev @lovable_dev, ex @ensdomains, made OSS apps with 310k+ users at 14, high school speedrunner, self-taught chinese speaker 学习中文
775K Followers 4 FollowingA platform for illuminating academic papers. We annotate and share a paper every week. Save, annotate and share papers with anyone: https://t.co/0o2Pls3jmo
31K Followers 2K FollowingLead Engineer at @AIPRMcorp (https://t.co/fepyWfV4kA) and @lrt_co (https://t.co/p7LEvIKduG), building AIPRM for ChatGPT & Claude. Signal @ btibor.91
2K Followers 679 Followingenjoying the late pre-agi; making llms go brrr @Aleph__Alpha; yapping about economics of AI systems at https://t.co/tbsybxOMHz
555K Followers 132 FollowingFather of three, Creator of Ruby on Rails + Omarchy, Co-owner & CTO of 37signals, Shopify director, NYT best-selling author, and Le Mans 24h class-winner.
8K Followers 167 FollowingLarge Model Systems Organization: Join our Slack: https://t.co/mSPNyKTLTS We developed SGLang https://t.co/jEqIJcGwGA, Chatbot Arena (now @lmarena_ai), and Vicuna!
35K Followers 2K FollowingHost of The Yaron Brook Show on YouTube: https://t.co/FpSRAOXEWN Chairman, @AynRandInst Co-author of Free Market Revolution & Equal is Unfair. BSc MBA PhD
20K Followers 452 Followingphysics of language models @ Meta (FAIR, not GenAI)
🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS
🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
603K Followers 5K FollowingPresident & CEO @ycombinator —Founder @Initialized—designer/engineer who helps founders—San Francisco Dem accelerating the boom loop—e/acc—technology brother
15K Followers 1K FollowingCo-founder and CEO @Hyperbolic_Labs. ex-@avax & ex-@citsecurities. Finished Math PhD in 2yrs @UCBerkeley. Math Olympiad Gold Medalist. Highest honor @PKU1898