Sudhir Arora @sudarora
Joined April 2009-
Tweets16
-
Followers20
-
Following419
-
Likes22
Intra-sm, inter-sm, and cross-gpu overlapping within a single kernel for llama 70b! Rather than compilers to fuse ops, we explore using an interpreter. While interpreters are often considered slow, the large granularity of ML operations and low overheads of this impl. allow large…
Intra-sm, inter-sm, and cross-gpu overlapping within a single kernel for llama 70b! Rather than compilers to fuse ops, we explore using an interpreter. While interpreters are often considered slow, the large granularity of ML operations and low overheads of this impl. allow large…
Very excited to share that I've finished my phd @Stanford and will be joining @Caltech’s cms department as an assistant professor. Looking forward to working with students and colleagues on ml systems! Grateful to my amazing advisor and labmates @HazyResearch for the best time…
We are ending strong with GPU Programming 🚀! 2 talks today back to back! First @exists_forall for intro to CUDA and then @simran_s_arora for Thunder Kittens 🐈! Today at: 1:00pm EST / 11:00am PT - scale-ml.org/bootcamp
There’s been tons of work on KV-cache compression and KV-cache free Transformer-alternatives (SSMs, linear attention) models for long-context, but we know there’s no free lunch with these methods. The quality-memory tradeoffs are annoying. *Is all lost?* Introducing CARTRIDGES:…
~ megakernels ~ are next 🚀 excited to share a single kernel that runs a full llama 1b forwards pass!
~ megakernels ~ are next 🚀 excited to share a single kernel that runs a full llama 1b forwards pass!
BASED ✌️ turns 1! One year since its launch at NeurIPS 2023 — and it's helped shape the new wave of efficient LMs. ⚡️ Fastest linear attention kernels 🧠 405B models trained on 16 GPUs 💥 Inspired Mamba-v2, RWKVs, MiniMax Checkout our retrospective below!
happy thanksgiving!! we're excited to share new thunderkittens blog posts this week for your enjoyment over the holiday break. today, we're starting off with fp8 support!!! 1500 tflops in <95 lines of code 🔥
Wish writing AI kernels was like writing PyTorch??? Enter ThunderKittens 0.002: for simpler, faster, more adorable AI kernels! We use TK to provide 10-40% faster attention backwards, CuBLAS-speed GEMMs, 8x faster state space models, 14x faster linear attentions – averaging <200…
Want Llama 405B, but wish it scaled linearly in sequence length??? Enter LoLCATS: an efficient method for "turning Transformers to linear attention models", all on an academic budget!! We use LoLCATS to linearize the *full Llama 3.1 model family* for the first time – 20+ points…
Enjoyed attending STOC 2024 recently to share some of the theoretical underpinnings of our work on Zoology, Based, and Just Read Twice! It’s such a wonderful community! Sharing slides and an explainer blogpost we wrote in conjunction with the talk for those interested:
Excited to share Just read twice: going beyond causal language modeling to close quality gaps between efficient recurrent models and attention-based models!! There’s so much recent progress on recurrent architectures, which are dramatically more memory efficient and…
Excited to release Based, an architecture that combines two✌️ simple, familiar, attention-like primitives – short (size-64) sliding window attention and softmax-approximating linear attention – to enable high quality and efficient inference! 💨 🚀 joint w/ @EyubogluSabri,…
KV-cache got you down? Sharing Based✌️, a simple architecture built from PyTorch 101 building blocks (convolutions, linear attention). It gives exciting quality vs the modern architectures & its hidden state size is fixed, enabling 4.5x >throughput vs attn hazyresearch.stanford.edu/blog/2023-12-1…
LMs can be expensive for document processing. E.g., inference over the 55M Wiki pages costs >$100K (>$0.002/1k toks)💰 We propose a strategy that reduces inference cost by 110x and can even improve quality vs. running inference over each doc directly! 💻 github.com/HazyResearch/e…
Popular frameworks for personal ML e.g. federated learning display a tension between privacy and quality. We introduce a simple new framework, FOCUS, based on shipping foundation models to private silos, guaranteeing **perfect secrecy** When and where does this work, if at all?
Reasoning over both public and *private* data is necessary for personalized ML systems. But how can we use personal context without exposing it to the world? 🔑🔒 We explore this question in new work on personalized and private retrieval systems!! dl.fbaipublicfiles.com/concurrentqa/r…
Reasoning over both public and *private* data is necessary for personalized ML systems. But how can we use personal context without exposing it to the world? 🔑🔒 We explore this question in new work on personalized and private retrieval systems!! dl.fbaipublicfiles.com/concurrentqa/r…

luna @luna_marelli
783 Followers 2K Following Singles|Entrepreneurs|Poland Reading immerses me in different worlds broadening my perspective Each book is a conversation with great minds🇺🇸🇺🇸🇺🇸
Ashmeet Sidana 🤠 @ashmeetsidana
2K Followers 899 Following Chief Engineer, Engineering Capital - a VC who leads seed rounds based on technical insights. https://t.co/QfziL8tluz
てっこ @tekko124227
154 Followers 613 Following 21めす // 160cm // Dcup // お泊りとかしたい // ヒマヒマ // 仲良くなったらなんでも◎
さとみん @satomin43553754
132 Followers 700 Following 162cm Cかぷ せふ欲しい かまってちゃん 🌷 からみましょ https://t.co/Qv9dlTbN7j
I @INnnokm
49 Followers 6 Following
Rioga Premium Real Es... @Rioga_Premium
98 Followers 425 Following Expert Real Estate Advisory in Mumbai | 11 Years of Experience | 160+ Mumbai projects | 42 Grade-A Developers Take Yourself Home
Nikhar Arora @nikyarora41
37 Followers 283 Following
Poonam Mishra @tweettopoonam
40 Followers 88 Following Solution Sales Exec with a passion to transform enterprises to cloud solutions. Expertise in services contracts, discovery-solution architecture blueprint.
bianka sanchez @bianka_201l5m
86 Followers 1K Following
Michael Dunford @PrincessLd1760
65 Followers 834 Following
Ashmeet Sidana 🤠 @ashmeetsidana
2K Followers 899 Following Chief Engineer, Engineering Capital - a VC who leads seed rounds based on technical insights. https://t.co/QfziL8tluz
GENERAL @iicn1
131 Followers 67 Following
M. @Matr_i_x
511 Followers 306 Following
ابو نوره @nwar_1111
214 Followers 1 Following
Nxksk @Nxksk4
215 Followers 0 Following
Alvin Driscoll @AlvinDriscoll
216 Followers 0 Following
لأجلك عهد ا�... @For_ahad_18
214 Followers 0 Following استغفر الله واتوب اليه استغفر الله العظيم @18_ahad
nour. @No538i
216 Followers 0 Following
jhsbfisancas saoinci @heuheuehjj
213 Followers 38 Following
I @INnnokm
49 Followers 6 Following
- @sarangsoc
216 Followers 58 Following
نوني @noo696
215 Followers 0 Following
Nnnbj @Nnnbj6
215 Followers 1 Following
Fafali @Fafali83074313
212 Followers 0 Following
غفر الله لي ... @kjhyuimnhj
212 Followers 0 Following
nora @NoraNnoras
49 Followers 0 Following
djdncnc @nnrkfk_djdncnc
48 Followers 2 Following
N.. @ncpstp
215 Followers 18 Following
نيران @noran1993
263 Followers 0 Following
رووووني @noal48fo
107 Followers 202 Following
hamid_almasabi @hamid_almasabi
213 Followers 1 Following
nm21_f @nm21_f
213 Followers 30 Following
نواف علي شه�... @nawaf_1333
213 Followers 0 Following
Reid Castillo @ReidCastillo
218 Followers 0 Following
خويّل✨ @khwlah__
212 Followers 0 Following
1 @1Nnnrhh
48 Followers 21 Following
Bondi_TooT @acacfon
214 Followers 0 Following
Nonn @noonj1g
215 Followers 1 Following
ole sarge @SargeOle
213 Followers 0 Following
HOWLETT MAYESKI @HowlettNoaksl
50 Followers 0 Following
خويلد @vejd3
209 Followers 4 Following
منن @ss123_qw
213 Followers 20 Following
Noemi Nuñez @NoemiNuez2
215 Followers 4 Following
@Jeeh_Joga_Bonito @Jeeh_JogaBonito
234 Followers 15 Following 19 aninhos , boleiro sempre , fé no pai que o inimigo cai ... Deus 33
Moe フォロー返�... @moe000712
215 Followers 1 Following
اخو سارا @yvB5gWQyhALX3gV
57 Followers 12 Following
Nnn @Nnn56371115
216 Followers 0 Following
bieber kidradrauhl @kidradrauhl
216 Followers 0 Following
R_ @Roof_555
212 Followers 0 Following
سعدالهاملي... @abo__fahd1
455 Followers 774 Following
Jjjb @Jjjb05525409
213 Followers 0 Following
mam7ona @naane_11
258 Followers 56 Following