FlashAttention in 3D? Our latest blog explores the #kernel design of 2-Simplicial #Attention, modeling the algorithm with a hardware aligned design and rewriting the entire kernel in TLX (Triton Low Level Extensions).
🔗 hubs.la/Q03H6S9D0#PyTorch#OpenSourceAI
New in-depth blog post - "Inside vLLM: Anatomy of a High-Throughput LLM Inference System". Probably the most in depth explanation of how LLM inference engines and vLLM in particular work!
Took me a while to get this level of understanding of the codebase and then to write up…
I really love this, that's how we work at HF and it's always nice to see other ppl having the same mentality.
They are building very good and open research and you should make sure to follow @dlwh@WilliamBarrHeld@wen_kaiyue@percyliang ⛵
I really love this, that's how we work at HF and it's always nice to see other ppl having the same mentality.
They are building very good and open research and you should make sure to follow @dlwh@WilliamBarrHeld@wen_kaiyue@percyliang ⛵
Advice for young researchers. Always make sure you have good signal. Asking fewer questions and answering them correctly is better than more breadth with incorrect or incomplete conclusions.
Advice for young researchers. Always make sure you have good signal. Asking fewer questions and answering them correctly is better than more breadth with incorrect or incomplete conclusions.
(1/n) Check out our new paper: "Fantastic Pretraining Optimizers and Where to Find Them"! >4000 models to find the fastest optimizer! 2× speedups over AdamW? Unlikely. Beware under-tuned baseline or limited scale! E.g. Muon: ~40% speedups <0.5B & only 10% at 1.2B (8× Chinchilla)!
Fuck it. Today, we open source FineVision: the finest curation of datasets for VLMs, over 200 sources!
> 20% improvement across 10 benchmarks
> 17M unique images
> 10B answer tokens
> New capabilities: GUI navigation, pointing, counting
FineVision 10x’s open-source VLMs.
Today, we are releasing FineVision, a huge open-source dataset for training state-of-the-art Vision-Language Models:
> 17.3M images
> 24.3M samples
> 88.9M turns
> 9.5B answer tokens
Here are my favourite findings:
Hey devs! We're running a survey on large models to help us design what's next. We'd love your input! We'll be sharing the anonymized results with the community to show what the industry needs. Scan the QR code and fill it out here!
嘿开发者们,我们需要你的声音!…
Worth mentioning that it's not just `model.to(nvfp4)` but more reminiscent of fp16 than of bf16.
Though I'm sure someone will write a monkey-patching autonvfp4 context manager.
bf4 when @JeffDean ??
Worth mentioning that it's not just `model.to(nvfp4)` but more reminiscent of fp16 than of bf16.
Though I'm sure someone will write a monkey-patching autonvfp4 context manager.
bf4 when @JeffDean ?? https://t.co/x5hn4L2cwY
join us tomorrow for our 2nd AMA on r/LocalLLaMA with Hugging Face Science, the team behind SmolLM, SmolVLM, and more
this is going to be a good one, for many reasons but i cannot say more than that at the moment :)
don't miss it, tomorrow 8am-11am PST
438 Followers 6K Following🔍 Data Scientist | Data Analyst
📈 Python | SQL | Power BI | Machine Learning | AI
🔗 https://t.co/TOM7nTDmXq
Portfolio: https://t.co/7cQtfTpKym
18K Followers 2K FollowingInactive here outside of posting additional reasons for leaving X.
For fun science topics, other social media options listed on https://t.co/72AVrLhfUr
315 Followers 227 FollowingWorking on LLM/VLM Tool Learning and Reasoning at Tsinghua and Bytedance, reading at least one paper a day — The future will not invent itself.
27K Followers 1 FollowingNano Banana 🍌, aka Gemini 2.5 Flash Image, the world's most powerful image editing and generation model! Try it for free in the @GeminiApp
11K Followers 747 Followingslightly less attractive cofounder @AskEureka: we’re replacing all doctors with AI. I tweet abt healthcare and tech, prev @Harvard @Google @BCG, dm to say hi :)
470 Followers 622 FollowingMember of Technical Staff @xAI Prev: @PyTorch @MetaAI @Google @Snowflake ML systems. Believer in "The Bitter Lesson" Loves funny tweets. Opinions are my own.
703 Followers 56 FollowingTech journalist with 12+ years covering AI & global innovation (lots on China) Based in Beijing. Currently running Dailyio & HelloChinaTech.