Jiawei Zhao @jiawzhao

Research Scientist at Meta FAIR @AIatMeta, PhD @Caltech, GaLore, DeepConf · jiaweizhao.com · Joined February 2013

Tweets: 91 · Followers: 3K · Following: 242 · Likes: 192

Jiawei Zhao @jiawzhao

a week ago

Want to try DeepConf NOW? While our full repo is coming, we just dropped a ready-to-run example in our vLLM (@vllm_project) PR: github.com/vllm-project/v… DeepConf + DeepSeek-R1-8B + BRUMO25 =
• 93.3% accuracy (+2.5% boost)
• 52.9% fewer tokens generated
• 31% faster…

Jiawei Zhao @jiawzhao

2 weeks ago

62 332 3K 445K 2K

2 18 126 13K 66

Jiawei Zhao @jiawzhao

2 weeks ago

Thanks @vllm_project folks for pushing this. Please give it a try and let us know!

vLLM @vllm_project

2 weeks ago

4 40 317 47K 115

2 7 134 13K 48

Jiawei Zhao @jiawzhao

2 weeks ago

⏰ Submission deadline coming up fast! (Sep 1) Working on efficient reasoning? Don’t miss the chance to share it at NeurIPS 2025!

Cheng Luo @ChengLuo_lc

2 weeks ago

0 4 27 34K 14

0 1 5 3K 3

Yuandong Tian @tydsh

2 weeks ago

We released DeepConf, which can achieve 99.9% on AIME'25 with open-source models using only 15% of the compute compared to majority voting@512. The secret? Simple. Just prune the rollouts if they show a consecutive stream of low confidence 😀. Can be applied to any models…

Jiawei Zhao @jiawzhao

2 weeks ago

62 332 3K 445K 2K

10 49 368 48K 207
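
The pruning idea in the tweet above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the DeepConf implementation: the per-token confidence scores, the `threshold`, the `window` length, and the function names are all hypothetical, and the tweet does not say how confidence is measured. The sketch drops any rollout that contains `window` consecutive low-confidence tokens and takes a majority vote over the survivors.

```python
from collections import Counter
from typing import Dict, List, Optional


def has_low_confidence_streak(confidences: List[float], threshold: float, window: int) -> bool:
    """Return True if `window` consecutive scores all fall below `threshold`."""
    streak = 0
    for score in confidences:
        # Extend the streak on a low score, reset it on a confident one.
        streak = streak + 1 if score < threshold else 0
        if streak >= window:
            return True
    return False


def prune_rollouts(rollouts: List[Dict], threshold: float, window: int) -> List[Dict]:
    """Keep only rollouts without a sustained low-confidence streak (hypothetical rule)."""
    return [
        r for r in rollouts
        if not has_low_confidence_streak(r["confidences"], threshold, window)
    ]


def majority_vote(rollouts: List[Dict]) -> Optional[str]:
    """Return the most common answer among the given rollouts, or None if empty."""
    if not rollouts:
        return None
    return Counter(r["answer"] for r in rollouts).most_common(1)[0][0]


# Toy data: the scores are made up for illustration.
rollouts = [
    {"answer": "42", "confidences": [0.9, 0.8, 0.85, 0.9]},
    {"answer": "7", "confidences": [0.9, 0.2, 0.1, 0.15]},   # three low scores in a row
    {"answer": "42", "confidences": [0.7, 0.3, 0.8, 0.2]},   # low scores, but not consecutive
]
kept = prune_rollouts(rollouts, threshold=0.5, window=3)
final_answer = majority_vote(kept)
```

In a real serving setup the check would presumably run during generation so that a pruned rollout stops consuming tokens; here it is applied after the fact to keep the example self-contained.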

Jiawei Zhao @jiawzhao

2 weeks ago

Thank you for sharing it! @_akhaliq x.com/jiawzhao/statu…

AK @_akhaliq

2 weeks ago

4 11 85 28K 31

0 1 6 1K 3

Jiawei Zhao @jiawzhao

2 weeks ago

Excited to see logarithmic formats (LNS, UE8M0 FP8) used in production by @deepseek_ai! LNS enables efficient multiplication (just addition between exponents) + great dynamic range. Our LNS-Madam optimizer, built for LNS, was proposed years ago, before the LLM era - hope it shines again!

Prof. Anima Anandkumar @AnimaAnandkumar

2 weeks ago

14 106 612 72K 414

0 7 40 8K 21
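
The remark that multiplication in a logarithmic number system (LNS) is "just addition between exponents" can be illustrated with a small Python sketch. It assumes a simplified representation (a sign plus a base-2 logarithm of the magnitude, stored as a float); the `LNS` class and helper names are hypothetical and do not correspond to any particular hardware format or to the LNS-Madam code.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class LNS:
    """Simplified LNS value: x = sign * 2 ** log2_mag (zero is not representable here)."""
    sign: int        # +1 or -1
    log2_mag: float  # base-2 logarithm of |x|


def encode(x: float) -> LNS:
    """Convert a nonzero float to the simplified LNS form."""
    if x == 0:
        raise ValueError("zero has no finite logarithm in this sketch")
    return LNS(1 if x > 0 else -1, math.log2(abs(x)))


def decode(v: LNS) -> float:
    """Convert back to an ordinary float."""
    return v.sign * 2.0 ** v.log2_mag


def multiply(a: LNS, b: LNS) -> LNS:
    """Multiply two LNS values: signs multiply, exponents add."""
    return LNS(a.sign * b.sign, a.log2_mag + b.log2_mag)


product = decode(multiply(encode(8.0), encode(-0.5)))  # 2**3 * -(2**-1) = -4.0
```

The trade-off is that addition of two LNS values is no longer a single cheap operation, which is one reason a format like this usually needs purpose-built arithmetic and optimizers.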

Jiawei Zhao @jiawzhao

3 months ago

You can skip prompts that aren’t useful for the current policy during training! 🔍 Efficient prompt selection is key to scaling RL training for LLM reasoning. We are actively building algorithms for efficient and scalable RL training systems. Stay tuned!

Infini-AI-Lab @InfiniAILab

3 months ago

1 29 165 31K 145

2 5 21 2K 2
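
One way to picture "skipping prompts that aren't useful for the current policy" is the sketch below. The selection rule is an assumption made for illustration, not the method from the work referenced in the tweet: a prompt is treated as uninformative when all of its most recent rollouts received the same reward, since a group-relative advantage would then be zero for every sample. The function names and the `history` structure are hypothetical.

```python
from typing import Dict, List


def has_learning_signal(rewards: List[float]) -> bool:
    """True if the rollouts for a prompt did not all receive the same reward."""
    return len(set(rewards)) > 1


def select_prompts(history: Dict[str, List[float]]) -> List[str]:
    """Pick prompts worth sampling again (assumed rule, for illustration only).

    Prompts with no recorded rollouts are kept, since nothing is known about them yet.
    """
    return [
        prompt
        for prompt, rewards in history.items()
        if not rewards or has_learning_signal(rewards)
    ]


# Toy reward history: 1.0 = correct answer, 0.0 = incorrect answer.
history = {
    "prompt_a": [1.0, 1.0, 1.0, 1.0],  # always solved: skipped
    "prompt_b": [0.0, 1.0, 0.0, 1.0],  # mixed results: kept
    "prompt_c": [0.0, 0.0, 0.0, 0.0],  # never solved: skipped
    "prompt_d": [],                    # unseen: kept
}
selected = select_prompts(history)
```

A practical system would likely re-check skipped prompts from time to time, because a prompt that is too hard now may become informative as the policy improves.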

Infini-AI-Lab @InfiniAILab

3 months ago

🥳 Happy to share our new work – Kinetics: Rethinking Test-Time Scaling Laws 🤔 How to effectively build a powerful reasoning agent? Existing compute-optimal scaling laws suggest 64K thinking tokens + 1.7B model > 32B model. But it only shows half of the picture! 🚨 The O(N²)…

7 71 246 78K 163

Zhuang Liu @liuzhuang1234

6 months ago

New paper - Transformers, but without normalization layers (1/n)

77 599 4K 1.3M 3K

Zechun Liu @zechunliu

7 months ago

Our ParetoQ is substantially better than previous work on ternary LLMs, such as the 1-bit era paper.

Yuandong Tian @tydsh

7 months ago

Our ParetoQ is substantially better than previous work on ternary LLMs, such as the 1-bit era paper. https://t.co/YSiiJfyYVn

2 14 75 9K 21

0 7 24 5K 5

Yuandong Tian @tydsh

7 months ago

We introduce ParetoQ, a series of pre-trained models that show SoTA in ternary (1.58-bit) and 2/3/4-bit quantization for SLMs (up to 3B parameters), using initial full pre-training + QAT later. In addition, we also discover that the representation changes substantially after low-bit…

2 14 75 9K 21
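
The "1.58-bit" figure for ternary weights comes from log2(3) ≈ 1.585, the information content of a value that takes one of three states. The Python sketch below shows that arithmetic alongside a simple ternary quantizer. The quantizer uses a mean-absolute-value scale followed by rounding and clipping, which is an assumption chosen for illustration; the tweet does not describe ParetoQ's quantization function, and the names here are hypothetical.

```python
import math
from typing import List, Tuple


def ternary_quantize(weights: List[float]) -> Tuple[List[int], float]:
    """Map weights to codes in {-1, 0, +1} plus one shared scale (illustrative rule)."""
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0:
        return [0] * len(weights), 0.0
    # Round to the nearest integer, then clip into the ternary range.
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes: List[int], scale: float) -> List[float]:
    """Reconstruct approximate weights from ternary codes."""
    return [c * scale for c in codes]


bits_per_weight = math.log2(3)  # about 1.585 bits for three states

codes, scale = ternary_quantize([0.9, -0.05, -1.2, 0.4])
approx = dequantize(codes, scale)
```

With these toy weights the small value collapses to 0 and the others map to ±1, so only the sign pattern and one scale factor are stored.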

Yuandong Tian @tydsh

8 months ago

Our Coconut work (learning continuous latent CoT) is now open-sourced. You're welcome to play with it: github.com/facebookresear…

22 270 2K 159K 1K

Kaiyu Yang @KaiyuYang4

9 months ago

🚀 Excited to share our position paper: "Formal Mathematical Reasoning: A New Frontier in AI"! 🔗 arxiv.org/abs/2412.16075 LLMs like o1 & o3 have tackled hard math problems by scaling test-time compute. What's next for AI4Math? We advocate for formal mathematical reasoning,…