Kaiwen Wang @kaiwenw_ai

RL PhD @Cornell_Tech. @Google PhD Fellow. kaiwenw.github.io NYC Joined February 2020

Tweets

71
Followers

353
Following

511
Likes

81

Kaiwen Wang @kaiwenw_ai

2 months ago

Correction re the time: my posters on Q# and VGS at @ai4mathworkshop is happening today from 10:50 am to 12:20 pm. Hope to see you there! x.com/kaiwenw_ai/sta…

Kaiwen Wang @kaiwenw_ai

2 months ago

Correction re the time: my posters on Q# and VGS at @ai4mathworkshop is happening today from 10:50 am to 12:20 pm. Hope to see you there! x.com/kaiwenw_ai/sta…

2 16 47 12K 21

0 1 4 543 1

AI for Math Workshop @ ICML 2025 @ai4mathworkshop

2 months ago

It's happening today! 📍Location: West Ballroom C, Vancouver Convention Center ⌚️Time: 8:30 am - 6:00 pm 🎥 Livestream: icml.cc/virtual/2025/w… #icml2025 #icml25 #icml #aiformath #ai4math #workshop

0 11 20 3K 5

Download Image

This captures something fundamental we're seeing in AI right now! The shift from just scaling pre-training to scaling test-time compute is huge. Our Q# + VGS work shows how value-based methods can guide models through the vast implicit graphs of reasoning possibilities.

Kaiwen Wang @kaiwenw_ai

2 months ago

2 16 47 12K 21

0 2 6 631 0

Wen Sun @WenSun1

2 months ago

How can small LLMs match or even surpass frontier models like DeepSeek R1 and o3 Mini in math competition (AIME & HMMT) reasoning? Prior work seems to suggest that ideas like PRMs do not really work or scale well for long context reasoning. @kaiwenw_ai will reveal how a novel…

Kaiwen Wang @kaiwenw_ai

2 months ago

2 16 47 12K 21

0 9 23 6K 15

Jon Richens @jonathanrichens

3 months ago

Are world models necessary to achieve human-level agents, or is there a model-free short-cut? Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer, agents _are_ world models… 🧵

33 176 1K 182K 1K

Download Image

Jason Gauci @NeuralNets4Life

6 months ago

I've made FANG billions of $ with reinforcement learning, so this episode is a long-time coming :-). Episode 180: Reinforcement Learning, drops on Monday! patreon.com/posts/180-lear…

0 2 3 232 0

Kaiwen Wang @kaiwenw_ai

9 months ago

Join us @pluralistic_ai workshop at #NeurIPS to learn more about CLP! 🗓️ Sat, 14 Dec, 2024 🕙 10:40-11:40am PST 📍 West Meeting Room 116, 117 🔗 arxiv.org/abs/2407.15762 x.com/kaiwenw_ai/sta…

Kaiwen Wang @kaiwenw_ai

10 months ago

1 33 171 20K 127

Download Image

0 2 8 711 0

Kaiwen Wang @kaiwenw_ai

9 months ago

Making inferences robust to distribution shifts and hidden confounders is paramount for decision making under uncertainty. At the upcoming @NeurIPSConf, I’m excited to present our efficient and sharp algorithm for off-policy evaluation in robust markov decision processes. Many…

0 7 27 2K 5

Download Image

Jason Wei @_jasonwei

9 months ago

2022: I never wrote a RL paper or worked with a RL researcher. I didn’t think RL was crucial for AGI Now: I think about RL every day. My code is optimized for RL. The data I create is designed just for RL. I even view life through the lens of RL Crazy how quickly life changes