Adam Wei @adamwei_

Robotics PhD student @MIT_CSAIL Joined April 2025

Tweets

15
Followers

138
Following

75
Likes

51

Jyo Pari @jyo_pari

2 days ago

For agents to improve over time, they can’t afford to forget what they’ve already mastered. We found that supervised fine-tuning forgets more than RL when training on a new task! Want to find out why? 👇

11 114 762 93K 650

Download Image

Jyo Pari @jyo_pari

3 weeks ago

We have a fun collaboration of @GPU_MODE x @scaleml coming up! We’re hosting a week-long online bootcamp that explores the core components of GPT-OSS while also diving into cutting-edge research that pushes beyond what’s currently in GPT-OSS! For example, how can MoE's power…

1 20 71 22K 27

Download Image

Adam Wei @adamwei_

4 weeks ago

Happy to announce that we’ll be presenting our work on sim-and-real cotraining at IROS 2025! Check out our latest arXiv version - we’ve added new experiments for 2 alternative co-training formulations and addressed some FAQs 📄 arxiv.org/abs/2503.22634 🌐 sim-and-real-cotraining.github.io

2 5 26 2K 2

Russ Tedrake @RussTedrake

2 months ago

TRI's latest Large Behavior Model (LBM) paper landed on arxiv last night! Check out our project website: toyotaresearchinstitute.github.io/lbm1/ One of our main goals for this paper was to put out a very careful and thorough study on the topic to help people understand the state of the…

8 106 484 79K 193

Giannis Daras @giannis_daras

3 months ago

Announcing Ambient Diffusion Omni — a framework that uses synthetic, low-quality, and out-of-distribution data to improve diffusion models. State-of-the-art ImageNet performance. A strong text-to-image results in just 2 days on 8 GPUs. Filtering ❌ Clever data use ✅

10 65 453 64K 403

Download Image

Jyo Pari @jyo_pari

3 months ago

What if an LLM could update its own weights? Meet SEAL🦭: a framework where LLMs generate their own training data (self-edits) to update their weights in response to new inputs. Self-editing is learned via RL, using the updated model’s downstream performance as reward.

130 527 3K 600K 3K

Download Image

Adam Wei @adamwei_

4 months ago

Really cool work on environment generation + inference time search with diffusion!

Nicholas Pfaff @NicholasEPfaff

4 months ago

Really cool work on environment generation + inference time search with diffusion!

7 26 130 19K 76

Download Video

0 0 2 155 0

Michael Posa @MichaelAPosa

5 months ago

We're really honored to be named alongside Ajay and Justin for this award, from the hands-down best technical committee in robotics! Here's the paper arxiv.org/abs/2304.11259 and accompanying video youtube.com/watch?v=L57Jz3…

Model-Based Optimization @TCOptRob

5 months ago

1 1 12 5K 2

1 10 29 4K 2

Adam Wei @adamwei_

5 months ago

Amazing concurrent work showing the effectiveness of sim+real cotraining on a wide range of tasks! The similarity of our findings suggests that sim data could unlock massive improvements in robotics... I'm excited to see what comes next in this space