Hao Sun - RL @HolarisSun

RS @GoogleDeepMind. Prev. PhD @CambridgeUni, #MMLab, B.Phys. @PKU1898 holarissun.github.io London, UK Joined October 2022

Tweets

167
Followers

893
Following

960
Likes

537

Hao Sun - RL @HolarisSun

2 months ago

🚀 RL is powering breakthroughs in LLM alignment, reasoning, and agentic apps. Are you ready to dive into the RL x LLM frontier? Join us at @aclmeeting ACL’25 tutorial: Inverse RL Meets LLM Alignment this Sunday at Vienna🇦🇹(Jul 27th, 9am) 📄 Preprint at huggingface.co/papers/2507.13…

0 12 68 4K 43

Hao Sun - RL @HolarisSun

2 months ago

This is SCIENCE🚀!!!

Alan Jeffares @ ICML 🇨🇦 @Jeffaresalan

2 months ago

This is SCIENCE🚀!!!

12 79 951 129K 967

Download Image

2 0 8 794 1

Hao Sun - RL @HolarisSun

3 months ago

Now with Qwen’s RL-fine-tuning results, are we witnessing a quiet return of prompt optimization/engineering? Now we have a 2-player game: users become “lazy prompters”, but the system prompts (e.g. thinking patterns) need to be highly optimized. Next: Bi-level optimization?

0 0 3 420 0

Download Image

Hao Sun - RL @HolarisSun

4 months ago

"Knowledge belongs to humanity, and is the torch which illuminates the world." — Louis Pasteur Especially for those contributed by the community.

0 0 7 3K 0

Download Image

Hao Sun - RL @HolarisSun

4 months ago

AI cannot feel time, then how can it really understand humans?

0 0 2 353 0

Jean-François Ton @jeanfrancois287

4 months ago

📢New Paper on Process Reward Modelling 📢 Ever wondered about the pathologies of existing PRMs and how they could be remedied? In our latest paper, we investigate this through the lens of Information theory! #icml2025 Here’s a 🧵on how it works 👇 arxiv.org/abs/2411.11984

5 74 308 28K 284

Download Image

Jean-François Ton @jeanfrancois287

4 months ago

Happy to share that our paper on "Active Reward Modeling" has been accepted to ICML 2025! #ICML2025 The part I like the most about the project is its simplicity! Huge thanks to my amazing co-authors @ShenRaphael @HolarisSun More to come! For more detailed 🧵 see 👇

Jean-François Ton @jeanfrancois287

7 months ago

1 9 36 8K 15

Download Image

0 3 12 2K 3

Hao Sun - RL @HolarisSun

4 months ago

OpenReview Justice!

Yunyi Shen/申云逸 🐺 @ShenRaphael

4 months ago

OpenReview Justice!

1 4 75 6K 9

Download Image

0 0 5 661 0

Yunyi Shen/申云逸 🐺 @ShenRaphael

4 months ago

Glad to be there with @HolarisSun presenting our work openreview.net/forum?id=rfdbl…

Hao Sun - RL @HolarisSun

5 months ago

Glad to be there with @HolarisSun presenting our work openreview.net/forum?id=rfdbl… https://t.co/byFdilTSjj

0 2 18 10K 0

1 7 44 8K 4

Download Image

Hao Sun - RL @HolarisSun

4 months ago

ICLR wrapped! Eggie and Toastie said it was the BEST🥰

2 2 50 5K 3

Download Image

Hao Sun - RL @HolarisSun

4 months ago

The oral sessions and poster sessions are happening at the same time, so it actually feels like the oral speakers are just talking to each other🤣

Yunyi Shen/申云逸 🐺 @ShenRaphael

4 months ago

The oral sessions and poster sessions are happening at the same time, so it actually feels like the oral speakers are just talking to each other🤣

1 1 7 2K 0

Download Image

0 0 6 826 0

Hao Sun - RL @HolarisSun

5 months ago

Heading to 🇸🇬ICLR next week! Can’t wait to catch up with old friends and meet new ones — let’s chat about RL, reward models, alignment, reasoning, and agents! Also, fun fact🤓: Yunyi won’t be there physically, but his digital twin will be attending instead. Stay tuned!