Wei Wang @WeiWangML

PhD Student @UTokyo_news | Researcher @RIKEN_AIP_EN Tokyo Joined October 2023

Tweets

44
Followers

138
Following

887
Likes

758

Tanishq Mathew Abraham, Ph.D. @iScienceLuvr

4 weeks ago

RLVR/RLHF libraries: • verl - ByteDance • TRL - HuggingFace • slime - Zhipu AI • prime-rl - Prime Intellect • ROLL - Alibaba • Nemo-RL - NVIDIA • AReaL - Ant Research • SkyRL - UC Berkeley • open-instruct - Allen AI • torchtune - PyTorch Any I am missing? Which do you…

38 110 990 79K 1K

yidongwang37 @yidongwang37

a month ago

Another step for China’s AI innovation! Try it yourself: AutoSurvey: github.com/AutoSurveys/Au… GLM4.5: z.ai/blog/glm-4.5 Kimi: kimi-k2.com #GLM4.5 #kimi2 #AIResearch #ZhipuAI #AutoSurvey

1 4 5 116 4

Download Image

ICLR 2026 @iclr_conf

a month ago

Announcing the ICLR 2026 Call for Papers! Abstract submission: Sept 19 (AoE) Paper submission: Sept 24 (AoE) Reviews released: Nov 11 Author/Reviewer discussion: Nov 11-Dec 3 Final decisions: Jan 22 2026 iclr.cc/Conferences/20…

3 63 535 47K 156

Yiping Lu @2prime_PKU

2 months ago

Anyone knows adam?

271 465 5K 615K 504

Download Image

Feng Liu @AlexFengLiu1

3 months ago

Ever confused by "prompt tuning" vs "model reprogramming" vs "in-context learning"? What if they're all the same thing—just different names across ML, CV, and NLP communities? Our recent paper introduces Neural Network Reprogrammability as a unifying framework showing these…

0 1 11 7K 7

Qi Lei @Qi_Lei_

3 months ago

🧵New survey: Bridging Distribution Shift and AI Safety Distribution shift and AI safety have long been studied in parallel. But how can their insights formally inform each other? We present the first comprehensive, mathematically grounded, and one-to-one aligned treatment. 1/6

2 8 21 4K 7

Andrew Gordon Wilson @andrewgwils

3 months ago

AI benchmarking culture is completely out of control. Tables with dozens of methods, datasets, and bold numbers, trying to answer a question that perhaps no one should be asking anymore.

6 17 217 15K 26

Wei Huang @WeiHuang_USTC

3 months ago

🚀【Deep Learning Theory Team Seminar】 🎙️ Talk by Prof. Wuyang Chen (SFU): Building Machines That Understand the Physics 🧠 AI meets scientific tools & physics-enriched data 📅 May 28, 15:00 JST 🔗 Details & RSVP: …c59ed978213830355fc8978.doorkeeper.jp/events/184833 #AI #DeepLearning #ScientificML #LLM

2 1 7 508 1

Andreas Kirsch 🇺🇦 @BlackHC

4 months ago

I want to share my latest (very short) blog post: "Active Learning vs. Data Filtering: Selection vs. Rejection." What is the fundamental difference between active learning and data filtering? Well, obviously, the difference is that: 1/11

14 76 566 94K 707

Download Image

Saining Xie @sainingxie

4 months ago

Wow, Deeply Supervised Nets received the Test of Time award at @aistats_conf 2025! It was the very first paper I submitted during my PhD. Fun fact: the paper was originally rejected by NeurIPS with scores of 8/8/7 (yes, that pain stuck with me... maybe now I can finally let it…

AISTATS Conference @aistats_conf

4 months ago

2 6 70 90K 4

Download Image

33 44 510 88K 50

Tongtian Zhu @Tongtian_Zhu

5 months ago

ICML 2025's rebuttal process be like🤣: 👨‍💻 Authors: spend a whole week writing a careful rebuttal ✅ Reviewer: clicks "acknowledge" without reading 🚫 Author: not allowed to reply anymore So what does acknowledge mean here? "You speak. I pretend to listen. Conversation over."🙃

8 25 296 27K 19

Wei Huang @WeiHuang_USTC

6 months ago

Excited to announce our seminar! Join us on Mar 12, 2025, 13:00–14:30 (JST) for a hybrid talk by Prof. Difan Zou (HKU) on "Transformers: Model Depth & Attn" In-person at RIKEN AIP Nihonbashi Office & online via Zoom. Register: …c59ed978213830355fc8978.doorkeeper.jp/events/181888 #DeepLearning #Transformers

0 4 18 2K 5

no name @noname_records_

6 months ago

グリーン・デイ来日ライブ横浜Kアリーナ Day 2 セトリ最後まさかの観客がビリーのギターをステージで譲り受けてGood Riddanceを共演若い日本の子がGreen Dayの曲を完璧に弾き継いでこんな幸せなファイナルある？ #greenday #グリーンデイ #setlist Green Day - Good Riddance (Time of Your Life)

150 7K 46K 5.0M 8K

Download Video

Takayuki Osa @TakayukiOsa

7 months ago

Separate from the recently announced Special Postdoctoral Researcher recruitment at RIKEN, the Robot Learning Team at RIKEN AIP @RIKEN_AIP_EN is seeking to hire two researchers! If you are interested, please consider applying. riken.jp/en/careers/res…

0 3 5 810 0

Wei Huang @WeiHuang_USTC

7 months ago

🎉 Thrilled that our paper "On the Optimization and Generalization of Two-layer Transformers with Sign Gradient Descent" is a Spotlight at #ICLR2025! Huge thanks to my collaborators & reviewers! Excited to discuss at the conference! 📄 Paper: openreview.net/forum?id=97rOQ…

2 15 69 7K 24

Taiji Suzuki @btreetaiji

7 months ago

ICML2025のDeep Generative Modelワークショップの締め切りが2月5日に迫ってきました．ぜひ投稿ください． Submission deadline: February 5 (AOE), 2025 delta-workshop.github.io

Wei Huang @WeiHuang_USTC

7 months ago

0 17 39 19K 10

0 6 30 7K 5

Wei Huang @WeiHuang_USTC

7 months ago

ICML DDL is over, but don’t forget about the ICLR 2025 Workshop on Deep Generative Models submission deadline coming up fast! Share your innovative work: delta-workshop.github.io #ICLR2025 #DeepGenerativeModels #ICLR2025Workshop #CallForPapers

Wei Huang @WeiHuang_USTC

8 months ago

8 26 102 29K 30

Download Image

0 17 39 19K 10

Yoshua Bengio @Yoshua_Bengio

7 months ago

Today, we are publishing the first-ever International AI Safety Report, backed by 30 countries and the OECD, UN, and EU. It summarises the state of the science on AI capabilities and risks, and how to mitigate those risks. 🧵 Link to full Report: assets.publishing.service.gov.uk/media/679a0c48… 1/16

49 528 1K 393K 766

Download Video

Pan Xu @iampanxu

8 months ago

If you’re using the #ICML LaTeX template, there’s a typo in algorithmic.sty that prevents cross-referencing specific lines in the algorithm environment. The fix is simple: change \addtocounter{ALC@line}{1} to \refstepcounter{ALC@line} on Line 106. Credit: tex.stackexchange.com/questions/5234…

3 15 147 14K 102

Alex Dimakis @AlexGDimakis

8 months ago

Most AI researchers I talk to have been a bit shocked by DeepSeek-R1 and its performance. My preliminary understanding nuggets: 1. Simple post-training recipe called GRPO: Start with a good model and reward for correctness and style outcomes. No PRM, no MCTS no fancy reward…