Jinjie Ni @NiJinjie from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: hku.zoom.us/j/94293996114?…
Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly
Decide How to Parallelize and Merge Generation" at Friday July 25 11am HKT (Thursday July 24 8pm PDT). Link to talk: hku.zoom.us/j/92651812689?…
What happend after Dream 7B?
First, Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data.
Plus, DreamOn cracks the variable-length generation problem! It enables code infilling that goes beyond a fixed canvas.
We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models.
Our approach boosts code infilling performance significantly and even catches up with oracle results.
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
Apple introduces DiffuCoder, a 7B diffusion LLM trained on 130B tokens of code
authors also propose a diffusion-native RL training framework, coupled-GRPO
Decoding of dLLMs differ from…
Hongru Wang from CUHK will be giving a talk titled "Theory of agent: from definition to objective" at ⏰Wednesday 6.11 3pm HKT (Thursday 6.11 11am PDT). Link to talk: hku.zoom.us/j/91654661534?…
I always found it puzzling how language models learn so much from next-token prediction, while video models learn so little from next frame prediction. Maybe it's because LLMs are actually brain scanners in disguise. Idle musings in my new blog post: sergeylevine.substack.com/p/language-mod…
🔍 Are Verifiers Trustworthy in RLVR?
Our paper, Pitfalls of Rule- and Model-based Verifiers, exposes the critical flaws in reinforcement learning verification for mathematical reasoning.
🔑 Key findings:
1️⃣ Rule-based verifiers miss correct answers, especially when presented in…
🔥 Meet PromptCoT-Mamba
The first reasoning model with constant-memory inference to beat Transformers on competition-level math & code
⚡ Efficient decoding: no attention, no KV cache
⚡ +16.0% / +7.1% / +16.6% vs. s1.1-7B on AIME 24 / 25 / LiveCodeBench
🚀 Up to 3.66× faster
“What is the answer of 1 + 1?”
Large Reasoning Models (LRMs) may generate 1500+ tokens just to answer this trivial question.
Too much thinking 🤯
Can LRMs be both Faster AND Stronger?
Yes.
Introducing LASER💥: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping…
Share our another #ICML25 paper: “Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging” !
(1/5) We use model merging to enhance VLMs' reasoning by integrating math-focused LLMs—bringing textual reasoning into multi-modal models. Surprisingly, this…
Guanqi Jiang from UCSD will be giving a talk titled "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets" at ⏰Friday 5.16 11am HKT (Thursday 5.15 8pm PDT). Link to talk: hku.zoom.us/j/97674910858?…
We are kicking off a series of seminars at @hkunlp2020. @siyan_zhao will be giving a talk titled "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" at ⏰Friday 5.9 11am HKT (Thursday 5.8 8pm PDT). Link to talk: hku.zoom.us/j/97925412724?…
🚀🔥 Thrilled to announce our ICML25 paper: "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas"!
We dive into the core reasons behind spatial reasoning difficulties for Vision-Language Models from an attention mechanism view. 🌍🔍
Paper:…
Excited to be in Singapore 🇸🇬 for #ICLR2025! Thrilled for my first time attending after past visa issues kept me away 😢.
We'll be presenting our work on:
1️⃣ Jailbreaking as a Reward Misspecification Problem
🗓️ Thursday, April 24 — 3:00 PM - 5:30 PM (SGT)
📍 Hall 3 + Hall 2B —…
Excited to share our work at ICLR 2025 in 🇸🇬. @iclr_conf 🥳 Happy to chat about LLM reasoning & planning, agents, and AI4Science!
📍Sat 26 Apr 3 p.m. CST — 5:30 p.m Hall 3 + Hall 2B #554
69 Followers 2K FollowingData & AI @ ENSAE 🤖 | From Dakar to Paris to the world 🌍 | Founder mindset ⚡ | (finance • media • sport • NLP • Crypto ) | Legacy. Growth. Impact.
476 Followers 315 FollowingPostDoc @genentech in the group of @MarioniLab | Interested in single-cells, deep learning, networks | Core member @scverse_team | https://t.co/wNWP4SbLVw
81 Followers 81 FollowingSumqayıtlı – Azərbaycanın ikinci böyük şəhəri olan və burada doğulan, yaşayan və ya ictimai, mədəni, iqtisadi həyatında fəal iştirak edən biri
993 Followers 984 FollowingPh.D. @CarnegieMellon. Working on data and hardware-driven principled algorithm & system co-design for scalable and generalizable foundation models. They/Them
974 Followers 1K FollowingPost-training in KIMI @Kimi_Moonshot | MS Peking University @PKU1898
Author of MetaMath, Easy2hard generalization, NuminaMath, Kimi k1.5, Kimi K2
110K Followers 3K FollowingCPO @OpenAI, BoD @Cisco @nature_org, LTC @USArmyReserve
Prev: President @Planet, Head of Product @Instagram @Twitter
❤️ @elizabeth ultramarathons kids cats math
47 Followers 194 FollowingAn official account for the COLM 2025 Workshop on LLM for Scientific Discovery: Reasoning, Assistance, and Collaboration (LM4SCI)
1K Followers 336 FollowingChen Institute has committed $1 billion to advance fundamental and translational brain research. Join us at AIAS 2025 this October in SF. https://t.co/R77byXINFW
1K Followers 315 FollowingBringing good stuff to the world.
CMU MLD phd. cooked with TPUs at Google Brain.
Leading Tree and Rock AI Lab (TRAIL) at NUS (Singapore)
5K Followers 2K FollowingAssociate Professor @UWCSE developing computational methods that leverage large-scale behavioral data to improve human well-being. Recruiting PhD students :-)
1K Followers 103 FollowingAI/RL researcher, Assistant Prof. at @Tsinghua_Uni, leading the RL lab at @AntResearch_, PhD at @berkeley_ai, frequent flyer and milk tea lover.
10K Followers 1K FollowingMachine learning, statistical genomics, ML4 health, senior investigator
at Gladstone Institutes, professor at Stanford, single mom to four kids. she/her
476 Followers 315 FollowingPostDoc @genentech in the group of @MarioniLab | Interested in single-cells, deep learning, networks | Core member @scverse_team | https://t.co/wNWP4SbLVw
5K Followers 3 FollowingTweeting interesting papers submitted at https://t.co/rXX8x0HzXV.
Submit your own at https://t.co/QhbJKXBd4Q, and link models/datasets/demos to it!