chang ma @ma_chang_nlp

Ph.D student @hku previously @PKU1898, I work on agents and science. chang-github-00.github.io/-changma Shanghai/Beijing Joined May 2022

Tweets

135
Followers

767
Following

1K
Likes

3K

HKUNLP @hkunlp2020

3 weeks ago

Jinjie Ni @NiJinjie from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: hku.zoom.us/j/94293996114?…

0 11 43 4K 9

Download Image

Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation" at Friday July 25 11am HKT (Thursday July 24 8pm PDT). Link to talk: hku.zoom.us/j/92651812689?…

1 7 22 3K 1

Download Image

Zhihui Xie @_zhihuixie

2 months ago

🚀 Thrilled to announce Dream-Coder 7B — the most powerful open diffusion code  LLM to date.

3 36 122 13K 48

Download Image

Lingpeng Kong @ikekong

2 months ago

What happend after Dream 7B? First, Dream-Coder 7B: A fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. Plus, DreamOn cracks the variable-length generation problem! It enables code infilling that goes beyond a fixed canvas.

1 35 72 7K 20

Download Gif

Zirui Wu @WilliamZR7

2 months ago

We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.

2 29 120 15K 59

Download Gif

Tanishq Mathew Abraham, Ph.D. @iScienceLuvr

2 months ago

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Apple introduces DiffuCoder, a 7B diffusion LLM trained on 130B tokens of code authors also propose a diffusion-native RL training framework, coupled-GRPO Decoding of dLLMs differ from…

4 72 298 27K 170

Download Image

HKUNLP @hkunlp2020

3 months ago

Hongru Wang from CUHK will be giving a talk titled "Theory of agent: from definition to objective" at ⏰Wednesday 6.11 3pm HKT (Thursday 6.11 11am PDT). Link to talk: hku.zoom.us/j/91654661534?…

0 2 7 3K 1

Download Image

Sergey Levine @svlevine

3 months ago

I always found it puzzling how language models learn so much from next-token prediction, while video models learn so little from next frame prediction. Maybe it's because LLMs are actually brain scanners in disguise. Idle musings in my new blog post: sergeylevine.substack.com/p/language-mod…

52 177 1K 303K 1K

Yuzhen Huang @yuzhenh17

3 months ago

🔍 Are Verifiers Trustworthy in RLVR? Our paper, Pitfalls of Rule- and Model-based Verifiers, exposes the critical flaws in reinforcement learning verification for mathematical reasoning. 🔑 Key findings: 1️⃣ Rule-based verifiers miss correct answers, especially when presented in…

3 21 131 26K 94

Download Image

Xueliang Zhao @xlzhao_hku

3 months ago

🔥 Meet PromptCoT-Mamba The first reasoning model with constant-memory inference to beat Transformers on competition-level math & code ⚡ Efficient decoding: no attention, no KV cache ⚡ +16.0% / +7.1% / +16.6% vs. s1.1-7B on AIME 24 / 25 / LiveCodeBench 🚀 Up to 3.66× faster

2 15 29 2K 6

Download Image

Wei Liu @WeiLiu99

4 months ago

“What is the answer of 1 + 1?” Large Reasoning Models (LRMs) may generate 1500+ tokens just to answer this trivial question. Too much thinking 🤯 Can LRMs be both Faster AND Stronger? Yes. Introducing LASER💥: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping…

2 32 140 28K 88

Download Image

Shiqi Chen @shiqi_chen17

4 months ago

Share our another #ICML25 paper: “Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging” ! (1/5) We use model merging to enhance VLMs' reasoning by integrating math-focused LLMs—bringing textual reasoning into multi-modal models. Surprisingly, this…

0 13 88 5K 44

Download Gif

HKUNLP @hkunlp2020

4 months ago

Guanqi Jiang from UCSD will be giving a talk titled "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets" at ⏰Friday 5.16 11am HKT (Thursday 5.15 8pm PDT). Link to talk: hku.zoom.us/j/97674910858?…

0 5 10 1K 0

Download Image

HKUNLP @hkunlp2020

4 months ago

Follow our new HKUNLP seminars at hkunlp.github.io/seminar/. You can also sign up as a speaker to share your work!

0 2 6 422 1

chang ma @ma_chang_nlp

4 months ago

We are kicking off a series of seminars at @hkunlp2020. @siyan_zhao will be giving a talk titled "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" at ⏰Friday 5.9 11am HKT (Thursday 5.8 8pm PDT). Link to talk: hku.zoom.us/j/97925412724?…

0 13 37 3K 3

Download Image

Shiqi Chen @shiqi_chen17

4 months ago

🚀🔥 Thrilled to announce our ICML25 paper: "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas"! We dive into the core reasons behind spatial reasoning difficulties for Vision-Language Models from an attention mechanism view. 🌍🔍 Paper:…

5 36 229 31K 169

Download Gif

Zhihui Xie @_zhihuixie

5 months ago

Excited to be in Singapore 🇸🇬 for #ICLR2025! Thrilled for my first time attending after past visa issues kept me away 😢. We'll be presenting our work on: 1️⃣ Jailbreaking as a Reward Misspecification Problem 🗓️ Thursday, April 24 — 3:00 PM - 5:30 PM (SGT) 📍 Hall 3 + Hall 2B —…

0 5 24 2K 2

Download Image

chang ma @ma_chang_nlp

5 months ago

Excited to share our work at ICLR 2025 in 🇸🇬. @iclr_conf 🥳 Happy to chat about LLM reasoning & planning, agents, and AI4Science! 📍Sat 26 Apr 3 p.m. CST — 5:30 p.m Hall 3 + Hall 2B #554