Yuqing Yang @yyqcode

First-year PhD student @CSatUSC @nlp_usc.

ayyyq.github.io

Joined June 2023

Tweets: 37

Followers: 231

Following: 364

Likes: 414

Chenxin An @AnChancy46881

3 months ago

🚨 4B open-recipe model beats Claude-4-Opus 🔓 100% open data, recipe, model weights and code. Introducing Polaris✨, a post-training recipe for scaling RL on advanced reasoning models. 🥳 Check out how we boost open-recipe reasoning models to incredible performance levels…

24 83 447 98K 395

Xi Ye @xiye_nlp

3 months ago

There’s been hot debate about (The Illusion of) The Illusion of Thinking. My take: it’s not that models can’t reason — they just aren’t perfect at long-form generation yet. We eval reasoning models on LongProc benchmark (requiring generating 8K CoTs, see thread). Reasoning…

Xi Ye @xiye_nlp

8 months ago

3 49 220 33K 123

1 9 34 4K 11

Dongwei Jiang @Dongwei__Jiang

3 months ago

🧵 Recent studies show LLMs can self-improve their responses when given external feedback. But how effectively can they incorporate it? We tested this systematically—and found they can't fully integrate feedback, even when the feedback is high-quality and backed by ground-truth.

3 27 157 11K 75

Linxin Song @linxins2

4 months ago

🚨 We discovered a surprising side effect of Reinforcement Finetuning (RFT): it makes LLMs more confidently wrong on unanswerable questions. We call this the hallucination tax: a drop in refusal behavior that leads to overconfident hallucinations. 🧵 1/n

5 41 270 35K 260

Deqing Fu @DeqingFu

4 months ago

Textual steering vectors can improve visual understanding in multimodal LLMs! You can extract steering vectors via any interpretability toolkit you like -- SAEs, MeanShift, Probes -- and apply them to image or text tokens (or both) of Multimodal LLMs. And They Steer!

1 14 47 7K 13

Linxin Song @linxins2

5 months ago

Want to know what your LLM doesn’t know? This is how 👇 Preprint: arxiv.org/abs/2503.23361 Code: github.com/uscnlp-lime/SEA

3 24 86 22K 57

Tianyi Zhou @tianyi_zhou12

7 months ago

Billion-parameter LLMs still struggle with simple arithmetic? 📞 FoNE (Fourier Number Embedding) tackles this problem. By mapping numbers directly into Fourier space, it bypasses tokenization and significantly improves both numerical accuracy and efficiency.

1 14 23 3K 7

Muru Zhang @zhang_muru

7 months ago

Running your model on multiple GPUs but often finding the speed unsatisfactory? We introduce Ladder-residual, a parallelism-aware architecture modification that makes 70B Llama with tensor parallelism ~30% faster! Work done at @togethercompute. Co-1st author with @MayankMish98…

5 61 323 76K 196

Tengxiao Liu @TengxiaoLiu

9 months ago

Come join the #NeurIPS2024 poster session and discuss whether language models can learn to skip steps in reasoning! 🗓Dec 12, Thursday, 11:00 am - 2:00 pm 📍East Exhibit Hall A-C #2900 Feel free to stop by and say hi! I am actively seeking Summer 2025 internship opportunities!

Tengxiao Liu @TengxiaoLiu

9 months ago

3 13 46 6K 18

0 3 13 938 1

Qinyuan Ye @qinyuan_ye

9 months ago

I'll present a poster for Lifelong ICL and Task Haystack at #NeurIPS2024! ⏰ Wednesday 11am-2pm 📍 East Exhibit Hall A-C #2802 📜 arxiv.org/abs/2407.16695 My co-first author @xiaoyue02_xu is applying to PhD programs and I am looking for jobs in industry! Happy to connect at NeurIPS!