Che-Ping Tsai @chepingt

PhD @mldcmu, interpretability and representation learning, machine learning theories. Joined November 2016

Tweets

61
Followers

126
Following

513
Likes

624

Wen-Tse Chen @WenzeChen2

4 weeks ago

[0/3] 🚀 Introducing Verlog – an open-source RL framework built specifically for training long-horizon, multi-turn LLM agents. 📊 Max episode length comparison: •VeRL / RAGEN → ~10 turns •verl-agent → ~50 turns •Verlog (ours) → 400+ turns 🔥 ⚙️ Technical foundation:…

2 71 395 34K 365

Download Gif

Lili @lchen915

a month ago

Self-Questioning Language Models: LLMs that learn to generate their own questions and answers via asymmetric self-play RL. There is no external training data – the only input is a single prompt specifying the topic.

25 183 1K 141K 1K

Download Image

Thang Luong @lmthang

2 months ago

Yes, there is an official marking guideline from the IMO organizers which is not available externally. Without the evaluation based on that guideline, no medal claim can be made. With one point deducted, it is a Silver, not Gold.

Mikhail Samin @Mihonarium

2 months ago

60 197 2K 462K 502

Download Image

14 57 587 121K 93

Burak Varıcı @VariciBurak

2 months ago

I'll be at ICML this week to present our take on "what we're really learning in representation learning and why it works." Our central argument: "Representations are learned from the association between input 𝑋 and context variable 𝐴"

1 1 6 404 0

Download Image

Li-Wei Chen @liweiche77

2 months ago

Thrilled to share our #ICML2025 paper! We introduce a variational approach for speech language models, automating speech attribute learning to deliver more natural, human-like speech. Joint work b/w @LTIatCMU and @Apple Read it: arxiv.org/abs/2506.14767

1 3 12 486 1

Shanda Li 黎善达 @Shanda_Li_2000

3 months ago

Can LLM solve PDEs? 🤯 We present CodePDE, a framework that uses LLMs to automatically generate solvers for PDE and outperforms human implementation! 🚀 CodePDE demonstrates the power of inference-time algorithms and scaling for PDE solving. More in 🧵: #ML4PDE #AI4Science

4 12 68 17K 24

Download Image

Amrith Setlur @setlur_amrith

3 months ago

Introducing e3 🔥 Best <2B model on math 💪 Are LLMs implementing algos ⚒️ OR is thinking an illusion 🎩.? Is RL only sharpening the base LLM distrib. 🤔 OR discovering novel strategies outside base LLM 💡? We answer these ⤵️ 🚨 arxiv.org/abs/2506.09026 🚨 matthewyryang.github.io/e3/

1 23 96 13K 53

Download Image

Junhong Shen @JunhongShen1

3 months ago

🔥Unlocking New Paradigm for Test-Time Scaling of Agents! We introduce Test-Time Interaction (TTI), which scales the number of interaction steps beyond thinking tokens per step. Our agents learn to act longer➡️richer exploration➡️better success Paper: arxiv.org/abs/2506.07976

7 38 168 80K 96

Download Image

Zhengyang Geng @ZhengyangGeng

4 months ago

Excited to share our work with my amazing collaborators, @Goodeat258, @SimulatedAnneal, @zicokolter, and Kaiming. In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,…

5 39 151 28K 57

Download Image

Yiding Jiang @yidingjiang

4 months ago

Data selection and curriculum learning can be formally viewed as a compression protocol via prequential coding. New blog (with @AllanZhou17 ) about this neat idea that motivated ADO but didn’t make it into the paper. yidingjiang.github.io/blog/post/curr…

2 17 104 13K 60

Rattana Pukdee @rpukdeee

4 months ago

In our #AISTATS2025 paper, we ask: when it is possible to recover a consistent joint distribution from conditionals? We propose path consistency and autoregressive path consistency—necessary and easily verifiable conditions. See you at Poster session 3, Monday 5th May.

1 6 14 959 3

Download Image

Che-Ping Tsai @chepingt

4 months ago

Check out Runtian's thesis on contexture theory, which shows that many representation learning methods perform eigendecomposition on the context-induced linear operators. More papers coming soon—stay tuned!

Runtian Zhai @RuntianZhai

4 months ago

2 30 164 22K 114

0 0 4 167 0

Che-Ping Tsai @chepingt

5 months ago

Heading to #ICLR2025! Looking forward to discussions on LLMs (for tabular data), interpretability, and representation learning. I'll be presenting my internship project on LLMs for tabular anomaly detection — catch our poster on Sat, April 26 at 10am! Come say hi! @iclr_conf

0 1 15 192 0

Download Image

Christina Baek @_christinabaek

5 months ago

Are current reasoning models optimal for test-time scaling? 🌠 No! Models make the same incorrect guess over and over again. We show that you can fix this problem w/o any crazy tricks 💫 – just do weight ensembling (WiSE-FT) for big gains on math! 1/N

7 103 486 53K 326

Download Image

Dylan Sam @dylanjsam

7 months ago

Excited to share new work from my internship @GoogleAI ! Curious as to how we should measure the similarity between examples in pretraining datasets? We study the role of similarity in pretraining 1.7B parameter language models on the Pile. arxiv: arxiv.org/abs/2502.02494 1/🧵

5 41 170 19K 95

Download Image

Samuel Sokota @ssokota

7 months ago

Model-free deep RL algorithms like NFSP, PSRO, ESCHER, & R-NaD are tailor-made for games with hidden information (e.g. poker). We performed the largest-ever comparison of these algorithms. We find that they do not outperform generic policy gradient methods, such as PPO. 1/N

9 61 352 77K 325

Download Image

Dylan Sam @dylanjsam

8 months ago

To trust LLMs in deployment (e.g., agentic frameworks or for generating synthetic data), we should predict how well they will perform. Our paper shows that we can do this by simply asking black-box models multiple follow-up questions! w/ @m_finzi and @zicokolter 1/ 🧵

4 40 118 14K 84

Download Image

Amrith Setlur @setlur_amrith

8 months ago

Through 2024, scaling test-time compute has become key. But, what does it mean to use test-time compute effectively & efficiently + how to do it? 🤔 We wrote a blog post ✍️ with a conceptual perspective on this: blog.ml.cmu.edu/2025/01/08/opt… 🎯Answer: meta reinforcement learning 🧵⤵️

3 22 123 7K 78

Junhong Shen @JunhongShen1

8 months ago

Introducing Content-Adaptive Tokenizer (CAT) 🐈! An image tokenizer that adapts token count based on image complexity, offering flexible 8x, 16x, or 32x compression! Unlike fixed-length tokenizers, CAT optimizes both representation efficiency and quality. Importantly, we use just…

4 47 246 22K 158

Download Image

Euxhen Hasanaj @EuxhenH

9 months ago

We have just released SenSet, a novel list of 106 senescence marker genes. We hope this resource accelerates discoveries in aging research, cancer biology, and regenerative medicine. #senescence #aging #pulearning #gene-set #SenNet biorxiv.org/content/10.110…