Zixiang Chen @_zxchen_

Research Scientist at @SalesforceAI | Ph.D. from @UCLA | B.S. from @Tsinghua_Uni | Foundation Model, Theory, Reinforcement Learning | Opinions are my own sites.google.com/view/zxchen Los Angeles, CA Joined August 2019

Tweets

166
Followers

1K
Following

2K
Likes

1K

Anthropic @AnthropicAI

a month ago

New Anthropic research: Persona vectors. Language models sometimes go haywire and slip into weird and unsettling personas. Why? In a new paper, we find “persona vectors"—neural activity patterns controlling traits like evil, sycophancy, or hallucination.

232 939 6K 1.4M 4K

Download Image

AK @_akhaliq

4 weeks ago

CoAct-1 Computer-using Agents with Coding as Actions

3 29 124 19K 68

Download Image

OpenAI @OpenAI

a month ago

Want to see our open models in action? Watch how gpt-oss builds a video game—using tools step-by-step within chain-of-thought reasoning 👾🍓

163 423 4K 479K 1K

Download Video

OpenAI @OpenAI

a month ago

Our open models are here. Both of them. openai.com/open-models

1K 3K 20K 6.5M 4K

Google DeepMind @GoogleDeepMind

2 months ago

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇 It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵

155 772 4K 1.1M 704

Download Image

Google DeepMind @GoogleDeepMind

4 months ago

We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO

93 665 5K 1.3M 1K

Download Gif

Zhihong Shao @zhs05232838

4 months ago

We just released DeepSeek-Prover V2. - Solves nearly 90% of miniF2F problems - Significantly improves the SoTA performance on the PutnamBench - Achieves a non-trivial pass rate on AIME 24 & 25 problems in their formal version Github: github.com/deepseek-ai/De…

74 321 2K 453K 624

Download Image

Association for Computing Machinery @TheOfficialACM

6 months ago

Meet the recipients of the 2024 ACM A.M. Turing Award, Andrew G. Barto and Richard S. Sutton! They are recognized for developing the conceptual and algorithmic foundations of reinforcement learning. Please join us in congratulating the two recipients! bit.ly/4hpdsbD

35 471 2K 447K 138

Download Video

Nouamane Tazi @Nouamanetazi

7 months ago

🚀 Excited to release *THE* Ultra-Scale Playbook - a comprehensive guide on training LLMs from 1 to 1000s of GPUs!

29 234 1K 152K 1K

Download Image

Jacob Austin @jacobaustin132

7 months ago

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n

25 387 2K 444K 3K

Download Image

Kaixuan Ji @Kaixuan_Ji_19

8 months ago

🚀🚀Thrilled to introduce our recent research on LLM multi-step reasoning! We propose Direct Q-function Optimization, a new approach enhancing LLM's reasoning performance and achieves up to 2% performance gain on mathematical reasoning benchmarks. 🔥🔥 ✅Free from online…

Quanquan Gu @QuanquanGu

8 months ago

6 40 306 91K 225

6 37 199 39K 145

Download Image

Andrej Karpathy @karpathy

8 months ago

DeepSeek (Chinese AI co) making it look easy today with an open weights release of a frontier-grade LLM trained on a joke of a budget (2048 GPUs for 2 months, $6M). For reference, this level of capability is supposed to require clusters of closer to 16K GPUs, the ones being…

DeepSeek @deepseek_ai

8 months ago

669 2K 13K 7.3M 5K

Download Gif

406 2K 19K 6.5M 8K

M3L Workshop @ NeurIPS 2024 @M3LWorkshop

9 months ago

Hope everyone had fun at the 2nd workshop of M3L! Many thanks to the speakers, authors, reviewers, and participants for making this workshop a success. We had a full house again, and we hope to see you next year! 💡

0 6 17 4K 2

Download Image

NeurIPS Conference @NeurIPSConf

9 months ago

NeurIPS acknowledges that the cultural generalization made by the keynote speaker today reinforces implicit biases by making generalisations about Chinese scholars. This is not what NeurIPS stands for. NeurIPS is dedicated to being a safe space for all of us. We want to address…

533 385 3K 1.3M 572

Zixiang Chen @_zxchen_

9 months ago

📢 Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time! Come and chat! 💡 📍 West Ballroom A-D #7007 ⏰ Poster Session: Thu 4:30-7:30 PM PST 🌟 Highlights: 1. Training-free Method for faster generation 2. Predetermined transition time

Zixiang Chen @_zxchen_

12 months ago

0 3 17 4K 3

0 0 9 638 0

Zixiang Chen @_zxchen_

9 months ago

📢 Learning sparse parities efficiently is a fundamental challenge in learning theory. Come see how the SGD-based method can match the SQ lower bound! Let's chat! 💡 📝 arxiv.org/pdf/2404.12376 📍 Poster Session 2 West Ballroom A-D #7107 ⏰ Wed 4:30-7:30 PM PST 🌟 Highlights:…