Research Scientist at @SalesforceAI | Ph.D. from @UCLA | B.S. from @Tsinghua_Uni | Foundation Model, Theory, Reinforcement Learning | Opinions are my own
sites.google.com/view/zxchen
Los Angeles, CA
Joined August 2019
New Anthropic research: Persona vectors.
Language models sometimes go haywire and slip into weird and unsettling personas. Why? In a new paper, we find “persona vectors”: neural activity patterns controlling traits like evil, sycophancy, or hallucination.
An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇
It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model.
Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
We just released DeepSeek-Prover V2.
- Solves nearly 90% of miniF2F problems
- Significantly improves the SoTA performance on the PutnamBench
- Achieves a non-trivial pass rate on AIME 24 & 25 problems in their formal version
Github: github.com/deepseek-ai/De…
Meet the recipients of the 2024 ACM A.M. Turing Award, Andrew G. Barto and Richard S. Sutton! They are recognized for developing the conceptual and algorithmic foundations of reinforcement learning. Please join us in congratulating the two recipients! bit.ly/4hpdsbD
Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n
🚀🚀Thrilled to introduce our recent research on LLM multi-step reasoning! We propose Direct Q-function Optimization, a new approach that enhances LLM reasoning performance and achieves up to a 2% gain on mathematical reasoning benchmarks. 🔥🔥
✅Free from online… https://t.co/5XiqAhfInA
DeepSeek (Chinese AI co) making it look easy today with an open weights release of a frontier-grade LLM trained on a joke of a budget (2048 GPUs for 2 months, $6M).
For reference, this level of capability is supposed to require clusters of closer to 16K GPUs, the ones being…
Hope everyone had fun at the 2nd workshop of M3L! Many thanks to the speakers, authors, reviewers, and participants for making this workshop a success. We had a full house again, and we hope to see you next year! 💡
NeurIPS acknowledges that the cultural generalization made by the keynote speaker today reinforces implicit biases by making generalisations about Chinese scholars. This is not what NeurIPS stands for. NeurIPS is dedicated to being a safe space for all of us. We want to address…
📢 Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time! Come and chat! 💡
📍 West Ballroom A-D #7007
⏰ Poster Session: Thu 4:30-7:30 PM PST
🌟 Highlights:
1. Training-free Method for faster generation
2. Predetermined transition time
📢 Learning sparse parities efficiently is a fundamental challenge in learning theory. Come see how the SGD-based method can match the SQ lower bound! Let's chat! 💡
📝 arxiv.org/pdf/2404.12376
📍 Poster Session 2 West Ballroom A-D #7107
⏰ Wed 4:30-7:30 PM PST
🌟 Highlights:…