Xiang Long @SDxFaith

MLE @ModelBest, ex-Alibaba Group/MSRA/Tencent AI Lab Joined February 2020

Tweets

26
Followers

26
Following

542
Likes

17

LMSYS Org @lmsysorg

4 months ago

SGLang, verl, OpenBMB and Tsinghua University: Pioneering End-to-End Multi-Turn RLHF We are thrilled to announce the release of the first fully functional, convergence-verified, end-to-end open source multi-turn Reinforcement Learning with Human Feedback (RLHF) framework,…

3 28 169 18K 130

Download Image

Xiang Yue @xiangyue96

7 months ago

Demystifying Long CoT Reasoning in LLMs arxiv.org/pdf/2502.03373 Reasoning models like R1 / O1 / O3 have gained massive attention, but their training dynamics remain a mystery. We're taking a first deep dive into understanding long CoT reasoning in LLMs! 11 Major…

12 224 942 181K 1K

Download Image

Giuliano @Giuliano_Mana

8 months ago

"When you find a genius, give them all power." I've been obsessed with this idea for 12 months now. I learned it from Munger but then saw all successful people apply it. From Steve Jobs to Robert Oppenheimer. Thread:

94 722 8K 1.4M 9K

Download Image

Simo Ryu @cloneofsimo

10 months ago

This is periodic reminder / recommendation to read this paper inside out. It is still the most helpful paper ive ever read. You may have not have encountered it because its not super popular like "Attention is all you need", but you WILL thank me. Despite the paper's title,…

33 204 2K 253K 4K

Download Image

Xiang Long @SDxFaith

a year ago

We're excited about the future of truly multimodal tokens, where input and output seamlessly integrate across different media types. 🚀 #MultimodalAI #AI #Innovation

will depue @willdepue

a year ago

We're excited about the future of truly multimodal tokens, where input and output seamlessly integrate across different media types. 🚀 #MultimodalAI #AI #Innovation

118 314 3K 541K 758

0 0 5 67 0

Xiang Long @SDxFaith

a year ago

May be indicate some signal of benchmark-oriented tuning lol

Wenhu Chen @WenhuChen

a year ago

May be indicate some signal of benchmark-oriented tuning lol

46 125 660 173K 295

Download Image

0 0 1 78 0

Jim Fan @DrJimFan

a year ago

I know your timeline is flooded now with word salads of "insane, HER, 10 features you missed, we're so back". Sit down. Chill. <gasp> Take a deep breath like Mark does in the demo </gasp>. Let's think step by step: - Technique-wise, OpenAI has figured out a way to map audio to…

108 635 3K 991K 2K

Download Video

Ge Zhang @GeZhang86038849

a year ago

[1/n] Happy to share our new work "MuPT: A Generative Symbolic Music Pretrained Transformer", encompassing a series of music generation models ranging from 190 million to 4.2 billion parameters, all based on the ABC Notation. According to human preference evaluations, our models…

2 22 70 13K 43

Download Video

Zeyuan Allen-Zhu, Sc.D. @ZeyuanAllenZhu

a year ago

Our 12 scaling laws (for LLM knowledge capacity) are out: arxiv.org/abs/2404.05405. Took me 4mos to submit 50,000 jobs; took Meta 1mo for legal review; FAIR sponsored 4,200,000 GPU hrs. Hope this is a new direction to study scaling laws + help practitioners make informed decisions

28 332 1K 236K 1K

Download Image

Chelsea Finn @chelseabfinn

a year ago

Introducing a new, fully open robotics dataset! - 76k episodes - 564 unique scenes - 100 contributors - 13 labs/institutions - 3 continents droid-dataset.github.io A short 🧵 on the backstory

15 147 892 99K 432

Download Video

Jim Fan @DrJimFan

a year ago

Today is the beginning of our moonshot to solve embodied AGI in the physical world. I’m so excited to announce Project GR00T, our new initiative to create a general-purpose foundation model for humanoid robot learning. The GR00T model will enable a robot to understand multimodal…

216 1K 6K 1.1M 2K

Download Video

Bindu Reddy @bindureddy

2 years ago

Custom LLM and AI Agents (RAG) On Structured + Unstructured Data - AI Brain For Your Organization Imagine a ChatGPT-like interface over all your structured (database) and unstructured data. Ideally, you want to ask a question to an AI bot, and it should be able to run multiple…

26 212 1K 118K 1K

Download Image

Bindu Reddy @bindureddy

2 years ago

Failure Points In RAG Systems Anyone who has tried to deploy an RAG system knows that there are several failure modes to watch out for While RAG helps you reduce hallucinations and create custom ChatLLM, there can be several failure points, given the complexity of the system.…

17 205 915 107K 1K

Download Image

Yao Fu @Francis_YAO_

2 years ago

What is the correct recipe for finetuning LLMs for math reasoning? In MAmmoTH, we systematically study the SFT data composition and format for improving math, either in-distribution or out-of-distribution. Key takes: > Substantial, unprecidented performance gain for open-source…

Xiang Yue @xiangyue96

2 years ago

2 60 268 110K 155

Download Image

4 42 214 65K 142

Tianle Cai @tianle_cai

2 years ago

Ever want to make your LLM inference go brrrrr but got stuck at implementing speculative decoding and finding the suitable draft model? No more pain! Thrilled to unveil Medusa, a simple framework that removes the annoying draft model while getting 2x speedup. 🧵👇

28 207 1K 476K 650

Download Gif

Bryan Marley @_bryanmarley

2 years ago

50 Perplexity Prompts to Take Your Research to the Next Level. Don't forget to bookmark this post, to try these with Bard, ChatGPT, and Claude. 👇🏾 1. Emerging Industry Trends Prompt: "What are the emerging trends in [user input: specific industry] for the current year?" 2.…