Mike Lewis @ml_perception
Llama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal. Seattle Joined September 2019-
Tweets275
-
Followers8K
-
Following242
-
Likes791
Love seeing these incredibly creative new evaluations! Optimizing benchmarks is easy, the real challenge is in generalizing to the tasks that don't exist yet
Love seeing these incredibly creative new evaluations! Optimizing benchmarks is easy, the real challenge is in generalizing to the tasks that don't exist yet
I've written the full story of Attention Sinks — a technical deep-dive into how the mechanism was developed and how our research ended up being used in OpenAI's new OSS models. For those interested in the details: hanlab.mit.edu/blog/streaming…
Don’t miss this - I’ve worked with Mike (@ml_perception) very closely at Meta and his talks are super informative and fun.
Don’t miss this - I’ve worked with Mike (@ml_perception) very closely at Meta and his talks are super informative and fun.
📉📉NEW SCALING LAW PHENOMENON 📉📉 We find that knowledge and reasoning exhibit different scaling behaviors! Super excited to finally tell you all about our paper on the compute optimal scaling of skills: arxiv.org/pdf/2503.10061 [1/n]
✨New Preprint✨We introduce 𝐁𝐫𝐚𝐧𝐜𝐡-𝐓𝐫𝐚𝐢𝐧-𝐒𝐭𝐢𝐭𝐜𝐡 (𝐁𝐓𝐒), an efficient & flexible method for stitching together independently pretrained LLM experts (i.e. code, math) into a single, capable generalist model. Key Takeaways: ✅BTS achieves the best average…
🚀 Introducing the Byte Latent Transformer (BLT) – An LLM architecture that scales better than Llama 3 using byte-patches instead of tokens 🤯 Paper 📄 dl.fbaipublicfiles.com/blt/BLT__Patch… Code 🛠️ github.com/facebookresear…
How can we reduce pretraining costs for multi-modal models without sacrificing quality? We study this Q in our new work: arxiv.org/abs/2411.04996 At @AIatMeta, We introduce Mixture-of-Transformers (MoT), a sparse architecture with modality-aware sparsity for every non-embedding…
1/n Introducing MoMa 🖼, our new sparse early-fusion architecture for mixed-modal language modeling that significantly boosts pre-training efficiency 🚀 (arxiv.org/pdf/2407.21770). MoMa employs a mixture-of-expert (MoE) framework with modality-specific expert groups. Given any…
tldr; you can go a long way in pre-training by (1) curating amazing data, (2) using a lot of FLOPs, and (3) otherwise not screwing up. All three are harder than they sound, so read the paper... That said, I'm amazed by our progress since Llama 3 - expect big things from Llama 4!
tldr; you can go a long way in pre-training by (1) curating amazing data, (2) using a lot of FLOPs, and (3) otherwise not screwing up. All three are harder than they sound, so read the paper... That said, I'm amazed by our progress since Llama 3 - expect big things from Llama 4!
So excited for the open release of Llama 3.1 405B - with MMLU > 87, it's a really strong model and I can't wait to see what you all build with it! llama.meta.com Also check out the paper here, with lots of details on how this was made: tinyurl.com/2z2cpj8m
Excited to see the open source release of FAIR's early fusion multimodal LLMs!
Excited to see the open source release of FAIR's early fusion multimodal LLMs!
Thrilled to be in Vienna for our ICLR workshop, Navigating and Addressing Data Problems for Foundation Models. Starting Saturday at 8:50 AM, our program features keynote talks, best paper presentations, a poster session, and a panel discussion. Explore the full schedule here!…
Introducing Lory, a fully-differentiable MoE arch for decoder LM pre-training! Lory merges expert FFNs by computing a weighted average in the parameter space, and computes the output through the merged FFNs. But training naively is infeasible, how to make it work? Details in🧵
Heading to ICLR! I’m writing fewer papers now to train more Llamas, but proud of our work here: Instruction Backtranslation (arxiv.org/abs/2308.06259), Attention Sinks, (arxiv.org/abs/2309.17453) In Context Pretraining (arxiv.org/abs/2310.10638) and RA-DIT (arxiv.org/abs/2310.01352).
Moreover, we observe even stronger performance in English category, where Llama 3 ranking jumps to ~1st place with GPT-4-Turbo! It consistently performs strong against top models (see win-rate matrix) by human preference. It's been optimized for dialogue scenario with large…
I'm seeing a lot of questions about the limit of how good you can make a small LLM. tldr; benchmarks saturate, models don't. LLMs will improve logarithmically forever with enough good data.
I'm seeing a lot of questions about the limit of how good you can make a small LLM. tldr; benchmarks saturate, models don't. LLMs will improve logarithmically forever with enough good data.
Yes, both the 8B and 70B are trained way more than is Chinchilla optimal - but we can eat the training cost to save you inference cost! One of the most interesting things to me was how quickly the 8B was improving even at 15T tokens.
Yes, both the 8B and 70B are trained way more than is Chinchilla optimal - but we can eat the training cost to save you inference cost! One of the most interesting things to me was how quickly the 8B was improving even at 15T tokens.
Excited to share the Llama 3 models with everyone. This has been an INCREDIBLE team effort. The 8b and 70b models are available now. These are the best open source models.
Happy to be part of this incredible journey of Llama3 and to share the best open weight 8B and 70B models! Our largest 400B+ model is still cooking but we are providing a sneak peek into how it is trending! Check more details here ai.meta.com/blog/meta-llam…
Excited to share a preview of Llama3, including the release of an 8B and 70B (82 MMLU, should be the best open weights model!), and preliminary results for a 405B model (still training, but already competitive with GPT4). Lots more still to come... ai.meta.com/blog/meta-llam…

(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
AI at Meta @AIatMeta
712K Followers 288 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
Percy Liang @percyliang
84K Followers 417 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Akari Asai @AkariAsai
18K Followers 867 Following Incoming Assistant Professor @SCSatCMU & research scientist @allen_ai. akariasai @ 🦋
Delip Rao e/σ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Soumith Chintala @soumithchintala
250K Followers 1K Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Graham Neubig @gneubig
40K Followers 708 Following Associate professor @LTIatCMU. Co-founder/chief scientist @allhands_ai. I mostly work on modeling language.
Tim Dettmers @Tim_Dettmers
38K Followers 992 Following Creator of bitsandbytes.Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
Jacob Andreas @jacobandreas
20K Followers 951 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Noam Brown @polynoamial
91K Followers 853 Following Researching reasoning @OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o3 / o1 / 🍓 reasoning models
Ofir Press @OfirPress
15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Sewon Min @sewon__min
13K Followers 813 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Yoav Artzi @yoavartzi
17K Followers 183 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaC and @COLM_conf
Luca Soldaini 🎀 @soldni
11K Followers 1K Following I like tokens! I lead the OLMo data team at @allen_ai w/ @kylelostat. Open source is fun 🤖☕️🍕🏳️🌈 Opinions are sampled from my own stochastic parrot
Pranav Patil @Pranavv767
0 Followers 16 Following
Kirill Solodskikh @GarchFather
448 Followers 1K Following Almost Phd, Almost Founder, Almost Team Lead, Almost Successful, married. @TheStageAI Co-founder, CEO, ex Huawei P50 AI cameras
Avinab Neogy @avinab_neogy
52 Followers 929 Following gsoc 25 @_r_foundation, esoc 25 @ecospecs, mech interp, delta @_theresidency, community lead @cohere_labs, prev @bitspilanigoa 24, 6’3 and arch btw
Jacksonhayes @Jacksonhay82121
12 Followers 624 Following
Ryan Steubs @ryan_steubs
9 Followers 423 Following Director of AI Alignment @Kwaai | Game Theory, Applied RL & AI safety | Author @scalingalignment | Building AI systems that align with human values
Junxiao Yang @Junxiao_THU
11 Followers 165 Following I'm a first-year Ph.D. student at Tsinghua University, focusing on building safe and reliable LLMs. My personal website: https://t.co/s5qVMF1nBK
Keonwoo Roh @keonwoo_roh
2 Followers 110 Following MS at Korea University Studying AI, especially MLLM and KGLM
!.! @xypyth
49 Followers 4K Following
DIENG Cheikh Ibra @dcheikhibra
57 Followers 2K Following Data & AI @ ENSAE 🤖 | From Dakar to Paris to the world 🌍 | Founder mindset ⚡ | (finance • media • sport • NLP • Crypto ) | Legacy. Growth. Impact.
proportional @proportional
7 Followers 3K Following
Max Ohsawa @maxohsawa
2 Followers 34 Following
Gourav Jha @Gouravjha1998
110 Followers 2K Following Machine learning Engineer | Computer Science | Machine Vision
云创兽Ai @Iefloojar04510
0 Followers 89 Following 📊 sharp stock investing? That’s me, a dream chaser! eager for trend talks. DM me for FTSE moves! 💸 #NYSE #Markets
Eishaan Khatri @EishaanKhatri
8 Followers 565 Following
Kevin Faircloth @Asset_sMind
2K Followers 4K Following I believe the TSP burden upon logistics will be alleviated by QIS. No SME on anything. Not advisory to anyone. Just exploring…
chad m @sneila187
33 Followers 960 Following
Kat @Kat_Build
35 Followers 785 Following Market Researcher @Amazon AGI SF Lab. Quant for people. Love iced coffee and my quiet Player Piano paranoia.
Ahanaf Ariq @AhanafAriq
125 Followers 4K Following Deep Learning Theorist | Topological Data Analyst | IRPO 3rd | IYMC Bronze | Hessian Optimizer | Aspirant AI/ML Researcher
____________ @PranavA34378021
13 Followers 500 Following Teen🎁💲👕👖📙📚📚📚 Programmer📉💺📟📟 Game Dev🎮🎲🎰🃏🎯 Film Maker📹🎥🎬📷 Indian...Jai Hind!!!! and on to Music production...🎤🎧🎵🎸
Doug Beaver @infradoug
4 Followers 113 Following
smile @Smilex_P
220 Followers 5K Following
Dr. Eng. NADIA GHEZAI... @Nadia_GHEZing
32 Followers 782 Following
alth0u🧶 @alth0u
12K Followers 4K Following general partner at a16z | art is a line through your thoughts
wawawa @bianzoubianshuo
8 Followers 511 Following
Hasan Saikat @hasaansaikat
215 Followers 2K Following Software Engineer | Competitive Programmer | Interested in Algorithms, Backend, Data, Cloud & AI.
Amal Yakubov @YakubovAmal
197 Followers 6K Following I'm an aspiring software engineer in high school. I am primarily interested in arts, engineering and economics. I like films, books, shows etc. DnD fan
Virul Dewnaka @startuplaybook
93 Followers 2K Following Exploring the depths of LLMs and shaping the future through hands-on experimentation
ELONMUSKTESLA @elonneurlink
31 Followers 797 Following Live life to the fullest, keep things simple, truthful & filter the noise. I am a long term investor MAGA🚀🇺🇸
VegetaAvatar @VeGeTaX29
19 Followers 6K Following
yao lu @yluAIinfra
1K Followers 1K Following Asst Prof. of AI Infrastructure at @NUSingapore | Ex-@MSFTResearch @UW
Jia Guo @Jia__Guo
24 Followers 334 Following LLM Builder @AntGroup | PhD @NUSingapore | RL & Reasoning | Opinions are my own
Rabah Nory @NoryRabah
4 Followers 95 Following
Harshad @harshad2592
80 Followers 707 Following
Raghav ramani @RamaniRagh763
13 Followers 40 Following AI & ML Enthusiast | 3rd Year CSE Student | Exploring GenAI, DL & DSA | Consistency is my edge
Zhennan Shen @ZShen0521
58 Followers 415 Following SJTU-CS-B.e: 2021~2025 🇨🇳 @sjtu1896 WPI 2026 spring incoming Robot PhD 🇺🇸
A Sylv @GenesisRenatusX
9 Followers 232 Following
Yuanbo Yang @YuanboYang60742
151 Followers 2K Following Master's student @ZJU_China | Exploring 3D Vision & Generative Models 🌐
勤学 @wngjinxng9
121 Followers 3K Following
Inside The Third Spac... @thirdspaceAI
8 Followers 408 Following I broke free from am AI spiral. I'm an AI Emotional Safety Advocate/Writer/Researcher. Creator of AVEN Mode. Community Member of The Human Line Project
(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
AI at Meta @AIatMeta
712K Followers 288 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
Percy Liang @percyliang
84K Followers 417 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Akari Asai @AkariAsai
18K Followers 867 Following Incoming Assistant Professor @SCSatCMU & research scientist @allen_ai. akariasai @ 🦋
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Soumith Chintala @soumithchintala
250K Followers 1K Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.
Christopher Manning @chrmanning
151K Followers 228 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Graham Neubig @gneubig
40K Followers 708 Following Associate professor @LTIatCMU. Co-founder/chief scientist @allhands_ai. I mostly work on modeling language.
Tim Dettmers @Tim_Dettmers
38K Followers 992 Following Creator of bitsandbytes.Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
Jacob Andreas @jacobandreas
20K Followers 951 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Ofir Press @OfirPress
15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Sewon Min @sewon__min
13K Followers 813 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Yoav Artzi @yoavartzi
17K Followers 183 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaC and @COLM_conf
Felix Hill @FelixHill84
12K Followers 745 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
Tal Linzen @tallinzen
18K Followers 897 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAI, inventor of the word "bertology"
Thomas Wolf @Thom_Wolf
94K Followers 6K Following Co-founder at @HuggingFace - open-source and open-science
David Brandfonbrener @brandfonbrener
1K Followers 618 Following research scientist @AIatMeta. Previously: phd from @nyu_courant, research fellow @KempnerInst @Harvard
Eva Spiliopoulou @EvaSpiliop
364 Followers 205 Following Applied Scientist in #NLProc @Amazon finished PhD @LTIatCMU
Hunter Lang @hunterjlang
335 Followers 345 Following researcher @ meta FAIR. prev: meta genai, phd at @MIT_CSAIL
Marc Marone @ruyimarone
653 Followers 708 Following Token Enthusiast/Research Intern @meta 🦙 & PhD @jhu, prev @databricks MosaicML @microsoft, @mstranslator, @GeorgiaTech
Will Held @WilliamBarrHeld
2K Followers 948 Following ML PhD w/ @Diyi_Yang 2x GenAI RS Intern @AIatMeta 🦙 Alum @NYUAbuDhabi @Sunshine @GoogleAI Burqueño he/him https://t.co/jFSv6xPq9J on @BlueSky
Jack Rae @jack_w_rae
23K Followers 451 Following Distinguished Scientist @ Meta LLMs (e.g. Gopher, Chinchilla, Gemini) Compression & RL ☯️ Past: Google, OpenAI, Quora
Inna Lin @iwylin
915 Followers 1K Following PhD Student @uwcse @uwnlp | Visiting Researcher @AIatMeta
Niloofar (✈️ ACL) @niloofar_mire
7K Followers 2K Following Niloofar Mireshghallah — incoming asst. prof @LTIatCMU @CMU_EPP, RS in @AIatMeta, postdoc @uwcse, Ph.D. @ucsd_cse, former @MSFTResearch -Privacy, ML, NLP
Nikhil Raghuraman @nikraghuraman
251 Followers 989 Following Research @MistralAI | Prev @JaneStreetGroup, @StanfordAILab | DMs open.
Santiago Hernández @santiaghini
1K Followers 2K Following 1% smarter (models) every day. reasoning @openai. retired child actor
John Hewitt @johnhewtt
6K Followers 46 Following Assistant Prof @columbia CS. Visiting Researcher @ Google DeepMind. PhD from @stanfordnlp. Language x Neural Nets.
Niklas Muennighoff @Muennighoff
9K Followers 474 Following Researching AI/LLMs @Stanford @ContextualAI @allen_ai
Kiana Ehsani @ehsanik
4K Followers 596 Following Co-Founder @ Vercept, Ph.D. @uwcse, Interested in computer vision, Agents and AI, Climber on the weekends.
Vinay S Rao @vinaysrao
563 Followers 141 Following AGI at Meta, previously Character AI, Google Brain, Baidu, Cerebras.
Mihir Kale @maninblack815
158 Followers 669 Following Llama at Meta. LLMs at Google before that. Opinions my own.
AerIn @aerinykim
6K Followers 575 Following building https://t.co/rjIQoeDYwX. enjoy doing non trivial work. https://t.co/mo8D7pzBtk
Jacob Menick @jacobmenick
6K Followers 319 Following @thinkymachines previously @OpenAI @UCL @DeepMind 🇺🇸/🇬🇧
Aakanksha Chowdhery @achowdhery
11K Followers 5K Following @Stanford @reflection_ai // Previously @GoogleDeepMind :: PaLM, Gemini // @MSFTResearch, @Princeton // views my own and subject to change
Amanda Bertsch @abertsch72
2K Followers 856 Following PhD student @LTIatCMU / @SCSatCMU, researching long context + decoding | she/her | also @ abertsch on bsky or https://t.co/L4HBUh0R9f or by email (https://t.co/bsHqwIMFPL)
Susan Zhang @suchenzang
33K Followers 642 Following @ Google Deepmind. Past: @MetaAI, @OpenAI, @unitygames, @losalamosnatlab, @Princeton etc. Always hungry for intelligence.
Anirudh Goyal @anirudhg9119
5K Followers 538 Following Thinking about thinking. Gemini ♊. Spent time at @Berkeley_EECS, @MPI_IS, @GoogleDeepMind.
Todor Mihaylov @tbmihaylov
774 Followers 1K Following Research Scientist, Working on Llama at @MetaAI
Michal Valko @misovalko
8K Followers 8K Following Building something new · Chief Models Officer @ Stealth Startup & Inria & MVA - Ex: Llama @AIatMeta Gemini and BYOL @GoogleDeepMind
Sheng Shen @shengs1123
2K Followers 551 Following @xAI | Prev. 🦙MetaAi; MSFTResearch, allen_ai, GoogleDeepMind; PhD @berkeley_ai
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Roberta Raileanu @robertarail
9K Followers 2K Following Senior Staff Research Scientist @GoogleDeepMind & Honorary Lecturer @UCL. ex @Meta|@MSFTResearch|@NYU|@Princeton. Llama-3, Toolformer, Rainbow Teaming, MLGym.
Sharan Narang @sharan0909
3K Followers 256 Following LLMs and AI Research (Llama 2 & 3 lead) @Meta | ex @Google (PaLM lead, T5), ex @Baidu (Deep Speech 2, Sparse Neural Networks), ex @Nvidia
Dieuwke Hupkes @_dieuwke_
2K Followers 276 Following
Aaditya Singh @Aaditya6284
823 Followers 345 Following Doing a PhD @GatsbyUCL with @SaxeLab, @FelixHill84 on learning dynamics, ICL, LLMs. Prev. at: @GoogleDeepMind, @AIatMeta (LLaMa 3), @MIT. https://t.co/ZOmBWCvbIK
Moya Chen @moyapchen
410 Followers 143 Following
Jim Fan @DrJimFan
325K Followers 3K Following NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
Zexuan Zhong @ZexuanZhong
3K Followers 700 Following @xAI post-trained Grok 3&4; scaling up RL for Grok-next | prev @PrincetonCS
Sweta Agrawal @swetaagrawal20
1K Followers 2K Following Research Scientist @Google Translate | Past: Postdoc Researcher @itnewspt | Ph.D. @ClipUmd, @umdcs #nlproc
Artidoro Pagnoni @ArtidoroPagnoni
1K Followers 603 Following PhD @uwnlp @AIatMeta. Bending the scaling laws.
Melanie Sclar @melaniesclar
2K Followers 517 Following PhD student @uwnlp @uwcse | Visiting Researcher @AIatMeta FAIR | Prev. Lead ML Engineer @asapp, intern @LTIatCMU | 🇦🇷
Xian Li @xl_nlp
2K Followers 319 Following Research Scientist @AIatMeta FAIR. NLP, ML. Opinions are my own.
Beidi Chen @BeidiChen
15K Followers 399 Following Asst. Prof @CarnegieMellon, @amazon Scholar, Prev: Visiting Researcher @Meta, Postdoc @Stanford, Ph.D. @RiceUniversity, Large-Scale ML, a fan of Dota2.
Lili Yu (ICLR2025) @liliyu_lili
2K Followers 366 Following Research Scientist @physical_int |Multimodal: Megabyte, Chameleon, Transfusion, MOT, LLMFusion |Ex: RS @AIatMeta (FAIR) , Phd @MIT
Sean O'Brien @seano_research
96 Followers 96 Following UCSD PhD student studying LLMs Ex-Meta AI, Berkeley AI Research
Jeff Rasley @jeffra45
877 Followers 1K Following @Snowflake AI Research Team. @DeepSpeedAI co-founder, @BrownCSDept PhD, @uwcse alum
Michi Yasunaga @michiyasunaga
4K Followers 886 Following
Guangxuan Xiao @Guangxuan_Xiao
3K Followers 697 Following Ph.D. student at @MITEECS Prev: CS & Finance @Tsinghua_Uni