-
Tweets731
-
Followers570
-
Following512
-
Likes187
New research from FAIR- Active Reading: a framework to learn a given set of material with self-generated learning strategies for generalized and expert domains(such as Finance). Absorb significantly more knowledge than vanilla finetuning and usual data augmentations strategies
New research from FAIR- Active Reading: a framework to learn a given set of material with self-generated learning strategies for generalized and expert domains(such as Finance). Absorb significantly more knowledge than vanilla finetuning and usual data augmentations strategies
🔍 How do we teach an LLM to 𝘮𝘢𝘴𝘵𝘦𝘳 a body of knowledge? In new work with @AIatMeta, we propose Active Reading 📙: a way for models to teach themselves new things by self-studying their training data. Results: * 𝟔𝟔% on SimpleQA w/ an 8B model by studying the wikipedia…
🚀 Introducing BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent. It is a new Deep-Research evaluation benchmark built on top of BrowseComp. It features - 📚 a fixed, carefully curated corpus of web documents - ✅ human-verified positive…
Factuality and logical reasoning (e.g., math, code) favor different sets of reasoning patterns. 🧑🍳 A fresh RL recipe to improve factuality is here — crafted by the amazing @ccsasuke!
Factuality and logical reasoning (e.g., math, code) favor different sets of reasoning patterns. 🧑🍳 A fresh RL recipe to improve factuality is here — crafted by the amazing @ccsasuke!
...is today a good day for new paper posts? 🤖Learning to Reason for Factuality 🤖 📝: arxiv.org/abs/2508.05618 - New reward func for GRPO training of long CoTs for *factuality* - Design stops reward hacking by favoring precision, detail AND quality - Improves base model across…
Now accepted by #ACL2025 main. We propose a training framework to generate strong smaller retriever with integration of LLM data augmentation and LLM pruning, letting smaller retriever improves together with the advancement of LLM.
Now accepted by #ACL2025 main. We propose a training framework to generate strong smaller retriever with integration of LLM data augmentation and LLM pruning, letting smaller retriever improves together with the advancement of LLM.
Accepted by #ACL2025! Congrats @mingdachen and the team🥳 Several cool ideas: - Maintain an explicit editable working memory during generation; - Actively integrate external feedback (factual check w/ VeriScore); A smart LM learns to memorize, a smarter LM learns to forget too!
Accepted by #ACL2025! Congrats @mingdachen and the team🥳 Several cool ideas: - Maintain an explicit editable working memory during generation; - Actively integrate external feedback (factual check w/ VeriScore); A smart LM learns to memorize, a smarter LM learns to forget too!
Today is the start of a new era of natively multimodal AI innovation. Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick — our most advanced models yet and the best in their class for multimodality. Llama 4 Scout • 17B-active-parameter model…
🧵 Adapting your LLM for new tasks is dangerous! A bad training set degrades models by encouraging hallucinations and other misbehavior. Our paper remedies this for RAG training by replacing gold responses with self-generated demonstrations. Check it out: arxiv.org/abs/2502.10
Today we released DRAMA, a set of small (sub-1B) multilingual dense retrievers that perform strongly across multiple languages and tasks. It also offers flexible model sizes and embedding dimensionalities. Led by my awesome intern @xueguang_ma arxiv.org/abs/2502.18460
Today we released DRAMA, a set of small (sub-1B) multilingual dense retrievers that perform strongly across multiple languages and tasks. It also offers flexible model sizes and embedding dimensionalities. Led by my awesome intern @xueguang_ma arxiv.org/abs/2502.18460
New paper! Byte-Level models are finally competitive with tokenizer-based models with better inference efficiency and robustness! Dynamic patching is the answer! Read all about it here: dl.fbaipublicfiles.com/blt/BLT__Patch… (1/n)
I will present our paper FLAME on factuality alignment for LLMs with @luyu_gao at #NeurIPS2024! 🎉 Join us at East Exhibit Hall A-C, Booth #3501 for a chat on Wed (Dec 11, 4:30--7:30 pm). Looking forward to connecting! More detail: neurips.cc/virtual/2024/p…
I will present our paper FLAME on factuality alignment for LLMs with @luyu_gao at #NeurIPS2024! 🎉 Join us at East Exhibit Hall A-C, Booth #3501 for a chat on Wed (Dec 11, 4:30--7:30 pm). Looking forward to connecting! More detail: neurips.cc/virtual/2024/p…
🚨 I’m on the job market this year! 🚨 I’m completing my @uwcse Ph.D. (2025), where I identify and tackle key LLM limitations like hallucinations by developing new models—Retrieval-Augmented LMs—to build more reliable real-world AI systems. Learn more in the thread! 🧵
1/ Excited to share that our paper "NEST🪺: Nearest Neighbor Speculative Decoding for LLM Generation and Attribution" is accepted at #NeurIPS2024! 🚀 Catch us at the poster session on Thu, Dec 12, 4:30–7:30 PM PST, East Exhibit Hall A-C, #2201. [Details: neurips.cc/virtual/2024/p…]
1/ Excited to share that our paper "NEST🪺: Nearest Neighbor Speculative Decoding for LLM Generation and Attribution" is accepted at #NeurIPS2024! 🚀 Catch us at the poster session on Thu, Dec 12, 4:30–7:30 PM PST, East Exhibit Hall A-C, #2201. [Details: neurips.cc/virtual/2024/p…]
Excited to open-source a new hallucinations eval called SimpleQA! For a while it felt like there was no great benchmark for factuality, and so we created an eval that was simple, reliable, and easy-to-use for researchers. Main features of SimpleQA: 1. Very simple setup: there…
🚀 Excited to share our latest work: Transfusion! A new multi-modal generative training combining language modeling and image diffusion in a single transformer! Huge shout to @violet_zct @omerlevy_ @michiyasunaga @arunbabu1234 @kushal_tirumala and other collaborators.
🚀 Excited to share our latest work: Transfusion! A new multi-modal generative training combining language modeling and image diffusion in a single transformer! Huge shout to @violet_zct @omerlevy_ @michiyasunaga @arunbabu1234 @kushal_tirumala and other collaborators.
Lillian!
Last week we released Meta Chameleon: a new mixed-modal research model from Meta FAIR. Get the models ➡️ go.fb.me/4m87kk The 7B & 34B safety tuned models we’ve released can take any combination of text and images as input and produce text outputs using a new early…

Akari Asai @AkariAsai
18K Followers 867 Following Incoming Assistant Professor @SCSatCMU & research scientist @allen_ai. akariasai @ 🦋
Sewon Min @sewon__min
13K Followers 814 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Weijia Shi @WeijiaShi2
9K Followers 1K Following PhD student @uwnlp @allen_ai | Prev @MetaAI @CS_UCLA | 🏠 https://t.co/Q6Mzg8ow2j
Leo Boytsov @srchvrs
9K Followers 2K Following Machine learning scientist and engineer speaking πtorch & C++. Past @LTIatCMU, @awscloud. Opinions sampled from MY OWN 100T param LM.
Bill Yuchen Lin @billyuchenlin
23K Followers 3K Following Building Grok @xAI. Affiliate Assistant Prof @UW; Focusing on Grok Code for Macrohard now. Ex: @allen_ai, Google AI, Meta FAIR.
Yu Su (hiring postdoc... @ysu_nlp
11K Followers 948 Following cooking something new. prof. @osunlp. sloan fellow. intelligence and agents. author of Mind2Web, SeeAct, MMMU, HippoRAG, BioCLIP, UGround.
Ofir Press @OfirPress
15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Boshi Wang @BoshiWang2
2K Followers 507 Following Fourth-year Ph.D. @OhioState. Prev intern @MSFTResearch
Nandan Thakur @beirmug
2K Followers 3K Following PhD @uwaterloo🌲 IR & NLP | I like good evals🔎 l Prev: intern @DbrxMosaicAI @GoogleAI & RA @UKPLab | https://t.co/kxQprYr7Xn, https://t.co/YVvVjSyXOS, TREC-RAG and FreshStack! ✨
Ori Ram @ori__ram
968 Followers 384 Following Research Scientist @GoogleAI. Working on #NLProc, and specifically on making Gemini more factual. PhD from @TelAvivUni
hyunji amy lee @hyunji_amy_lee
819 Followers 512 Following Incoming postdoc @unc_ai_group w/ @mohitban47. PhD student @kaist_ai. Previously: @allen_ai @Adobe.
Zexuan Zhong @ZexuanZhong
3K Followers 700 Following @xAI post-trained Grok 3&4; scaling up RL for Grok-next | prev @PrincetonCS
Stefan @stefan_star
309 Followers 2K Following building with LLMs, blockchain enthusiast, full stack developer prev @etherchain_org
Baiyun Jing @BaiyunJ
75 Followers 603 Following
florin @florin10
16 Followers 220 Following
Yixin Lin @yixin_lin_
2K Followers 7K Following something new. prev: embodied AI @GoogleDeepMind, FAIR/@AIatMeta, Google Brain.
elias bahid @elias_bahid
21 Followers 222 Following
Joe Mayo @JoeMayo
16K Followers 7K Following Author and Independent Consultant Recent books: - Programming the Microsoft Bot Framework/MSPress - C# Cookbook/O'Reilly Agents, AI, Generative AI, MCP, RAG
Ada Offonry @adaoffonry
395 Followers 3K Following AI/ML Recruitment | Career Strategist | Podcast Host at Adapted Ambitions |Speaker
Xian Li @xl_nlp
2K Followers 319 Following Research Scientist @AIatMeta FAIR. NLP, ML. Opinions are my own.
Seb Seb @SebaFraMar
2 Followers 103 Following
Wooseok Seo @just1nseo
49 Followers 265 Following PhD Student @yonsei_u | Research Intern @LG_AI_Research
Marc Andreessen 🇺�... @pmarca
1.9M Followers 27K Following Yes, I can see some risk that your threat to jail Internet company executives for not censorsing aggressively enough could backfire.
Kamalika Chaudhuri @kamalikac
5K Followers 2K Following Director, FAIR @ Meta. Former Professor at UCSD. Researcher in AI privacy, security, and generalization.
Epsilon Guanlin Lee @Epsilon_Lee
256 Followers 3K Following PhD, MLer, CLer (NLPer), ML Engineer at https://t.co/gX6Lem59Co, have belief in interpretability research of AI/ML/NNs
Yichuan Wang @YichuanM
701 Followers 2K Following 1st year EECS PhD at UC Berkeley SkyLab @BerkeleySky, 2020 ACM class in SJTU, interested in MLSYS.
rahul x @rahulme74418504
58 Followers 2K Following
Leitian Tao @LeitianT
130 Followers 560 Following 3rd Machine Learning PhD student at @WisconsinCS | research scientist intern @AIatMeta FAIR | |ex Research intern @Adobe | BS 23' @WHU_1893
Parshin Shojaee @ParshinShojaee
3K Followers 1K Following PhD student @VT_CS | AI for Science, Math, Code, Reasoning | Intern @Apple | prev @Adobe
Zhepei Wei @weizhepei
188 Followers 531 Following Ph.D. Student @CS_UVA | Research Intern @Meta. Previously @AmazonScience. Research interest: ML/NLP/LLM.
Jason Weston @jaseweston
13K Followers 705 Following @MetaAI+NYU. NLP from scratch(Pretrain+FT LLM) 2008, MemNets (pre-Transformer) 2015, DrQA(pre-RAG) 2017, BlenderBot(dialog pre-ChatGPT) 2018+,Self-Reward+ more!
Jyoti Mann @jyoti_mann1
3K Followers 4K Following Tech Reporter @businessinsider prev @FT + hedgie consultant. 📧: [email protected] (my views)
Uiblaxui @Uiblaxui118224
5 Followers 506 Following
Oodaugoo @Oodaugoo845486
21 Followers 1K Following
sigchill @nishitmengar
474 Followers 6K Following
Virogi @Virogi292535
42 Followers 1K Following
Basit Mustafa @moltar81435
415 Followers 8K Following introverted but willing to discuss sanctuary moon innovation/ai + dsop/ai delivery @ https://t.co/nQ5pf3TzTZ
Muly Oved @mulyoved
122 Followers 2K Following
Pierre Chambon @PierreChambon6
806 Followers 2K Following NLP/Code Generation PhD at FAIR (Meta AI) and INRIA - previously researcher at Stanford University - MS Stanford 22’ - Centrale Paris P2020
Letitia Wong Martinez @wongletitia
26 Followers 5K Following
Muharrem AYICI @MuAyI
73 Followers 1K Following
ADAM @noadm19
114 Followers 8K Following
FUNNY MOON @FUNNYMOON168193
1 Followers 67 Following
Lee Yin @YinLi70917
33 Followers 618 Following I lead the technical recruitment team at Tencent, focusing on AI and Large Language Model initiatives.
Snoyrez @Snoyrezxicp
35 Followers 4K Following
Sanaudue @Sanauduef7d
41 Followers 4K Following
Adhiraj Ghosh ✈️ ... @adhiraj_ghosh98
258 Followers 507 Following ELLIS PhD @uni_tue | vision-language & data-centric ML @bethgelab 🦋: https://t.co/Q03vvJFIPw
Akari Asai @AkariAsai
18K Followers 867 Following Incoming Assistant Professor @SCSatCMU & research scientist @allen_ai. akariasai @ 🦋
Sewon Min @sewon__min
13K Followers 814 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Weijia Shi @WeijiaShi2
9K Followers 1K Following PhD student @uwnlp @allen_ai | Prev @MetaAI @CS_UCLA | 🏠 https://t.co/Q6Mzg8ow2j
Yann LeCun @ylecun
949K Followers 764 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Bill Yuchen Lin @billyuchenlin
23K Followers 3K Following Building Grok @xAI. Affiliate Assistant Prof @UW; Focusing on Grok Code for Macrohard now. Ex: @allen_ai, Google AI, Meta FAIR.
(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
Yi Tay @YiTayML
46K Followers 81 Following research scientist @googledeepmind ✨♊, model co-lead/captain of gemini deepthink imo gold medal 🥇, opinions are my own.
Yu Su (hiring postdoc... @ysu_nlp
11K Followers 948 Following cooking something new. prof. @osunlp. sloan fellow. intelligence and agents. author of Mind2Web, SeeAct, MMMU, HippoRAG, BioCLIP, UGround.
AI at Meta @AIatMeta
712K Followers 288 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Graham Neubig @gneubig
40K Followers 708 Following Associate professor @LTIatCMU. Co-founder/chief scientist @allhands_ai. I mostly work on modeling language.
Wenhu Chen @WenhuChen
22K Followers 663 Following AI researcher. Interested in Reasoning, Multimodal. I direct TIGER-Lab. Author of PoT, MMMU, MMLU-Pro, MAmmoTH, LongRAG, MAP-Neo, YuE, VL-Rethinker
Percy Liang @percyliang
84K Followers 417 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Mike Lewis @ml_perception
8K Followers 242 Following Llama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.
Ofir Press @OfirPress
15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Kyunghyun Cho @kchonyc
77K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Xin Eric Wang @xwang_lk
18K Followers 1K Following Professor @ UCSB (@ucsantabarbara). Head of Research @SimularAI. Interim Director @ucsbcrml. #Multimodal #Embodied #Agents. AI for Humanity in the long run.
Hu Xu @Hu_Hsu
723 Followers 667 Following FAIR, Foundational Data Research, #MetaCLIP (scaling CLIP data from scratch) for DINO, Llama, JEPA, PE, Movie Gen etc. @aiatmeta
Jessy Lin @realJessyLin
3K Followers 884 Following PhD @Berkeley_AI, visiting researcher @AIatMeta. Interactive language agents 🤖 💬
Jack Morris @jxmnop
45K Followers 977 Following research @cornell @meta // language models, information theory, science of AI
Weizhe Yuan @WeizheY
342 Followers 297 Following Ph.D. at @nyuniversity. Visiting researcher at @AIatMeta. Previous Intern @cohere, MCDS @LTIatCMU. Working on ML/NLP. Painting lover🎨.
Xian Li @xl_nlp
2K Followers 319 Following Research Scientist @AIatMeta FAIR. NLP, ML. Opinions are my own.
Paul Graham @paulg
2.1M Followers 774 Following
Yisong Yue @yisongyue
22K Followers 3K Following Machine Learning @Caltech (@YueLabCaltech). AI for invention at @AsariAILabs.
Barlas Oğuz @barlas_berkeley
19 Followers 25 Following Research scientist, Meta FAIR. ex-MSFT, Berkeley alumni
Orion Weller @orionweller
2K Followers 947 Following PhD student @jhuclsp interning @AIatMeta FAIR. Prev intern @GoogleDeepMind, @samaya_ai, @allen_ai. Research: LLMs, RAG, and IR
Shirley Wu @ShirleyYXWu
3K Followers 295 Following CS PhD candidate @Stanford working w/ @jure & @james_y_zou on LLM agents and alignment | Prev USTC, Intern @MSFTResearch, @NUSingapore
Joelle Pineau @jpineau1
15K Followers 441 Following Chief AI Officer, @cohere Professor of Computer Science, @mcgillu Core academic member, @Mila_Quebec Ex-Meta (FAIR team)
Zhuang Liu @liuzhuang1234
11K Followers 1K Following Assistant Professor @PrincetonCS. researcher in deep learning, vision, models. previously @MetaAI, @UCBerkeley, @Tsinghua_Uni
Pengfei Liu @stefan_fee
4K Followers 791 Following Associate Prof. at SJTU, leading GAIR Lab (https://t.co/Nfd8KmZx3B) Co-founder of Inspired Cognition, Postdoc at @LTIatCMU, Previously FNLP, @MILAMontreal,
Xuezhe Ma (Max) @MaxMa1987
2K Followers 401 Following Research Lead @USC_ISI and Research Assistant Professor @CSatUSC PhD at CMU ML/NLP @LTIatCMU @CarnegieMellon
Lili Yu (ICLR2025) @liliyu_lili
2K Followers 366 Following Research Scientist @physical_int |Multimodal: Megabyte, Chameleon, Transfusion, MOT, LLMFusion |Ex: RS @AIatMeta (FAIR) , Phd @MIT
Nandan Thakur @beirmug
2K Followers 3K Following PhD @uwaterloo🌲 IR & NLP | I like good evals🔎 l Prev: intern @DbrxMosaicAI @GoogleAI & RA @UKPLab | https://t.co/kxQprYr7Xn, https://t.co/YVvVjSyXOS, TREC-RAG and FreshStack! ✨
Jason Weston @jaseweston
13K Followers 705 Following @MetaAI+NYU. NLP from scratch(Pretrain+FT LLM) 2008, MemNets (pre-Transformer) 2015, DrQA(pre-RAG) 2017, BlenderBot(dialog pre-ChatGPT) 2018+,Self-Reward+ more!
Ari Holtzman @universeinanegg
3K Followers 2K Following Asst Prof @UChicagoCS & @DSI_UChicago, leading Conceptualization Lab https://t.co/BVCT3zdaNV Minting new vocabulary to conceptualize generative models.
Asli Celikyilmaz @ACL... @real_asli
3K Followers 1K Following Research Manager at FAIR Superintelligence Labs @Meta and Affiliate Faculty @uwcse | TACL editor-in-chief | Previously: @MSFTResearch, @UCBerkeley and @UofT
Xueguang Ma @xueguang_ma
839 Followers 637 Following PhD student at @uwaterloo. Working on encoding the world into vectors. Prev. intern at @Meta, @MSFTResearch, @amazon
Saining Xie @sainingxie
23K Followers 1K Following researcher in #deeplearning #computervision | assistant prof at @nyu_courant | rs @googledeepmind | past: rs @meta (FAIR) @ucsandiego | ynwa
Jerry Liu @jerryjliu0
65K Followers 1K Following co-founder/CEO @llama_index Careers: https://t.co/EUnMNmbCtx Enterprise: https://t.co/Ht5jwxSrQB
Jie Huang @jefffhj
13K Followers 639 Following Building intelligence @xAI. Grok-2🍍, 3🍫, 4🫐, 🪄. PhD from UIUC CS.
Beidi Chen @BeidiChen
15K Followers 399 Following Asst. Prof @CarnegieMellon, @amazon Scholar, Prev: Visiting Researcher @Meta, Postdoc @Stanford, Ph.D. @RiceUniversity, Large-Scale ML, a fan of Dota2.
Michi Yasunaga @michiyasunaga
4K Followers 886 Following
Shangbin Feng @shangbinfeng
4K Followers 2K Following PhD student @uwcse @uwnlp. Model collaboration, social NLP, networks and structures. #水文学家
Mikel Artetxe @artetxem
7K Followers 227 Following Co-founder @RekaAILabs and Honorary Researcher @Hitz_zentroa (University of the Basque Country) | Past: Research Scientist @AIatMeta (FAIR)
Maria Lomeli @MariaLomeli_
390 Followers 328 Following Researcher and engineer @AIatMeta, FAIR | PhD from @GatsbyUCL and former postdoc @CambridgeMLG
Minjoon Seo @seo_minjoon
2K Followers 578 Following Co-Founder & CEO at Config Intelligence, Associate Professor at KAIST
Jack Hessel @jmhessel
4K Followers 916 Following soon: @AnthropicAI. Seattle bike lane enjoyer. Opinions my own.
Sida Wang @sidawxyz
480 Followers 307 Following
Shunyu Yao @ShunyuYao12
19K Followers 1K Following @OpenAI Language agents (ReAct, Reflexion, Tree of Thoughts, SWE-agent, CoALA) for digital automation (WebShop, SWE-bench, tau-bench)
Joongwon Kim @danieljwkim
502 Followers 308 Following PhD student @uwcse @uwnlp | Currently at @AIatMeta | Former undergrad @Penn
Dipanjan Das @dipanjand
6K Followers 320 Following Researcher at @GoogleDeepmind. Factuality and Gemini x Search.