To me, diffusion LMs work because they remove unnecessary inductive biases. The left-to-right inductive bias is natural for human but is unlikely to be natural for AI. This gives more capacity to our models like Transformer having a bigger capacity than LSTM. Our experiment…
To me, diffusion LMs work because they remove unnecessary inductive biases. The left-to-right inductive bias is natural for human but is unlikely to be natural for AI. This gives more capacity to our models like Transformer having a bigger capacity than LSTM. Our experiment…
Boosting LLM Performance with Dynamic Skill Selection!
1/ 🚀 What if LLMs could get better at solving math problems by understanding the skills they need? We explored this idea by having LLMs identify and label the skills required for each problem. arxiv.org/abs/2405.12205
Boosting LLM Performance with Dynamic Skill Selection!
1/ 🚀 What if LLMs could get better at solving math problems by understanding the skills they need? We explored this idea by having LLMs identify and label the skills required for each problem. arxiv.org/abs/2405.12205
Interesting progress from @rm_rafailov and @DivGarg9 et. al following our work (applied to mathematical and commonsense reasoning):
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
arxiv.org/abs/2405.00451
(Also discussed in Llama-3 paper, @AIatMeta )
Interesting progress from @rm_rafailov and @DivGarg9 et. al following our work (applied to mathematical and commonsense reasoning):
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
arxiv.org/abs/2405.00451
(Also discussed in Llama-3 paper, @AIatMeta ) https://t.co/aq7Icjkpql
Discrete Key-Value Bottleneck (Updated)
Compresses the information of a pre-trained model in learnable "key-value" codebook such that knowledge can be quickly adapted in a continual learning fashion.
arxiv.org/abs/2207.11240
Temporal Latent Bottleneck combines recurrence and self-attention in an unified way. Recurrence integrates information over time, and self-attention models local dependencies in "short" context.
arxiv.org/abs/2205.14794
Temporal Latent Bottleneck combines recurrence and self-attention in an unified way. Recurrence integrates information over time, and self-attention models local dependencies in "short" context.
arxiv.org/abs/2205.14794 https://t.co/RtUdzw8tYO
77K Followers 2K Followinga combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
49K Followers 9K FollowingI lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
12K Followers 745 FollowingResearch Scientist, Deepmind
I try to think hard about everything I tweet, esp on 90s football and 80s music
None of my opinions are really someone else's
27K Followers 2K FollowingProfessor at UMD. AI security & privacy, algorithmic bias, foundations of ML.
Follow me for commentary on state-of-the-art AI.
63K Followers 2K FollowingResearch Scientist at Google DeepMind (WaveNet, Imagen, Veo). I tweet about deep learning (research + software), music, generative models (personal account).
42K Followers 865 FollowingFR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.
28K Followers 1K FollowingResearch at @GoogleDeepMind. Controllable World Simulators (GNNs, Structured World Models, Neural Assets). Veo Team (Ingredients to Video Co-Lead)
26K Followers 876 FollowingResearch Scientist Director in Meta FAIR. Reasoning, Optimization and Understanding LLM. Novelist in spare time. PhD in @CMU_Robotics.
691 Followers 4K FollowingBuilding Gen AI products. Ex-Facebook, I work on AI, Search and ML Infra. Cautiously optimistic about the future and a believer in being nuanced.
74 Followers 3K FollowingI am beggin proggmer from a village in india I am eager to learn programming please guide me fellow programing seniors this junior is willing to learn
20 Followers 309 FollowingWink builds bias-free AI products delivering personalized experiences. We create democratically accessible yet privately owned AI serving real user needs.
154 Followers 1K FollowingVP of Engineering, Arklex AI (@ArklexAI) | Adjunct, Columbia (@Columbia) | Director of Internships, ICPC Foundations (@icpcnews) | Stanford (BS ‘11 MS ‘13)
518 Followers 7K FollowingFounder @Setica —
🌐 https://t.co/k41rINekVX. Alien on planet Earth. Ai researcher and Indie Developer(web & apps).Building Ai models and Ai agents and Saas Apps
77K Followers 2K Followinga combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
45K Followers 64 FollowingStudent of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
12K Followers 745 FollowingResearch Scientist, Deepmind
I try to think hard about everything I tweet, esp on 90s football and 80s music
None of my opinions are really someone else's
57K Followers 619 FollowingDistinguished Professor (Emeritus), Oregon State Univ.; Former President, Assoc. for the Adv. of Artificial Intelligence; Robust AI & Comput. Sustainability
42K Followers 865 FollowingFR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.
11K Followers 723 Following"If there is not folly in the world, then the world itself is folly. You must understand that mistakes are not always regrets." - Paul Tobin, Bandette🤠
28K Followers 1K FollowingResearch at @GoogleDeepMind. Controllable World Simulators (GNNs, Structured World Models, Neural Assets). Veo Team (Ingredients to Video Co-Lead)
25K Followers 342 FollowingMatchmaker & Dating Coach For Men. 3,000+ happy clients. As seen in NYT, WSJ, Shark Tank. If you're single and ready to get unstuck, let's chat 🤠
11K Followers 749 Followingslightly less attractive cofounder @AskEureka: we’re replacing all doctors with AI. I tweet abt healthcare and tech, prev @Harvard @Google @BCG, dm to say hi :)
92K Followers 809 FollowingWriting about feelings you’ve had but don’t know how to describe. https://t.co/KIGd67QeaB Author of The Pluri Society on Amazon.
31K Followers 877 FollowingVP GenAI @Databricks. Former CEO/cofounder MosaicML & Nervana/IntelAI. Neuro + CS. I like to build stuff that will eventually learn how to build other stuff.
14K Followers 257 FollowingCo-lead of the GenMedia team working on Veo, Imagen, Genie, Nano Banana (aka Gemini-2.5-Flash-Image-Preview), ...
Research Scientist @ DeepMind
3K Followers 43 FollowingNPO founded by @Yoshua_Bengio, committed to advancing safe-by-design AI - OBNL fondée par @Yoshua_Bengio visant à concevoir des systèmes d'IA sécuritaires
9K Followers 102 FollowingMember of Technical Staff at Anthropic AlphaGo, AlphaZero, MuZero, AlphaCode, AlphaTensor, AlphaProof Gemini RL Prev Principal Research Engineer at DeepMind
3K Followers 4K FollowingHardware Implementation @ OpenAI. Prev: TPUs @ Google. Love problem solving and connecting dots. IC Design for AI and AI for ICs. https://t.co/mKd42Dds0c
25K Followers 206 FollowingWorking towards the safe development of AI for the benefit of all @UMontreal, @LawZero_ & @Mila_Quebec
A.M. Turing Award Recipient and most-cited AI researcher.
2K Followers 140 FollowingSilver Professor at NYU Courant and CDS, Research Scientist at FAIR
Research in Machine Learning, past in Quantum Computing & Finance. Posts my own.
13K Followers 753 FollowingResearch eng @GoogleDeepMind on Gemini pretrain. Personal acct. Past: swe intern @SpaceX, ugrad researcher in @tserre lab @BrownUniversity. All opinions my own.
85K Followers 1K Followingi help make https://t.co/jZh799yNH4, the best AI for self-improvement, introspection, and emotional processing. https://t.co/ac0cp4UZ9h
20K Followers 1K FollowingResearcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); learning in context; thinking about reasoning; babas of Inez Lily.
204K Followers 25 FollowingManus is the general AI agent that bridges minds and actions: it doesn't just think, it delivers results. Download our app: https://t.co/XSfjRhjdgo