Filip Pavetić @FPavetic
Joined April 2022
Tweets: 32
Followers: 51
Following: 136
Likes: 210
Thrilled to share our latest advances in video understanding 📽️: Gemini 2.5 Pro is a truly magical model to play with, excelling in traditional video analysis and unlocking new use cases I could not imagine a few months ago🪄 More in 🧵 and @Google blog: developers.googleblog.com/en/gemini-2-5-…
Gemini 2.0 Flash's video understanding is here 🚀 Think: search in videos via timecodes, extract text from moving camera footage, analyze screen recordings in real-time interactions with native audio out 🔊 Come and try it aistudio.google.com 😀 youtu.be/Mot-JEU26GQ?si…
amazing work from video understanding jesus @AntoineYang2 alongside @MarioLucic_ @FPavetic @skprat and many others! they've been bringing better, faster video reasoning to a whole new level and have so much more in store ✨🚀♊
Attending #NeurIPS2024? If you're interested in multimodal systems, building inclusive & culturally aware models, and how fractals relate to LLMs, we've 3 posters for you. I look forward to presenting them on behalf of our GDM team @ Zurich & collaborators. Details below (1/4)
🧶PaLI-3 achieves SOTA across many vision-language (and video!) tasks while being 10x smaller than its predecessor PaLI-X. At only 5B parameters, it's also smaller (and stronger) than the concurrent Fuyu-8B model, though sadly we cannot release the model (props to @AdeptAILabs)
TL;DR I was too lazy to keep a fork of MHA, and I was too tired of my exps blowing up due to too high LR. I am still amazed how useful this is even for small models - I can pre-train [Na]-ViT with 1e-2 (previously it blew up at ~5e-3). Try it out! https://t.co/3E4hqQ4bDj
Sparsity is one of the most promising areas in deep learning (tokens follow different routes in the model). However, these discrete decisions are messy to handle & optimize. Today we introduce Soft-MoE. The idea is simple: Don't route tokens, route linear combinations of them.
Introducing Soft MoE! Sparse MoEs are a popular method for increasing the model size without increasing its cost, but they come with several issues. Soft MoEs avoid them and significantly outperform ViT and different Sparse MoEs on image classification. arxiv.org/abs/2308.00951
NaViT (arxiv.org/abs/2307.06304) sets us free from square boxes and lets us think outside the box! Let creativity flow and go for the natural designs we've always wanted in ViTs. I share a few cool ideas that are made possible with NaViT:
At CVPR? Three papers from the Google Deepmind (formerly Brain) Vision team in Berlin/Zürich/Amsterdam (+collaborators) there. If interested in the work or the team, track down the authors!
Quick summary of our recent work on scaling Vision Transformers - solving stability issues, making training more efficient and cool results: ai.googleblog.com/2023/03/scalin…
Learn about ViT-22B, the result of our latest work on scaling vision transformers to create the largest dense vision model. With improvements to both the stability and efficiency of training, ViT-22B advances the state of the art on many vision tasks → ai.googleblog.com/2023/03/scalin…
2️⃣2️⃣🅱️: We trained a 22B parameter ViT model, and scale continues to prove its merit! I want to zero in on an aspect of this which is useful however at all scales: a method for improving training stability in transformers. arxiv.org/abs/2302.05442
Scaling Vision Transformers to 22 billion parameters continues to improve ImageNet and OOD classification. And while ImageNet top1-accuracy seems to saturate short of 91% after fine-tuning, ObjectNet accuracy continues to increase, resulting in better effective robustness. https://t.co/0UaEwbXH54
1/ There is a huge headroom for improving capabilities of our vision models and given the lessons we've learned from LLMs, scaling is a promising bet. We are introducing ViT-22B, the largest vision backbone reported to date: arxiv.org/abs/2302.05442
Basil and I will present this work, today at @NeurIPSConf. Join us at 4pm in the 2nd poster session!
Beep beep! Introducing LIMoE, the Language Image Mixture of Experts: a single model, processing both modalities for contrastive image-text modelling. Cruises straight to 84.1% 0shot ImageNet accuracy without any modality-specific architectures or pre-training. (1/10)
Stop by the Google booth at #ECCV2022 at 3:30 pm today to see a demo presented by Austin Stone, @MJLM3 and @agritsenko about OWL-ViT, a simple and scalable approach for open-vocabulary object detection and image-conditioned detection. Try it yourself at bit.ly/owl-vit-demo.

Afroz Mohiuddin @afrozenator
1K Followers 5K Following @OpenAI, ex @Google, @AIAtMeta. Interested in Science, Psychology, Investing and generally everything. Good Thoughts, Good Words, Good Deeds.
Filip Petkovski @fpetkovsky
173 Followers 198 Following Staff Production Engineer @Shopify. Thanos Metrics maintainer.
Antoine Yang @AntoineYang2
2K Followers 471 Following Senior Research Scientist @GoogleDeepMind, Gemini video 💎. Prev: PhD @Inria & @ENS_ULM, MEng @Polytechnique.
Katelyn @TuesoshdvZNCZ
61 Followers 142 Following
Fabian Mentzer @mentzer_f
3K Followers 226 Following Senior Research Scientist at Google DeepMind during the day, modular synth guy at night: https://t.co/Ea84aix9Pm
Sergi Caelles @skprat
923 Followers 77 Following Senior Research Engineer, Video Understanding in Gemini Team, @GoogleDeepMind. Prev: PhD student in #ComputerVision @ETH Zurich
Xiao Wang @brainshawn
148 Followers 31 Following Researcher in @GoogleDeepMind Zurich current: vision-language & data-centric research; 2015-2020: text understanding; before 2015: distributed systems
Alice Bizeul @AliceBizeul
493 Followers 555 Following Apple MLR Intern & PhD student @ETH_AI_Center working on self-supervised representation learning | Previously @EPFL @MIT, Research Intern @Amazon
Nathan Waters @NathanBWaters
89 Followers 2K Following Multi-modal on Gemini @GoogleDeepMind. Creator of @PlayAidAi
Joel Garcia @JoelG960
154 Followers 6K Following
Darren Laurie (DammK) @6dammk9
51 Followers 191 Following Postmodernist Astolfo. Pixiv: https://t.co/bhd5BYCdUY
Michelle @Michell75582704
540 Followers 7K Following Finance Enthusiast🤑 | | $1M+ Trading Journey | | Help Solopreneurs Achieve Financial Freedom| | Wealth - Health -Motivation - Improvement |
Glintz AI @Glintz_AI
63 Followers 609 Following Welcome to Glintz AI - All subscribers will get a follow from me 🤗
EmotionAI @EmotionAI_xyz
39 Followers 281 Following EmotionAI aims to humanize computer/user interfaces by introducing a systematic/comprehensive understanding of human emotions to the AGI project
Matej Jusup @MatejJusup
201 Followers 180 Following A PhD in multi-agent RL at ETH Zurich and a chess enthusiast (2585 Elo @Chesscom) who developed an LM @GoogleDeepMind capable of playing the game (3200 Elo).
Raghu Nathan S @_raghu_nathan
2 Followers 170 Following
LY @YantoLiem11
207 Followers 4K Following
Maxim Neumann @neu_maxim
134 Followers 141 Following @GoogleDeepMind. In the past life, research scientist on vegetation information retrieval and SAR remote sensing at JPL/@NASA.
Sjoerd van Steenkiste @vansteenkiste_s
2K Followers 675 Following Researching AI models that can make sense of the world / Gemini @GoogleAI
Thomas Unterthiner @TomUnterthiner
538 Followers 138 Following Machine Learning Researcher, Formerly @GoogleDeepMind & Google Brain. Tweets are my own and should never be taken seriously.
Maria Brbic @mariabrbic
2K Followers 223 Following Assistant Professor of #computerscience and #lifesciences at @EPFL @ICepfl @epflSV Previously @Stanford #AI #compbio
Piotr Padlewski @PiotrPadlewski
2K Followers 380 Following Multimodal @anthropic. ex Chief Meme Officer at Reka, ex-Google Deepmind/Brain Zurich
Simon Kornblith @skornblith
3K Followers 953 Following researcher/engineer @AnthropicAI | former @GoogleDeepMind @mitbrainandcog @zotero | @[email protected]
tgeo92 @Tjoskz
443 Followers 5K Following
Michael Tschannen @mtschannen
3K Followers 674 Following Research Scientist @GoogleDeepMind. Representation learning for multimodal understanding and generation. Personal account.
Ibrahim Alabdulmohsin... @ibomohsin
1K Followers 785 Following AI Research Scientist at @GoogleDeepmind
Brad Neuberg @bradneuberg
12K Followers 9K Following Staff Machine Learning engineer @planet. Prev @ Dropbox & Google. Started coworking. Interests: ML, space, Earth Observation, VR. https://t.co/m7fXSRYQW3
Ajay Jain @ajayj_
7K Followers 4K Following Co-founder @genmoai. Co-created denoising diffusion (DDPM), DreamFusion, Dream Fields. Ex Ph.D. @berkeley_ai, @googleai, @facebookai, @nvidiaai, @mit
Pavel Izmailov @Pavel_Izmailov
8K Followers 1K Following Researcher @AnthropicAI 🤖 Incoming Assistant Professor @nyuniversity 🏙️ Previously @OpenAI #StopWar 🇺🇦
Dmitrii Tochilkin @cut_pow
3K Followers 222 Following 3D generation/reconstruction research ex. research @StabilityAI, @Google, @Yandex
Mario Lucic @MarioLucic_
3K Followers 229 Following Pushing the frontier of multimodal intelligence in Gemini with a focus on video understanding.
Zachary Schlosser @Zach_Schloss
399 Followers 3K Following Cultivating individual and collective wisdom. Global development | regional sovereignty | existential risk/hope. Header: Xavi Bou
Carlos Riquelme @rikelhood
2K Followers 2K Following principal researcher @MicrosoftAI previously @StabilityAI @GoogleBrain @Stanford
Neil Houlsby @neilhoulsby
6K Followers 339 Following Member of Technical Staff at Anthropic Amateur athlete https://t.co/G1kDE7Dyau
Suzanne Stathatos @suzstathatos
804 Followers 1K Following PhD candidate @Caltech developing efficient and environmentally-motivated methods in computer vision 🌎 Prev. SDE @Amazon, @NASAJPL; BA, MS @Stanford
Justin Kay @__justinkay
1K Followers 2K Following PhD student @MIT. Co-founder & CTO https://t.co/1LGXaee5ui. https://t.co/yBlEyqXEOa
Sara Beery @sarameghanbeery
11K Followers 3K Following Research on AI and biodiversity 🌍 Asst Prof at @MIT_CSAIL, #AIforConservation slack and @cv4ecology founder
Garrett Merz @merz_garrett
698 Followers 2K Following Postdoc, AI for physics @datascience_uw- prev @UMichPhysics, @OSUphysics. Empty hands & desire to unbuild walls. he/him, more or less
Jeremiah Harmsen @JeremiahHarmsen
2K Followers 540 Following Creator of #TensorFlowHub and @TensorFlow Serving. Lead in Google Brain.
Xiaohua Zhai @XiaohuaZhai
11K Followers 311 Following Researcher at Meta (previously at OpenAI Zürich, Google DeepMind)
Sergi Caelles @skprat
923 Followers 77 Following Senior Research Engineer, Video Understanding in Gemini Team, @GoogleDeepMind. Prev: PhD student in #ComputerVision @ETH Zurich
Antoine Yang @AntoineYang2
2K Followers 471 Following Senior Research Scientist @GoogleDeepMind, Gemini video 💎. Prev: PhD @Inria & @ENS_ULM, MEng @Polytechnique.
Eureka Labs @EurekaLabsAI
73K Followers 1 Following We are building a new kind of school that is AI native.
Peter J. Liu @peterjliu
8K Followers 2K Following AI research-eneur. Hiring eng: https://t.co/fv5QBjsv90. Was Research Scientist @ Google Brain / DeepMind, language model research. 🇨🇦🇺🇸
Sara Hooker @sarahookr
49K Followers 9K Following I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
Restoran Oliva ZG @restoran_oliva
16 Followers 1 Following Gastronomic delicacies,personnel and modern interieur make sure that every moment in the restaurant Oliva is a little heaven and your place to be in Zagreb.
Shakir Mohamed @shakir_za
45K Followers 1K Following ML with Social Purpose. @[email protected] | Research Scientist @DeepMind | Strengthening African ML @DeepIndaba. He/Him. South African 🇿🇦🏳️🌈🌍
Jürgen Schmidhuber @SchmidhuberAI
163K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
Paul Rubenstein @PaulKRubenstein
382 Followers 191 Following Multimodal LLMs at Google DeepMind in Zurich, views my own
ICLR 2026 @iclr_conf
52K Followers 53 Following International Conference on Learning Representations #ICLR2026. SPC is @BharathHarihar3 and GC is @cvondrick
NBA @NBA
48.4M Followers 2K Following The 2025-26 NBA season tips off Tuesday, Oct. 21 on NBC and Peacock!
Trends in Cognitive S... @TrendsCognSci
30K Followers 516 Following Trends in Cognitive Sciences - monthly review journal featuring developments across cog sci and neurosci. Posts by the editor.
Večernji list @vecernji_list
205K Followers 635 Following Večernji list počeo je izlaziti 1. 7. 1959.
Matej Jusup @MatejJusup
201 Followers 180 Following A PhD in multi-agent RL at ETH Zurich and a chess enthusiast (2585 Elo @Chesscom) who developed an LM @GoogleDeepMind capable of playing the game (3200 Elo).
Jason Wei @_jasonwei
98K Followers 634 Following ai researcher @meta superintelligence labs, past: openai, google 🧠
Kaggle @kaggle
305K Followers 284 Following Kaggle is the largest global AI community of developers, researchers, and enthusiasts who compete, collaborate, and benchmark what's next in AI.
Xiao Wang @brainshawn
148 Followers 31 Following Researcher in @GoogleDeepMind Zurich current: vision-language & data-centric research; 2015-2020: text understanding; before 2015: distributed systems
Thomas Unterthiner @TomUnterthiner
538 Followers 138 Following Machine Learning Researcher, Formerly @GoogleDeepMind & Google Brain. Tweets are my own and should never be taken seriously.
#ICCV2025 @ICCVConference
11K Followers 63 Following Official account for the IEEE/CVF International Conference on Computer Vision. #ICCV2025 Honolulu 🇺🇸 Hosted by @natanielruizg @anfurnari @YVinker @CSProfKGD
rohan anil @_arohan_
25K Followers 2K Following
Bojan Tunguz @tunguz
252K Followers 8K Following ML ex Nvidia. Creator of @trainxgb. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
Anthropic @AnthropicAI
637K Followers 35 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.
Stanford AI Lab @StanfordAILab
211K Followers 332 Following The Stanford Artificial Intelligence Laboratory (SAIL), a leading #AI lab since 1963. ⛵️🤖 Emmy-winning video: https://t.co/lV9smZTC1m
Peking University @PKU1898
680K Followers 202 Following Peking University was established in 1898. This account is dedicated to #PekingUniversity’s global outreach and communications.
Tsinghua University @Tsinghua_Uni
773K Followers 406 Following Tsinghua University is a research university located in Beijing, China, established in 1911.
Towards Data Science @TDataScience
244K Followers 2K Following The world's leading publication for data science and artificial intelligence professionals. Submit an Article ✍️ https://t.co/57pIMegK1o
AI Proteins @AI_Proteins
3K Followers 29 Following better medicines, created rapidly through de novo protein design
Zoubin Ghahramani @ZoubinGhahrama1
32K Followers 670 Following VP Research, Google DeepMind, ex-head of Google Brain. Professor at University of Cambridge. Machine Learning Researcher. ex-Chief Scientist & VP of AI, Uber.
Demis Hassabis @demishassabis
489K Followers 146 Following Nobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
Isomorphic Labs @IsomorphicLabs
29K Followers 75 Following We are reimagining the entire drug discovery process from first principles with an AI-first approach. Learn more: https://t.co/eC7rjmB3EZ
John Carmack @ID_AA_Carmack
1.1M Followers 273 Following AGI at Keen Technologies, former CTO Oculus VR, Founder Id Software and Armadillo Aerospace
Fabian Mentzer @mentzer_f
3K Followers 226 Following Senior Research Scientist at Google DeepMind during the day, modular synth guy at night: https://t.co/Ea84aix9Pm
CNN @CNN
63.7M Followers 1K Following It’s our job to #GoThere and tell the most difficult stories. For breaking news, follow @CNNBRK and download the CNN app ➡️ https://t.co/7PQD7o6fLw
Maria Brbic @mariabrbic
2K Followers 223 Following Assistant Professor of #computerscience and #lifesciences at @EPFL @ICepfl @epflSV Previously @Stanford #AI #compbio
Danny Driess @DannyDriess
4K Followers 327 Following Research Scientist @physical_int. Formerly Google DeepMind
Piotr Padlewski @PiotrPadlewski
2K Followers 380 Following Multimodal @anthropic. ex Chief Meme Officer at Reka, ex-Google Deepmind/Brain Zurich
UCLA Women's Basketba... @UCLAWBB
27K Followers 2K Following Official page for UCLA Women's Basketball #GoBruins For more coverage visit: https://t.co/03nWbJLQWE
UCLA Men’s Basketba... @UCLAMBB
113K Followers 202 Following The official X account of the 𝐔𝐂𝐋𝐀 𝐦𝐞𝐧'𝐬 𝐛𝐚𝐬𝐤𝐞𝐭𝐛𝐚𝐥𝐥 program.
UCLA @UCLA
264K Followers 242 Following The official account for the #1 public university in the nation 8 years in a row. Dedicated to research, education and service.
The New York Times @nytimes
55.1M Followers 849 Following News tips? Share them here: https://t.co/ghL9OoYKMM
UC Berkeley @UCBerkeley
247K Followers 670 Following Official account of the University of California, Berkeley. Home of the @CalAthletics Golden Bears. 🐻 #BerkeleyNews #GoBears
Berkeley AI Research @berkeley_ai
225K Followers 367 Following We're graduate students, postdocs, faculty and scientists at the cutting edge of artificial intelligence research.
BBC News (World) @BBCWorld
42.0M Followers 17 Following News, features and analysis from the World's newsroom. Breaking news, follow @BBCBreaking. UK news, @BBCNews. Latest sports news @BBCSport
DuckDuckGo @DuckDuckGo
2.7M Followers 4 Following Independent online protection company. Get our mobile & desktop browser with protections built-in, including our search engine that doesn't track you.
NVIDIA AI @NVIDIAAI
237K Followers 793 Following The latest breakthroughs and the future of AI for business leaders.