Woohoo! Imagine, Verify, Execute (IVE) is accepted to CoRL 2025! 🎉
Congrats to the incredible @umdcs students Seungjae Lee @JayLEE_0301, Daniel Ekpo (@daniekpo7), Haowen Liu!
We will present FlexTok at #ICML2025 on Tuesday! Drop by to chat with @JRAllardice and me if you're interested in tokenization, flexible ways to encode images, and generative modeling.
📆 Tue, Jul 15, 16:30 PDT
📍 East Exhibition Hall, Poster E-3010
🌐 flextok.epfl.ch
Which multimodal LLM should you be using to edit graphics in Blender?
Today, we’re releasing our #CVPR2025 Highlight🌟 work, #BlenderGym 🏋️‍♀️, the first agentic 3D graphics editing benchmark that will tell you exactly how multimodal LLMs compare in their Blender-editing skills.…
🏡Building realistic 3D scenes just got smarter!
Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes.
How does it work?🧵👇
Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖
Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-roboti…
In the past, we extended the convolution operator to go from low-level image processing to high-level visual reasoning. Can we also extend physical operators for more high-level physical reasoning?
Introducing the Denoising Hamiltonian Network (DHN): arxiv.org/pdf/2503.07596
Thrilled to announce that SG-I2V has been accepted at #ICLR2025 ! Huge thanks to the collaborators, reviewers, and ACs. Looking forward to presenting this in Singapore!
Congratulations to @UofTCompSci undergrads Helen Li, Junru Lin, Leo Tenenbaum and Sarah Walker who have received honourable mentions in the @CRAtweets 2024-2025 Outstanding Undergraduate Researcher Award program! cra.org/about/awards/o…
🔥 Introducing MVLift: Generate realistic 3D motion without any 3D training data - just using 2D poses from monocular videos! Applicable to human motion, human-object interaction & animal motion. Joint work w/ @jiajunwu_cs & Karen
💡 How? We reformulate 3D motion estimation as…
Do large multimodal models understand how to make dresses for your winter holiday party💃?
We introduce AIpparel, a vision-language-garment model capable of generating and editing simulation-ready sewing patterns from text and images. Project page at georgenakayama.github.io/AIpparel/.…
[Hiring!] I am hiring multiple PhDs @CSatUSC @USCViterbi for this cycle. If you're interested in scene representations, neural simulation, generative AI, and robotics, feel free to mention my name in your application (no need to email). For USC masters/undergrads who're…
Sharing something exciting we've been working on as a Thanksgiving gift: Diffusion Self-Distillation (DSD), which redefines zero-shot customized image generation using FLUX.
DSD is like DreamBooth, but zero-shot/training-free. It works across any input subject and desired…
📢 Excited to share our new work: AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
snap-research.github.io/ac3d
We analyze what pre-trained video diffusion transformers understand about 3D and demonstrate dynamic scene generation with 3D control.
I'm recruiting graduate students for Fall 2025 to work at the intersection of Computer Vision, Deep Learning, and Robotics.
If you are interested in building a controllable organic simulation engine and enabling safe robot learning, consider applying to UofT's CS PhD program 1/n
Symmetries are everywhere, from butterfly wings to Greek temples. But detecting them in noisy data? That’s a challenge. 🦋🏛
Our #SIGGRAPHAsia2024 paper, Robust Symmetry Detection via Riemannian Langevin Dynamics, tackles this: symmetry-langevin.github.io
🧵(1/n)