[1/N] 🎥 We've made available a powerful spatial AI tool named ViPE: Video Pose Engine, to recover camera motion, intrinsics, and dense metric depth from casual videos!
Running at 3–5 FPS, ViPE handles cinematic shots, dashcams, and even 360° panoramas.
🔗 research.nvidia.com/labs/toronto-a…
We build Cosmos-Predict2 as a world foundation model for Physical AI builders — fully open and adaptable. Post-train it for specialized tasks or different output types.
Available in multiple sizes, resolutions, and frame rates.
📷 Watch the repo walkthrough…
Cosmos-Predict2 is our latest open video foundation model for Physical AI!
research.nvidia.com/labs/dir/cosmo…
If you’re at #cvpr2025, I would also love to chat with you about world models!
Cosmos-Predict2 is our latest open video foundation model for Physical AI!
research.nvidia.com/labs/dir/cosmo…
If you’re at #cvpr2025, I would also love to chat with you about world models!
Introducing NVIDIA Cosmos, an open-source, open-weight Video World Model. It's trained on 20M hours of videos and weighs from 4B to 14B. Cosmos offers two flavors: diffusion (continuous tokens) and autoregressive (discrete tokens); and two generation modes: text->video and…
github.com/NVIDIA/Cosmos
Cosmos is a developer-first platform designed to help physical AI builders accelerate their development. It has pre-trained world foundation models (diffusion & autoregressive) in different sizes and video tokenizers. They are open models with permissive…
Everything you love about generative models — now powered by real physics!
Announcing the Genesis project — after a 24-month large-scale research collaboration involving over 20 research labs — a generative physics engine able to generate 4D dynamical worlds powered by a physics…
How to build an AI system that can generate 3D worlds from a single image?
All you need is the **RIGHT** data!
By training on over a million diverse 360 videos, our diffusion model can generate realistic visualizations of the world from arbitrary views.
See more results below!
We’ve been busy building an AI system to generate 3D worlds from a single image. Check out some early results on our site, where you can interact with our scenes directly in the browser!
worldlabs.ai/blog
1/n
325K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
15K Followers 216 FollowingAssociate Professor @ SFU (Research Chair), Research Scientist @ Google DeepMind, Associate Professor (status only) @ UofT. Opinions are my own.
2K Followers 1K FollowingSenior research scientist at @NVIDIAAI working on 3D representations for geometric deep learning. PhD in ML, Vision, and Graphics from NYU. Opinions are my own.
8K Followers 878 FollowingAssistant Professor @Cambridge_Eng, working on 3D computer vision and inverse graphics, previously postdoc @StanfordSVL, PhD @Oxford_VGG
16K Followers 308 FollowingTeaching AI to see, model, and interact with our 3D world. Assistant Professor @ MIT, leading the Scene Representation Group (https://t.co/h5gvhLYrtw).
19K Followers 3K FollowingFrom SLAM to Spatial AI; Professor of Robot Vision, Imperial College London; Director of the Dyson Robotics Lab; Co-Founder of Slamcore. FREng, FRS.
3K Followers 675 FollowingPrincipal Research Scientist at @NVIDIA | Former Physicist | Deep Generative Learning | Flows and Diffusion | Proteins and Molecules
Opinions are my own.
19K Followers 466 FollowingAssociate Professor @UTCompSci | Director @NVIDIAAI Co-Leading GEAR | CS PhD @Stanford | Building generalist robot autonomy in the wild | Opinions are my own
538 Followers 6K FollowingTenure-Track Assistant Professor at University of Alabama at Birmingham. Previous: Indiana State University, UC San Diego. PhD from University of Chicago.
72K Followers 2K FollowingThe future is being written in atoms and algorithms. My role is to help ensure we're reading that story accurately & positioning ourselves wisely. Quantum Nerd.
83 Followers 3K FollowingWhy look anywhere else when you can get top-quality services and products directly from me? Whether it's for personal use or business, I’ve got everything cover
949K Followers 764 FollowingProfessor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
84K Followers 702 FollowingDirector, Max Planck Institute for Intelligent Systems (@MPI_IS). Chief Scientist @meshcapade. Building 3D digital humans using vision, graphics, and learning.
325K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
93K Followers 492 FollowingDistinguished Scientist at Google. Computational Imaging, Machine Learning, and Vision. Tweets = personal opinions. May change or disappear over time.
1.2M Followers 279 FollowingWe’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
15K Followers 216 FollowingAssociate Professor @ SFU (Research Chair), Research Scientist @ Google DeepMind, Associate Professor (status only) @ UofT. Opinions are my own.
37K Followers 483 FollowingDigital Geometer, Assoc. Prof. of Computer Science & Robotics @CarnegieMellon @SCSatCMU and member of the @GeomCollective. There are four lights.
182K Followers 63 FollowingBuilding new freedoms of imagination for the world through pioneering research and design. Try Dream Machine for free → https://t.co/LmWmA4H803
451K Followers 77 FollowingTensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundation
488K Followers 146 FollowingNobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
28K Followers 1K FollowingResearch at @GoogleDeepMind. Controllable World Simulators (GNNs, Structured World Models, Neural Assets). Veo Team (Ingredients to Video Co-Lead)
9K Followers 874 FollowingAssistant Professor at University of Washington. I like robots, and reinforcement learning. Previously: post-doc at MIT, PhD at Berkeley
117K Followers 376 FollowingNVIDIA Robotics inspires visionaries and developers to create the next generation of AI-driven robots and explore the world of physical AI.
3K Followers 6K FollowingLLM for code and reasoning. PhD student at Cornell. Previously Student Researcher at @google. Previously intern at @theteamatx.
77K Followers 2K Followinga combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign