[1/N] 🎥 We've made available a powerful spatial AI tool named ViPE: Video Pose Engine, to recover camera motion, intrinsics, and dense metric depth from casual videos!
Running at 3–5 FPS, ViPE handles cinematic shots, dashcams, and even 360° panoramas.
🔗 research.nvidia.com/labs/toronto-a…
We are excited to share Cosmos-Drive-Dreams 🚀
A bold new synthetic data generation (SDG) pipeline powered by world foundation models—designed to synthesize rich, challenging driving scenarios at scale.
Models, Code, Dataset, Tookit are released.
Website:…
🚀Excited to introduce GEN3C #CVPR2025, a generative video model with an explicit 3D cache for precise camera control.
🎥It applies to multiple use cases, including single-view and sparse-view NVS🖼️ and challenging settings like monocular dynamic NVS and driving simulation🚗.…
Reward models that help real robots learn new tasks—no new demos needed!
ReWiND uses language-guided rewards to train bimanual arms on OOD tasks in 1 hour!
Offline-to-online, lang-conditioned, visual RL on action-chunked transformers.
🧵
Check our Physgen3D which extends Physgen () to 3D. Try the deflate demo below 👇👇👇 Achieved by our amazing intern @boyuanchen21 and collaborators @jiang_hanxiao, Saurabh, @YunzhuLiYZ Prof. Zhao and @ShenlongWang
Check our Physgen3D which extends Physgen () to 3D. Try the deflate demo below 👇👇👇 Achieved by our amazing intern @boyuanchen21 and collaborators @jiang_hanxiao, Saurabh, @YunzhuLiYZ Prof. Zhao and @ShenlongWang https://t.co/pxxWqmcpvW
Stop by our poster #217 tmr 10:30 if you are at #ECCV2024, Prof @ShenlongWang and Prof @_saurabhg will present tmr. This is how Shenlong did toy experiments at home🤣
Stop by our poster #217 tmr 10:30 if you are at #ECCV2024, Prof @ShenlongWang and Prof @_saurabhg will present tmr. This is how Shenlong did toy experiments at home🤣 https://t.co/Ld4Caat2f4
@_akhaliq The paper presents a novel image-to-video generation method called PhysGen that can convert a single image into a realistic, physically plausible, and temporally consistent video. The key idea is to integrate a model-based physical simulation with a data-driven video generation…
Thank you AK @_akhaliq for featuring our work. Come and visit our stevenlsw.github.io/physgen/ to play the interactive demos! Don't miss our Wednesday morning poster session at #217 if you are at #ECCV2024
Thank you AK @_akhaliq for featuring our work. Come and visit our stevenlsw.github.io/physgen/ to play the interactive demos! Don't miss our Wednesday morning poster session at #217 if you are at #ECCV2024
Introducing: Opening Cabinets and Drawers in the Real World using a Commodity Mobile Manipulator
We develop a system to open unseen cabinets and drawers *zero-shot* from novel environments using the Stretch RE2: arjung128.github.io/opening-cabine…
325K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
729 Followers 460 FollowingResearch scientist @NvidiaAI animation group. I obtained my PhD from University of Toronto @UofT, Vector Institute @VectorInst 😃
84 Followers 3K FollowingWhy look anywhere else when you can get top-quality services and products directly from me? Whether it's for personal use or business, I’ve got everything cover
29 Followers 286 FollowingUndergraduate researcher at Tsinghua University, focusing on World Models and Robotics. Current intern @StanfordSVL. My site: https://t.co/R9bicUDDdk
325K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
949K Followers 764 FollowingProfessor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
84K Followers 702 FollowingDirector, Max Planck Institute for Intelligent Systems (@MPI_IS). Chief Scientist @meshcapade. Building 3D digital humans using vision, graphics, and learning.
19K Followers 465 FollowingAssociate Professor @UTCompSci | Director @NVIDIAAI Co-Leading GEAR | CS PhD @Stanford | Building generalist robot autonomy in the wild | Opinions are my own
8K Followers 878 FollowingAssistant Professor @Cambridge_Eng, working on 3D computer vision and inverse graphics, previously postdoc @StanfordSVL, PhD @Oxford_VGG
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
1.2M Followers 279 FollowingWe’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
2K Followers 2K FollowingPh.D. Student @PrincetonCS. Prev @Stanford @UW @pika_labs @MSFTResearch @UofIllinois @ZJU_China. I used to work on computer vision, but it's not all I do.
1K Followers 745 FollowingProduct Operations Engineer at AIMonk Labs || Optimizing AI Systems & Driving Operational Excellence ||Sharing Insights on AI and Robotics
14K Followers 519 FollowingYour guide to radiance fields | Host of the podcast @ViewDependent | DM open for business inquiries | https://t.co/llYGWliKUv | discord: https://t.co/lrl64WGvlD
11K Followers 63 FollowingOfficial account for the IEEE/CVF International Conference on Computer Vision. #ICCV2025 Honolulu 🇺🇸 Hosted by @natanielruizg @anfurnari @YVinker @CSProfKGD
1K Followers 708 FollowingResearch Scientist at @NVIDIA | PhD from SJTU @sjtu1896 | Interested in 3D Computer Vision, Human Digitization | Views are my own