Despite much progress in AI, the ability of AI to 'smell' like humans remains elusive. Smell AIs 🤖👃 can be used for allergen sensing (e.g., peanuts or gluten in food), hormone detection for health, safety & environmental monitoring, quality control in manufacturing, and more.…
Hello! If you are interested in dynamic 3D or 4D, don't miss the oral session 3A at 9 am on Saturday:
@zhengqi_li will be presenting "MegaSaM"
I'll be presenting "Stereo4D"
and @QianqianWang5 will be presenting "CUT3R"
Excited to share our CVPR 2025 paper on cross-modal space-time correspondence!
We present a method to match pixels across different modalities (RGB-Depth, RGB-Thermal, Photo-Sketch, and cross-style images) — trained entirely using unpaired data and self-supervision.
Our…
Can AI image detectors keep up with new fakes?
Mostly, no. Existing detectors are trained using a handful of models. But there are thousands in the wild!
Our work, Community Forensics, uses 4800+ generators to train detectors that generalize to new fakes.
#CVPR2025 🧵 (1/5)
Hello! If you like pretty images and videos and want a rec for a CVPR oral session, you should def go to Image/Video Gen, Friday at 9am:
I'll be presenting "Motion Prompting", @RyanBurgert will be presenting "Go with the Flow", and @ChangPasca1650 will be presenting "LookingGlass"
Ever wish YouTube had 3D labels?
🚀Introducing🎥DynPose-100K🎥, an Internet-scale collection of diverse videos annotated with camera pose!
Applications include camera-controlled video generation🤩and learned dynamic pose estimation😯
Download: huggingface.co/datasets/nvidi…
🧩#CVPR2025🌷Introducing Two By Two✌️: The First Large-Scale Daily Pairwise Assembly Dataset with SE(3)-Equivariant Pose Estimation.
🤖2BY2 helps robots master daily 3D assembly tasks—like plugging sockets or arranging flowers—across diverse objects!
🐨Co-led by @yuqi_Beijing
I’m on the PhD internship market for Spr/Summer 2025! I have experience in multimodal AI (EHR, X-ray, text), explainability for image models w/ genAI, clinician-AI interaction (surveyed 700+ doctors), and tabular foundation models. Please reach out if you think there’s a fit!
🍌We present DenseMatcher!
🤖️DenseMatcher enables robots to acquire generalizable skills across diverse object categories from a single demo, by finding correspondences between 3D objects even when they differ in type, shape, and appearance.
What happens when you train a video generation model to be conditioned on motion?
Turns out you can perform "motion prompting," just like you might prompt an LLM! Doing so enables many different capabilities. Here are a few examples; check out this thread 🧵 for more results!
Excited to share MonST3R, a simple way to estimate geometry from unposed videos of dynamic scenes!
We achieve competitive results on several downstream tasks (video depth, camera pose) and believe this is a promising step toward feed-forward 4D reconstruction.
monst3r-project.github.io
Differentiable rendering made SIMPLE❗️
Differentiating physically based renderers is hard: Dirac-delta discontinuities arise at object silhouettes. Our #SIGGRAPHAsia2024 work shows how a simple relaxation can save the day, enabling easy 3D reconstruction and relighting! (1/N)
We present Global Matching Random Walks, a simple self-supervised approach to the Tracking Any Point (TAP) problem, accepted to #ECCV2024. We train a global matching transformer to find cycle-consistent tracks through video via contrastive random walks (CRW).
46 Followers 1K Following
A data set, data annotation sales, selling high-quality annotation solutions similar to AI for science/autonomous driving/lean4 data topics.
19K Followers 8K Following
On the quest to understand the fundamental mathematics of intelligence and of the universe with curiosity. https://t.co/mMchI2d4pg Upskilling @StanfordOnline
8K Followers 878 Following
Assistant Professor @Cambridge_Eng, working on 3D computer vision and inverse graphics, previously postdoc @StanfordSVL, PhD @Oxford_VGG
17K Followers 6K Following
Neurodivergent physics student with a keen interest in multisensory integration and emergent perception. Exploring research on a proposed ‘sixth sense’. Δ
99 Followers 94 Following
I am a Research Scientist & Lead@Meta Reality Labs Research, pushing R&D to blur the boundary between virtual and real world. Opinions are my own!
7K Followers 439 Following
Building a general-purpose robot that is more effective than a humanoid. Cofounder of @viktor_vrp. Backed by @fdotinc. Seed soon.
2K Followers 11 Following
professor @GeorgiaTech | UIUC ‖ engineer–researcher building next-generation high-performance, multimodal, and creative AI systems
594 Followers 6 Following
Sharpa is a robotics company dedicated to developing ultra-high performance robots and core components, unlocking the limitless possibilities of future.
4K Followers 506 Following
Research Scientist at @Meta SuperIntelligence Lab; ex-MSR; Core contributor of Project Florence, Phi-3V, Omniparser; Inventor of FocalNet, SEEM, SoM and Magma.
236 Followers 624 Following
CS PhD student @NTUsg, BEng @sjtu1896, Intern @ Bytedance Seed. Research on 3D Vision and Generative AI. I am on the job market now!
85K Followers 306 Following
High performance civilian robot manufacturer.
Please everyone be sure to use the robot in a Friendly and Safe manner.
https://t.co/hI6LafokVm