-
Tweets21
-
Followers188
-
Following292
-
Likes33
We are grateful to be awarded an oral presentation -- please come by Wed 10/2 at 1:30pm (I believe we are the first talk in the oral session) as well as the poster session afterward (number 156) at 4:30pm! #ECCV2024 🎉
We are grateful to be awarded an oral presentation -- please come by Wed 10/2 at 1:30pm (I believe we are the first talk in the oral session) as well as the poster session afterward (number 156) at 4:30pm! #ECCV2024 🎉
Visit our webpage at gcd.cs.columbia.edu for many more results! Datasets, code, and pretrained models coming soon. Many thanks to my amazing collaborators: @ChrisWu6080, @EgeOzguroglu, @KyleSargentAI, @ruoshi_liu, @ptokmakov, @achalddave, Changxi Zheng, @cvondrick 🧵(5/5)
Apart from robotics and related scenes, it also works quite well on driving scenarios! In general, we believe our framework can help unlock powerful applications in rich dynamic scene understanding, perception for embodied AI, and interactive 3D video viewing. 🧵(4/5)
Although out-of-distribution generalization is highly challenging, we show promising zero-shot results on real-world examples. In particular, our model exhibits object permanence capabilities, which can be observed by shifting the virtual camera upward in this video. 🧵(3/5)
GCD works by equipping a video-to-video generative model with camera perspective controls. We condition the Stable Video Diffusion architecture on an input video along with a relative pose, such that it can be chosen, and finetune it on paired synthetic multi-view data. 🧵(2/5)
Feel free to come by our poster (number 104 in Nord room) Thursday morning starting at 10:30am! 😃
Check out our #zero123 live demo here! huggingface.co/spaces/cvlab/z… Big shoutout to collaborators: @ruoshi_liu (first author), @ChrisWu6080, @ptokmakov, @ZakharovSergeyN, and @cvondrick. Hope to see you all next week at ICCV! ;-) 🧵(4/4)
Specifically, we finetune Stable Diffusion, which already has useful 2D image priors thanks to being trained on billion-scale data. This pipeline allows us to successfully achieve strong zero-shot performance on objects with complex geometry and artistic styles. 🧵(3/n)
We leverage a recently released large-scale dataset of 3D objects, called Objaverse, from which we render images with random perspectives. We then train an image-to-image translation network with the task of converting one viewpoint to another. 🧵(2/n)
Happy to share our #ICCV2023 paper on 3D reconstruction from a single image! In Zero-1-to-3, we teach diffusion models to control the camera viewpoint, which enables novel view synthesis applications. Website: zero123.cs.columbia.edu Paper: arxiv.org/abs/2303.11328 🧵(1/n)
After some polishing, the code has been published on GitHub ;-) github.com/basilevh/tcow
After some polishing, the code has been published on GitHub ;-) github.com/basilevh/tcow
P.S. Also check out our earlier related work on Revealing Occlusions with 4D Neural Fields (arxiv.org/abs/2204.10916)! This paper is essentially about video-to-4D generation, but requires depth input. On the other hand, we demonstrate that TCOW works in the wild too. 🧵 (7/7)
Visit our project webpage at tcow.cs.columbia.edu for many more results, as well as links to the datasets, code, and pretrained models! Joint work with @ptokmakov, Simon Stent, Jie Li, and @cvondrick. 🧵 (6/n)
Still, since object permanence remains far from solved, we release our benchmarks and invite the research community to continue working on this intriguing problem. 🧵 (5/n)
However, TCOW shines when it comes to handling total occlusion and/or containment, which are highly challenging scenarios that require advanced spatiotemporal reasoning skills. Cup shuffling games are especially tricky, yet we seem to be beginning to tackle them. 🧵 (4/n)
Despite being trained only on synthetic data (using the Kubric simulator), TCOW performs quite well in complex real-world scenes. For example, see the rhino below which maintains its nose and horns (i.e. does amodal completion) throughout the partial occlusion. 🧵 (3/n)
Our framework is capable of distinguishing containment from occlusion events by predicting different segmentation masks for each of them, as visualized in the video above. 🧵 (2/n)
Excited to share our #CVPR2023 paper on tracking with object permanence in video! In TCOW, we propose both a model and a dataset for localizing objects regardless of their visibility. Website: tcow.cs.columbia.edu Paper: arxiv.org/abs/2305.03052 🧵 (1/n)

MaudWollaston @OBTyI22bXZaM8j
0 Followers 284 Following
James Emilian @JamesEmilian2
122 Followers 1K Following PhD @CMU_Robotics | DARPA Triage & visual reasoning @AirLabCMU | Prev: @cmu_bme, @fastdotai, mech design @Pentair
Paul mendy ✝️ @Paulmendy545860
127 Followers 7K Following Those who are happiest are those who do the most for others. Help others without any reason and give without the expectation of receiving anything in return🙏🏿
David @DavidSHolz
92K Followers 8K Following founder @midjourney, prev founder leap motion, nasa, max planck - random vibeposting @davidvibesonly
Zhenjun Zhao @zhenjun_zhao
6K Followers 1K Following PhD from @CUHKofficial. 3D vision, SLAM, SfM, Image Matching (https://t.co/ek376Drwvu).
Victor Lecomte @vclecomte
661 Followers 197 Following CS PhD student at Stanford / Researcher at the Alignment Research Center
Junyi Zhang @junyi42
1K Followers 490 Following CS Ph.D. Student @Berkeley_AI. B.Eng. @SJTU1896 CS. Working with @GoogleDeepMind, previous @MSFTResearch. Vision, generative model, representation learning.
James_jefferson @Jamesjeffe26532
43 Followers 366 Following The future is yours, shape it with us Where wisdom and power unite Join the movement, reveal the truth Guiding humanity towards a brighter future A ILLUMINATI
Amy @amy_holland4
363 Followers 3K Following
Ethan Weber @ethanjohnweber
844 Followers 570 Following Incoming Research Scientist at Meta Reality Labs | Final-year PhD at UC Berkeley | MIT EECS BS '20 & MEng '21 | CV for AR/VR & robotics | https://t.co/YhPzCHLKfQ
Ian Huang @IanHuang3D
502 Followers 108 Following AI PhD @StanfordAILab | Ex-SR @GoogleDeepMind Ex-SR @RealityLabs | Multimodal models for 3D | https://t.co/mqLLezoGAp
Jarne Van den Herrewe... @den_herrewegen
78 Followers 688 Following ML Engineer at Datameister | PhD candidate at Ghent University - imec Self-supervised learning in 3D LinkedIn: https://t.co/NrUfyVpXjE…
Suellees @SuelleestX_
23 Followers 2K Following
Anisha Pal @AnishaPal07
60 Followers 258 Following ML @ PlanetteAI | MSCS @GeorgiaTech | Computer Vision | Climate Modeling
Sangeeth @tweet2sangeeth
0 Followers 101 Following
Zhefei Gong @zhefeigong
141 Followers 2K Following Robot Learning | Looking for 2026 Spring/Fall Phd Opportunities
Ivan Skorokhodov @isskoro
3K Followers 491 Following Research Scientist @Snap. I like neural networks and neural networks like me.
Data Cube @datacubeny
29 Followers 766 Following
Chao-Yuan Wu @chaoyuaw
1K Followers 986 Following Building spatial intelligence at World Labs (@theworldlabs)
Vishal Choudhari @infinivishal
159 Followers 551 Following EE Ph.D. Student at Columbia University | Neural Engineering for Speech and Hearing
Tony Chen @tonychenxyz
597 Followers 1K Following Next: CS PhD @princetonCS. Prev: Undergrad @columbia. Current: Inference research intern @togethercompute.
Andrew Davison @AjdDavison
19K Followers 3K Following From SLAM to Spatial AI; Professor of Robot Vision, Imperial College London; Director of the Dyson Robotics Lab; Co-Founder of Slamcore. FREng, FRS.
Scott Reed @scott_e_reed
17K Followers 575 Following Research Scientist at NVIDIA working on generalist embodied agent research
Bardienus Duisterhof @BDuisterhof
764 Followers 910 Following PhD Student @CMU_Robotics with @jeff_ichnowski || DUSt3R Research Intern @naverlabseurope || 4D Vision for Robot Manipulation 📷 | @bardienus.bsky.social | 🇳🇱
Haoyu Xiong @Haoyu_Xiong_
3K Followers 2K Following PhD student @MIT_CSAIL | Prev @Stanford @CMU_Robotics #Robot_Learning
Yu Chi @YuChi__26
188 Followers 1K Following PhD student @TUM @niessnerlab. Generative Modelling, Neural Rendering.
Sukrut Rao @sukrutrao
451 Followers 1K Following PhD Student @cvml_mpiinf at the Max Planck Institute for Informatics, @SIC_Saar. Member of @neuroexplicit. Explainability in Computer Vision. @cse_iith alumnus.
Catherine Li 🍵 @daikonland
389 Followers 1K Following Synthetic rare cats @AdvexAI ✨| ex-Waymo, ex-Twitter | Cheese, art, memes, and machine learning ✨✨| Views are my own | Random posting
CoraHarriman @l054L8MXggUV20
84 Followers 7K Following
Jiapeng Tang @jiapeng_tang
1K Followers 592 Following Ph.D. student at the Visual Computing Group @TU_Muenchen, working on 3D Computer Vision.
Julia @kogikueiko12743
94 Followers 7K Following
ModestyReade @B6Q367KC47z6anr
78 Followers 7K Following
Julia @teruechiho1868
93 Followers 7K Following
Selythath @selythath51191
85 Followers 5K Following
Lennart Schulze @lenn_artschulze
26 Followers 197 Following PhD student in Machine Learning, Computer Vision, and Robotics @ Columbia University. Previously: Research Fellow @ MIT. R&D @ IBM.
Yuanbo Yang @YuanboYang60742
150 Followers 2K Following Master's student @ZJU_China | Exploring 3D Vision & Generative Models 🌐
Tammy @ikukawamin12441
34 Followers 3K Following
Abhinav Shrivastava @abhi2610
1K Followers 897 Following Associate Professor, University of Maryland, College Park
Xindi Wu @cindy_x_wu
4K Followers 1K Following PhD student @PrincetonCS | Interning @nvidia | Data-centric multimodal ml | prev @roboVisionCMU @CMU_Robotics | @RealityLabs @Snapchat | 🏎️
Lu Ling @LuLing26466911
289 Followers 457 Following @NVIDIA research intern丨PhD @PurdueCS丨#AI 丨#ComputerVision丨Agentic AI丨4D/3D GenAI丨 Multimodals
Alexander Hermans @Pandoro_o
115 Followers 255 Following Postdoc at the @RWTHVisionLab, as well as the @laim_uka
Mohamed El Banani @_mbanani
837 Followers 952 Following MTS @theworldlabs. Prev: @UMichCSE, @GoogleAI, @MetaAI, @GeorgiaTech. I am interested in computer vision, machine learning, and cognitive science. 🇪🇬
Ben Mildenhall @BenMildenhall
8K Followers 2K Following making stuff 3D at @theworldlabs. co-creator of NeRF and dreamfusion.
손수원 @sonsuwon4
0 Followers 11 Following
Zhou Xian @zhou_xian_
15K Followers 139 Following PhD student in robotics & AI @CMU_robotics. (Occasionally) landscape photographer.
Junyi Zhang @junyi42
1K Followers 490 Following CS Ph.D. Student @Berkeley_AI. B.Eng. @SJTU1896 CS. Working with @GoogleDeepMind, previous @MSFTResearch. Vision, generative model, representation learning.
World Labs @theworldlabs
25K Followers 33 Following World Labs is a spatial intelligence company building Large World Models to perceive, generate, and interact with the 3D world.
Ian Huang @IanHuang3D
502 Followers 108 Following AI PhD @StanfordAILab | Ex-SR @GoogleDeepMind Ex-SR @RealityLabs | Multimodal models for 3D | https://t.co/mqLLezoGAp
Ivan Skorokhodov @isskoro
3K Followers 491 Following Research Scientist @Snap. I like neural networks and neural networks like me.
Chao-Yuan Wu @chaoyuaw
1K Followers 986 Following Building spatial intelligence at World Labs (@theworldlabs)
Matthieu Meeus @matthieu_meeus
227 Followers 556 Following PhD student @ImperialCollege Privacy/Safety + AI https://t.co/UBo5kgRqbU
Tony Chen @tonychenxyz
597 Followers 1K Following Next: CS PhD @princetonCS. Prev: Undergrad @columbia. Current: Inference research intern @togethercompute.
Yuval Noah Harari @harari_yuval
641K Followers 164 Following Historian and bestselling author of 'Sapiens', 'Homo Deus', '21 Lessons for the 21st Century', 'Nexus', 'Unstoppable Us' and 'Sapiens: A Graphic History'.
Kamala Harris @KamalaHarris
20.8M Followers 702 Following Always fighting for the people. Wife, Momala, Auntie. She/her. 107 Days available for pre-order now.
Tesla AI @Tesla_AI
403K Followers 18 Following
Deepak Pathak @pathak2206
23K Followers 380 Following Co-Founder & CEO @SkildAI, Faculty @CarnegieMellon. PhD @UCBerkeley. I study topics in AI (machine learning, robotics & computer vision).
Oliver Cameron @olivercameron
46K Followers 502 Following Building superimagination at @odysseyml. Investor in 100+ AI startups. Previously made cars drive themselves.
Dhruv Batra @DhruvBatraDB
19K Followers 608 Following Co-founder & Chief Scientist @yutori_ai. Prev: Senior Director leading FAIR Embodied AI @MetaAI and Professor @GeorgiaTech.
Russ Tedrake @RussTedrake
2K Followers 82 Following Professor at MIT, studying robotics. Vice President of Robotics Research, Toyota Research Institute.
Aaron Hertzmann @AaronHertzmann
3K Followers 472 Following Tweets express my own opinions, and not of institutions I'm affiliated with. he/him.
Lennart Schulze @lenn_artschulze
26 Followers 197 Following PhD student in Machine Learning, Computer Vision, and Robotics @ Columbia University. Previously: Research Fellow @ MIT. R&D @ IBM.
Xindi Wu @cindy_x_wu
4K Followers 1K Following PhD student @PrincetonCS | Interning @nvidia | Data-centric multimodal ml | prev @roboVisionCMU @CMU_Robotics | @RealityLabs @Snapchat | 🏎️
Abhinav Shrivastava @abhi2610
1K Followers 897 Following Associate Professor, University of Maryland, College Park
Thomas Kipf @tkipf
28K Followers 1K Following Research at @GoogleDeepMind. Controllable World Simulators (GNNs, Structured World Models, Neural Assets). Veo Team (Ingredients to Video Co-Lead)
Noah Snavely @Jimantha
9K Followers 843 Following 3D vision fanatic. Professor @cornell_tech & Researcher @GoogleDeepmind. He or they. https://t.co/m7Rs5xUFfG
Walter Scheirer @wjscheirer
3K Followers 723 Following Prof. @NotreDame. IEEE @ComputerSociety PAMI TC Chair. Computer Vision Foundation CTO. Artificial Intelligence + Digital Humanities + History of Technology.
Lu Ling @LuLing26466911
289 Followers 457 Following @NVIDIA research intern丨PhD @PurdueCS丨#AI 丨#ComputerVision丨Agentic AI丨4D/3D GenAI丨 Multimodals
Alexander Kirillov @_alex_kirillov_
8K Followers 364 Following Multimodality @thinkymachines. Previously: post-training MM lead @openai, research Scientist @facebookai Projects: GPT-4o, Advance Voice Mode, SegmentAnything.
Rowan Zellers @rown
14K Followers 974 Following multimodal @thinkymachines. I also like to climb rocks and throw pottery. https://t.co/5Er4j39K71 (he/him)
cvpr_2024 @cvpr2024
364 Followers 0 Following
Mohamed El Banani @_mbanani
837 Followers 952 Following MTS @theworldlabs. Prev: @UMichCSE, @GoogleAI, @MetaAI, @GeorgiaTech. I am interested in computer vision, machine learning, and cognitive science. 🇪🇬
Ethan Weber @ethanjohnweber
844 Followers 570 Following Incoming Research Scientist at Meta Reality Labs | Final-year PhD at UC Berkeley | MIT EECS BS '20 & MEng '21 | CV for AR/VR & robotics | https://t.co/YhPzCHLKfQ
Wayve @wayve_ai
13K Followers 596 Following Wayve is a leading developer of embodied intelligence for autonomous vehicles. We use AI to pioneer a next-generation approach to self-driving: AV2.0.
Alex Kendall @alexgkendall
14K Followers 692 Following CEO at @wayve_ai teaching cars how to drive with machine learning 🇳🇿
Ben Mildenhall @BenMildenhall
8K Followers 2K Following making stuff 3D at @theworldlabs. co-creator of NeRF and dreamfusion.
Kiana Ehsani @ehsanik
4K Followers 596 Following Co-Founder @ Vercept, Ph.D. @uwcse, Interested in computer vision, Agents and AI, Climber on the weekends.
Douglas Lanman @douglaslanman
2K Followers 151 Following Senior Director, Display Systems Research (DSR), Reality Labs Research at Meta
Luma AI @LumaLabsAI
182K Followers 63 Following Building new freedoms of imagination for the world through pioneering research and design. Try Dream Machine for free → https://t.co/LmWmA4H803
Katherine Liu @robo_kat
113 Followers 193 Following Senior Research Scientist @ToyotaResearch, previously Robotics PhD @MIT_CSAIL. Excited about machine learning for embodied intelligence. Opinions my own!
Kosta Derpanis @CSProfKGD
68K Followers 197 Following #CS Assoc Prof @YorkUniversity, #ComputerVision Scientist Samsung #AI, @VectorInst Faculty Affiliate, TPAMI AE, @ELLISforEurope Member #ICCV2025 Publicity Chair