Tomorrow, we are presenting “Model Immunization from a Condition Number Perspective” at ICML:
📢Oral: Jul 17, 1:45–2:00 p.m. EDT @ West Exhib. Hall C
📌Poster: 2:00–4:30 p.m. EDT @ East Exhib. Hall A-B (E-1604)
Come talk to Cedar and learn more about reducing model misuse!
Vision-Language Models (VLMs) can describe the environment, but can they refer within it? Our findings reveal a critical gap: VLMs fall short of pragmatic optimality.
We identify 3 key failures of pragmatic competence in referring expression generation with VLMs: (1) cannot…
Our paper "Group Downsampling with Equivariant Anti-aliasing" will be presented at #ICLR2025 🎉!
We propose a novel subgroup sampling layer connecting Cayley graphs, uniform subgroup subsampling, and anti-aliasing—boosting equivariant models' efficiency with minimal compute.
I received a review like this five years ago. It’s probably the right time now to share it with everyone who wrote or got random discouraging reviews from ICML/ACL.
Excited to present our work, CoDA-NO, on multi-physics systems! Join us at the first poster session of #NeurIPS2024 on Wednesday, December 11, in the East Exhibit Hall, Poster #4102. See you there!
Excited to present our work, CoDA-NO, on multi-physics systems! Join us at the first poster session of #NeurIPS2024 on Wednesday, December 11, in the East Exhibit Hall, Poster #4102. See you there!
Introducing Sora, our text-to-video model.
Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions.
openai.com/sora
Prompt: “Beautiful, snowy…
What should the right representation for robotic manipulation be?
Enter D^3Fields: a 3D, dynamic, and semantic representation using foundation models WITHOUT training for zero-shot generalizable robotic manipulation. Colab is available!
🔗 robopil.github.io/d3fields/
🧵👇
We've released the ScanNet++ data!
Check it out: kaldir.vc.in.tum.de/scannetpp/
280 high-fidelity 3D scenes w/ 1mm geometry, DSLR+iPhone images, semantics
We're currently beta-testing, please bear with us - approval may initially take up to 2 weeks
Test scenes and benchmark to come!
Our new text-to-image model, DALL·E 3, can translate nuanced requests into extremely detailed and accurate images.
Coming soon to ChatGPT Plus & Enterprise, which can help you craft amazing prompts to bring your ideas to life:
openai.com/dall-e-3
539K Followers 17K FollowingThe best from AI community | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future | Silicon Valley robots, holodecks, BCIs, and startups.
264 Followers 3K FollowingCEO & Founder DenArthur Analytics https://t.co/UPoENj17xs
Lover of all things Data and Tech 💻📊
Motorsports fanatic 🏍️
Amateur cyclist 🚲
Lifelong Student
236 Followers 624 FollowingCS PhD student @NTUsg, BEng @sjtu1896, Intern @ Bytedance Seed. Research on 3D Vision and Generative AI. I am on the job market now!
401K Followers 0 FollowingA community supported research lab - exploring new mediums of thought and amplifying the imaginative powers of the human species.
236 Followers 624 FollowingCS PhD student @NTUsg, BEng @sjtu1896, Intern @ Bytedance Seed. Research on 3D Vision and Generative AI. I am on the job market now!
224 Followers 306 FollowingAI Researcher at Together AI @togethercompute | alumni of @UMich @CMUEngineering and Xi'an Jiao-Tong University, China
Opinions are my own.
3K Followers 836 FollowingAssistant Professor @UWCheritonCS, @CIFAR_News AI Chair @VectorInst, @ReviewAcl Co-CTO | PhD @TTIC_Connect | Excited about "grounding" in any form
1K Followers 571 FollowingCS PhD Student @Berkeley_EECS; Prev. MS @princeton_nlp, BS @HDSIUCSD; '25 @siebelscholars; I work on multimodal models; He/Him.
4K Followers 271 FollowingCS PhD student @UCBerkeley. Part-time @AnthropicAI. Part-time eater. Prev @Tsinghua_Uni.
Try to understand and control intelligence as a human.