🧵 Your DiT, faster
Introducing ECAD: we reframe diffusion model caching as multi-objective optimization and evolve Pareto-optimal schedules via a genetic algorithm—achieving 4.47 FID gain at 2.58× speedup, with no retraining or tuning.
🔗 aniaggarwal.github.io/ecad#MachineLearning
🌟 CoLLM: A Large Language Model for Composed Image Retrieval (CVPR 2025)
✨A cutting-edge training paradigm using image-caption pairs
📊High-quality synthetic triplets for training & benchmarking
🔗Project: collm-cvpr25.github.io
📄Paper: arxiv.org/abs/2503.19910#LLM#CIR
How can we make Imitation Leaning generalize?
In my latest work we show that a key point based representation can generalize to novel instances of an object and is agnostic to background changes.
Excited to share "SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining" is accepted to #ICCV2023!
We look into training with sparse labels for object detection. w/ @rssaketh, Rama Chellappa and @abhi2610.
Website: cs.umd.edu/~sakshams/Spar…
11K Followers 6K FollowingRe-energising Your Career is an upbeat, accessible Career Development Hub. It's your specially designed personal Pocket Career Coach.
176 Followers 4K Following“But to you who are listening I say: Love your enemies, do good to those who hate you, bless those who curse you, pray for those who mistreat you🙏🏾😇❤️
124 Followers 1K FollowingOfficial journal of China Society of Image and Graphics (CSIG). The jouarnl is published by Springer, sponsored by CSIG. E-ISSN 2731-9008.
3K Followers 7K FollowingWhy choose us:
We are a leading investment/asset management firm providing premium investment services to investors; Both individuals and companies.
303 Followers 295 FollowingPh.D. in CS at University of Maryland, College Park | Ex- Adobe Research, NVIDIA, Cisco | Speech, Audio and Language Processing Researcher
1K Followers 745 FollowingProduct Operations Engineer at AIMonk Labs || Optimizing AI Systems & Driving Operational Excellence ||Sharing Insights on AI and Robotics
5K Followers 147 FollowingRerun is an open-source SDK for visualizing streams of multimodal data.
⭐ GitHub https://t.co/yf1KZN7DBI
👾 Discord https://t.co/7PIlvsZO9n
3K Followers 609 FollowingAssistant Prof @sbucompsc @stonybrooku
Researcher → @SFResearch
Interests : Human Centered AI / Future of Work / AI & Creativity
Formerly @ColumbiaCompSci
5K Followers 445 FollowingJohn E. Savage Assistant Professor @BrownCSDept. 3D computer vision/AI @BrownVisualComp, Previously: @StanfordAILab, MPI Informatics, @MSFTResearch.
20K Followers 5 FollowingImpossible? Let’s see. From algorithms to neuroscience to AI, Google Research strives to progress science, advance society & improve billions of people’s lives.
1.3M Followers 649 FollowingTrack air traffic in real time from all around the world!
Apps: https://t.co/AnZhJUIrBg | FAQ: https://t.co/WkTgAaePHs | Support: https://t.co/BomORktp7R
2K Followers 546 FollowingSenior Research Scientist @allen_ai (Ai2) | Developing the science and art of multimodal AI agents | Prev. CS PhD, UIUC and EE UG, IIT Kanpur
727 Followers 532 FollowingGeometric Algorithms for Modeling, Motion, and Animation research group: UNC Chapel Hill (1992-2018); University of Maryland, College Park (2018 onwards)
303 Followers 295 FollowingPh.D. in CS at University of Maryland, College Park | Ex- Adobe Research, NVIDIA, Cisco | Speech, Audio and Language Processing Researcher