Junru Lin @_Linjunru

CS undergrad @UofT. junrul.github.io Joined August 2022

Tweets: 28
Followers: 31
Following: 234
Likes: 21

Jia-Bin Huang @jbhuang0604

a month ago

Woohoo! Imagine, Verify, Execute (IVE) is accepted to CoRL 2025! 🎉 Congrats to the incredible @umdcs students Seungjae Lee @JayLEE_0301, Daniel Ekpo (@daniekpo7), Haowen Liu!

Jia-Bin Huang @jbhuang0604

4 months ago

Woohoo! Imagine, Verify, Execute (IVE) is accepted to CoRL 2025! 🎉 Congrats to the incredible @umdcs students Seungjae Lee @JayLEE_0301, Daniel Ekpo (@daniekpo7), Haowen Liu!

3 31 164 44K 95

1 9 58 8K 16

Roman Bachmann @roman__bachmann

2 months ago

We will present FlexTok at #ICML2025 on Tuesday! Drop by to chat with @JRAllardice and me if you're interested in tokenization, flexible ways to encode images, and generative modeling. 📆 Tue, Jul 15, 16:30 PDT 📍 East Exhibition Hall, Poster E-3010 🌐 flextok.epfl.ch

Roman Bachmann @roman__bachmann

7 months ago

6 32 188 58K 88

0 6 24 1K 3

Yunqi (Richard) Gu @richard_yunqigu

5 months ago

Which multimodal LLM should you be using to edit graphics in Blender? Today, we’re releasing our #CVPR2025 Highlight🌟 work, #BlenderGym 🏋️‍♀️, the first agentic 3D graphics editing benchmark that will tell you exactly how multimodal LLMs compare in their Blender-editing skills.…

8 46 83 22K 38

Hansheng Chen @HanshengCh

5 months ago

Excited to share our work: Gaussian Mixture Flow Matching Models (GMFlow) github.com/lakonik/gmflow GMFlow generalizes diffusion models by predicting Gaussian mixture denoising distributions, enabling precise few-step sampling and high-quality generation.

1 32 124 11K 54

Roman Bachmann @roman__bachmann

5 months ago

Happy to share that we released FlexTok code and models on github.com/apple/ml-flext…. Try them with our interactive @huggingface demo on huggingface.co/spaces/EPFL-VI…

Afshin Dehghan @afshin_dn

5 months ago

Happy to share that we released FlexTok code and models on github.com/apple/ml-flext…. Try them with our interactive @huggingface demo on huggingface.co/spaces/EPFL-VI…

0 7 37 7K 15

0 15 74 13K 26

Ian Huang @IanHuang3D

5 months ago

🏡Building realistic 3D scenes just got smarter! Introducing our #CVPR2025 work, 🔥FirePlace, a framework that enables Multimodal LLMs to automatically generate realistic and geometrically valid placements for objects into complex 3D scenes. How does it work?🧵👇

22 99 377 115K 176

Google DeepMind @GoogleDeepMind

6 months ago

Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖 Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-roboti…

174 471 2K 631K 545

Congyue Deng @CongyueD

6 months ago

In the past, we extended the convolution operator to go from low-level image processing to high-level visual reasoning. Can we also extend physical operators for more high-level physical reasoning? Introducing the Denoising Hamiltonian Network (DHN): arxiv.org/pdf/2503.07596

7 59 320 41K 161

Koichi Namekata @Koichi_N_

8 months ago

Thrilled to announce that SG-I2V has been accepted at #ICLR2025 ! Huge thanks to the collaborators, reviewers, and ACs. Looking forward to presenting this in Singapore!

Koichi Namekata @Koichi_N_

10 months ago

Thrilled to announce that SG-I2V has been accepted at #ICLR2025 ! Huge thanks to the collaborators, reviewers, and ACs. Looking forward to presenting this in Singapore!

3 13 103 40K 42

4 9 42 5K 5

U of T Department of Computer Science @UofTCompSci

8 months ago

Congratulations to @UofTCompSci undergrads Helen Li, Junru Lin, Leo Tenenbaum and Sarah Walker who have received honourable mentions in the @CRAtweets 2024-2025 Outstanding Undergraduate Researcher Award program! cra.org/about/awards/o…

1 2 6 575 0

Jiaman Li @jiaman01

9 months ago

🔥 Introducing MVLift: Generate realistic 3D motion without any 3D training data - just using 2D poses from monocular videos! Applicable to human motion, human-object interaction & animal motion. Joint work w/ @jiajunwu_cs & Karen 💡 How? We reformulate 3D motion estimation as…

2 39 216 15K 95

Felix Taubner @taubnerfelix

9 months ago

Introducing 🧢CAP4D🧢 CAP4D turns any number of reference images (single, few, and many) into controllable real-time 4D avatars. 🧵⬇️ Website: felixtaubner.github.io/cap4d/ Paper: arxiv.org/abs/2412.12093

13 100 576 75K 498

Nakayama George @GeorgeNaka40190

9 months ago

Do large multimodal models understand how to make dresses for your winter holiday party💃? We introduce AIpparel, a vision-language-garment model capable of generating and editing simulation-ready sewing patterns from text and images. Project page at georgenakayama.github.io/AIpparel/.…

1 19 68 12K 33

Yue Wang @yuewang314

9 months ago

[Hiring!] I am hiring multiple PhDs @CSatUSC @USCViterbi for this cycle. If you're interested in scene representations, neural simulation, generative AI, and robotics, feel free to mention my name in your application (no need to email). For USC masters/undergrads who're…

1 49 272 32K 126

Shengqu Cai @prime_cai

9 months ago

Sharing something exciting we've been working on as a Thanksgiving gift: Diffusion Self-Distillation (DSD), which redefines zero-shot customized image generation using FLUX. DSD is like DreamBooth, but zero-shot/training-free. It works across any input subject and desired…

24 73 470 60K 267

Sherwin Bahmani @sherwinbahmani

9 months ago

📢 Excited to share our new work: AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers snap-research.github.io/ac3d We analyze what pre-trained video diffusion transformers understand about 3D and demonstrate dynamic scene generation with 3D control.

6 24 119 15K 18

Igor Gilitschenski @igilitschenski

9 months ago

I'm recruiting graduate students for Fall 2025 to work at the intersection of Computer Vision, Deep Learning, and Robotics. If you are interested in building a controllable organic simulation engine and enabling safe robot learning, consider applying to UofT's CS PhD program 1/n

12 83 439 49K 203

Songyou Peng @songyoupeng

11 months ago

Check out our new paper on a feed-forward 3DGS model for large scenes! The code is also available.

1 6 84 8K 19

Jihyeon Je @JihyeonJe

11 months ago

Symmetries are everywhere — from butterfly’s wings to Greek temples. But detecting them in noisy data? That’s a challenge. 🦋🏛 Our #SIGGRAPHAsia2024 paper, Robust Symmetry Detection via Riemannian Langevin Dynamics, tackles this: symmetry-langevin.github.io 🧵(1/n)