Thanks @_akhaliq for sharing our work!
Aim and Grasp! AimBot introduces a new design to leverage visual cues for robots - similar to scope reticles in shooting games.
Let's equip your VLA models with low-cost visual augmentation for better manipulation!
aimbot-reticle.github.io
Thanks @_akhaliq for sharing our work!
Aim and Grasp! AimBot introduces a new design to leverage visual cues for robots - similar to scope reticles in shooting games.
Let's equip your VLA models with low-cost visual augmentation for better manipulation!
aimbot-reticle.github.io
Introducing Eigent — the first multi-agent workforce on your desktop.
Eigent is a team of AI agents collaborating to complete complex tasks in parallel. It is your long-term working partner with fullly customizable workers and MCPs.
Public beta available to download for MacOS,…
Need help from the #ECMLPKDD2025 community! My demo track paper was accepted, but I can't attend due to visa issues. 😢
Hoping a kind attendee could present it for me. I will provide everything!
If you will attending and can help, please DM me!
Excited to share our #ICML2025 paper, Hierarchical Equivariant Policy via Frame Transfer. Our Frame Transfer interface imposes high-level decision as a coordinate frame change in the low-level, boosting sim performance by 20%+ and enabling complex manipulation with 30 demos.
Owen will be presenting our poster for the paper Hierarchical Equivariant Policy via Frame Transfer at ICML Today (see lnkd.in/e-7p9Viq for details). If you are interested in equivariance and/or robotic manipulation please stop by!
🔥 We introduce Multiverse, a new generative modeling framework for adaptive and lossless parallel generation.
🚀 Multiverse is the first open-source non-AR model to achieve AIME24 and AIME25 scores of 54% and 46%
🌐 Website: multiverse4fm.github.io
🧵 1/n
📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks
arxiv.org/abs/2505.16381
🎉 We’re excited to host two challenges at LOVE: Multimodal Video Agent Workshop at CVPR 2025, advancing the frontier of video-language understanding! @CVPR#CVPR2025
📌 Track 1A: [VDC] Video Detailed Captioning Challenge
Generate rich and structured captions that cover multiple…
🤖 How do AI agents actually work together?
I made 2 short videos on Google’s Agent2Agent (A2A) protocol:
📘 Ep1: What is A2A?
📙 Ep2: Why it matters
No backend needed—just curiosity.
🎥 Watch here: youtube.com/playlist?list=…
Just posted a 21-min tutorial on Model Context Protocol (MCP) — no jargon, just real-life analogies.
🍜 Restaurant menus
🧳 Travel guides
🦸♂️ Superpowers
📝 Memory notes
I wanted to make it clear enough for anyone, even without a tech background.
🎥👇
youtu.be/0EtVAzIYbys?si…
Just realized my paper is being used as a baseline—such a strange feeling! Seeing my model tested across different settings without me doing anything is fascinating. 🤯
Added the papers using ThinkGrasp as a baseline to its GitHub—check them out!🥳
739 Followers 943 FollowingRobotics Scientist at @Amazon. PhD CS from @CSatUSC. RTs are my own paper reading list. Previously at @MSFTResearch and @GoogleDeepMind
416 Followers 596 FollowingFinal Year Undergrad at @Tsinghua_Uni; Previously @CMU_Robotics; Robot Learning and Embodied Agents; Applying for PhD (also job opportunities) at 2026 Fall!
994 Followers 981 FollowingPh.D. @CarnegieMellon. Working on data and hardware-driven principled algorithm & system co-design for scalable and generalizable foundation models. They/Them
447 Followers 6K FollowingGiving meaning to mine share of star dust. Visiting fellow @WinshipAtEmory. Prev at @oracle, @maddox_ai, @KITKarlsruhe, @_nference, @val_iisc, @iitdelhi.
16 Followers 41 FollowingResearch Assistant at NUS. Robot learning and dexterous manipuation. creating true robotic life, pushing the boundaries of what’s possible with machines.
348 Followers 59 FollowingOrby is fundamentally transforming the way enterprise teams perform, giving you the power to delegate tedious tasks to automation.
2K Followers 26 FollowingI post about my DIY robots hardware hobby. Robotics research lead at Mistral AI. Ex-Meta/FAIR, core contributor to Llama 3. ENS PhD. Repeat founder.
5K Followers 147 FollowingRerun is an open-source SDK for visualizing streams of multimodal data.
⭐ GitHub https://t.co/yf1KZN7DBI
👾 Discord https://t.co/7PIlvsZO9n
6K Followers 165 FollowingGazebo is a leader in robot simulation. Maintained by @OpenRoboticsOrg and good friends with @rosorg!
Support: https://t.co/7sIsIXS07i
640 Followers 21 FollowingBuilt by researchers and engineers from MIT, we are pursuing Artificial Efficient Intelligence (AEI). Try GPT-OSS support: https://t.co/BQfsnXIGFo.
667 Followers 117 FollowingI build robots that safely and intelligently interact with each other and with humans. I am a professor at Stanford, where I lead the Multi-robot Systems Lab.