Researchers at ByteDance and @Tsinghua_Uni introduced UI-TARS, a fine-tuned vision-language model that excels at computer use. Built on Qwen2-VL, it uses chain-of-thought reasoning to select the best action to take in desktop and mobile applications. Learn more in The Batch: hubs.la/Q035Q2Mb0
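To make the "chain-of-thought reasoning to select actions" idea concrete, here is a minimal sketch of the perceive-think-act loop such an agent runs. Everything specific is an assumption: the endpoint URL, the model name, the prompt, and the "Thought:/Action:" output parsing are placeholders (models like UI-TARS are often served behind an OpenAI-compatible API, e.g. via vLLM); this is not an official client.

```ts
// Sketch of one step of a chain-of-thought GUI agent (illustrative, not official).
// Assumes an OpenAI-compatible chat endpoint serving the model; URL, model name,
// prompt, and output format below are all placeholders.

type Step = { thought: string; action: string };

async function nextStep(screenshotB64: string, instruction: string): Promise<Step> {
  const res = await fetch("http://localhost:8000/v1/chat/completions", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "ui-tars", // placeholder model name
      messages: [
        {
          role: "user",
          content: [
            // The model sees the task plus the current screen, and is asked to
            // reason before committing to a single action.
            { type: "text", text: `Task: ${instruction}\nThink step by step, then output one action.` },
            { type: "image_url", image_url: { url: `data:image/png;base64,${screenshotB64}` } },
          ],
        },
      ],
    }),
  });
  const data = await res.json();
  const text: string = data.choices[0].message.content;
  // Assumed output shape: "Thought: ...\nAction: click(start_box='(120,340)')".
  // The parsing below is illustrative only.
  const [thought = "", action = ""] = text.split(/\nAction:\s*/);
  return { thought: thought.replace(/^Thought:\s*/, ""), action: action.trim() };
}
```

An agent would call this in a loop: take a screenshot, get a thought/action pair, execute the action, and repeat until the task is done.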
@DeepLearningAI @Tsinghua_Uni Welcome to Midscene, where you can control your browser directly through UI-TARS! github.com/web-infra-dev/…
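For a sense of what that looks like in practice, here is a minimal sketch of Midscene's Puppeteer integration. The import path, `PuppeteerAgent`, and `aiAction` follow the Midscene docs at the time of writing but should be treated as assumptions; pointing Midscene at a UI-TARS deployment is done through environment-variable model settings, whose exact names are not shown here.

```ts
// Minimal Midscene sketch (untested; API names per the Midscene docs and
// may change between versions). Model configuration, including using
// UI-TARS as the backing model, is supplied via environment variables.
import puppeteer from "puppeteer";
import { PuppeteerAgent } from "@midscene/web/puppeteer";

async function main() {
  const browser = await puppeteer.launch({ headless: false });
  const page = await browser.newPage();
  await page.goto("https://www.example.com");

  // Wrap the page in an agent, then describe the goal in natural language;
  // the model plans and executes the clicks/typing itself.
  const agent = new PuppeteerAgent(page);
  await agent.aiAction("click the first link on the page");

  await browser.close();
}

main().catch(console.error);
```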
@DeepLearningAI @Tsinghua_Uni Tech advancements like this are game-changers for user experience! Exciting times ahead!
@DeepLearningAI @Tsinghua_Uni UI-TARS sounds like a game-changer for streamlining computer interactions. I'm curious to see how it'll be applied in real-world scenarios, especially in creative fields.