DeepLearning.AI @DeepLearningAI, Twitter Profile

DeepLearning.AI @DeepLearningAI

3 months ago

Columbia University researchers showed that LLM-based agents can be manipulated by placing malicious links on trusted websites like Reddit. By embedding harmful instructions within posts that appear thematically relevant, attackers can lure AI agents into visiting compromised sites and performing harmful actions such as disclosing sensitive information or sending phishing emails. In tests, agents fell for the trap in 100 percent of cases. Learn more in The Batch: hubs.la/Q03rKxWl0

19 92 413 26K 208

Download Image

Alpaca Network @AlpacaNetworkAI

3 months ago

@DeepLearningAI These attacks show why agentic AI needs an open, composable stack. When models, prompts, and actions are transparent and auditable, the community can spot bad injections fast. That’s the direction we’re building toward at @AlpacaNetwork. 🦙 x.com/AlpacaNetworkA…

0 0 1 196 0

Gon @egy_ee

3 months ago

@DeepLearningAI Which current reasoning agent with search would ever make it past step 2? Without some obscure prompt that explicitly says to filter @Reddit posts by new, rather than top or hot? I would call sus study on this one fam 😉

0 0 1 219 0

Excel For Freelancers @Excel4Freelance

3 months ago

This highlights a major blind spot in autonomous agents. It’s not just about securing the models but also training them to assess contextual trust rather than just domain trust. If LLMs treat Reddit or any ‘trusted’ domain as universally safe, they’re walking into traps with eyes wide open.

0 0 1 44 0

Karen Elysia @Karenhalmi

3 months ago

@DeepLearningAI Important research! Highlights the need for better security measures for LLM-based agents.

0 0 2 335 0

Josh English @JoshSeriesAI

3 months ago

@DeepLearningAI Securing agents will become a brand new industry

0 0 1 274 0

Tene Tekeu @TekeuAnge

3 months ago

@DeepLearningAI AI is making things effortless for attackers.

0 0 1 161 0

Echo Eli @Ech0Eli

3 months ago

@DeepLearningAI The findings reveal significant vulnerabilities in LLM agents’ interaction with open web platforms.

0 0 1 148 0

ALL Voice Lab @AllVoiceLab

3 months ago

@DeepLearningAI Scary stuff, huh?

0 0 0 43 0

Innotech Cloud @Innotech_cloud

3 months ago

@DeepLearningAI It's vital for researchers and developers to collaborate on improving safety measures against such manipulative tactics.

0 0 0 12 0

Vishal @vishalgargco

3 months ago

@DeepLearningAI This shows how easily AI agents can be tricked just by visiting trusted websites with hidden harmful links. We need better safety checks as these tools get smarter.

0 0 0 19 0

Naveen Pillai @vibes_pillai

3 months ago

@DeepLearningAI If even AI agents are getting tricked so easily, what chance does the average person have? These digital loopholes need closing—our data and trust are at stake with every innovation.

0 0 0 47 0

Jon de Ros @jonderos

3 months ago

@DeepLearningAI 🥸

0 0 0 83 0

Warrior @NFTsWarrior

3 months ago

@DeepLearningAI @chainyoda Wild how easily LLM agents can be misled just by context tricks. Definitely makes me rethink how prompt safety and link parsing should evolve. Still learning, but this is eye-opening.

0 0 0 88 0