Shiyu Chang @CodeTerminator, Twitter Profile

Shiyu Chang @CodeTerminator

2 years ago

🛡️New Jailbreak Defenses for LLMs: By harnessing semantic-preserving transformations with randomized smoothing, we have enabled LLMs to defend jailbreaks with minimal impact on their performance for benign tasks. An amazing collaboration between students at UCSB and UPenn.

Jiabao Ji @JiabaoJi

2 years ago

🛡️New Jailbreak Defenses for LLMs: By harnessing semantic-preserving transformations with randomized smoothing, we have enabled LLMs to defend jailbreaks with minimal impact on their performance for benign tasks. An amazing collaboration between students at UCSB and UPenn.

1 1 12 3K 0

Download Image

0 1 11 3K 3