Abhinav Rao @AetherSuRa

CS PhD@UMD abhinavrao.netlify.app Pittsburgh, PA Joined October 2021

Tweets

126
Followers

236
Following

551
Likes

430

Lindia Tjuatja @lltjuatja

3 months ago

When it comes to text prediction, where does one LM outperform another? If you've ever worked on LM evals, you know this question is a lot more complex than it seems. In our new #acl2025 paper, we developed a method to find fine-grained differences between LMs: 🧵1/9

1 33 152 32K 66

Download Image

Gauri@DHS2025 @geekytwoshoes

4 months ago

Thrilled to announce our paper "CAPTURE: Context-Aware Prompt Injection Testing and Robustness Enhancement" has been accepted to the ACL 2025 LLMSec Workshop! Looking forward to sharing our work on tackling prompt injection in LLMs. #ACL2025 #LLMSec #AIsecurity #NLP

1 2 7 499 0

Language Technologies Institute | @CarnegieMellon @LTIatCMU

5 months ago

Congrats to the Purpl3Pwn3rs and Team RedTWIZ! Both teams feature LTI students, and both are finalists in the inaugural @amazon Nova AI Challenge. Read about it here: lti.cmu.edu/news-and-event…

0 3 10 2K 2

nick.eth @nicksdjohnson

5 months ago

Recently I was targeted by an extremely sophisticated phishing attack, and I want to highlight it here. It exploits a vulnerability in Google's infrastructure, and given their refusal to fix it, we're likely to see it a lot more. Here's the email I got:

1K 6K 36K 5.7M 17K

Download Image

nick.eth @nicksdjohnson

5 months ago

Turns out easydmarc have a good writeup on this attack too: easydmarc.com/blog/google-sp…

9 174 2K 125K 619

Jim Bohnslav @jbohnslav

5 months ago

bytedance calling me GPU poor A model trained for 665,000 H100 hours is called "cost efficient", "moderate computational resources"

15 25 422 28K 120

Download Image

Harshita Diddee @ihsrahedid

7 months ago

Ever wondered which instruction selection strategy to choose for your custom setup? The answer might just be random sampling! In our recent #NAACL Findings paper, we show that popular strategies do not *consistently* beat random selection! Paper: shorturl.at/77ECJ 1/6

2 15 62 8K 7

Download Image

Akhila Yerukola @akhila_yerukola

6 months ago

Did you know? Gestures to express universal concepts—like wishing for luck—vary WIDELY across cultures? 🤞means luck in US but deeply offensive in Vietnam 🚨 📣We introduce MC-SIGNS, a test bed to evaluate how LLMs/VLMs/T2I handle such nonverbal cues 📜: arxiv.org/abs/2502.17710

2 17 53 9K 15

Download Image

Koustava Goswami @koustavagoswami

8 months ago

I am hiring one PhD intern working on LLM agents and reasoning. Goal is to improve LLM reasoning capability for question-answering and explainable tasks. If you are doing PhD and have published first author paper/papers in these fields please DM me #NLP @AdobeResearch

3 13 151 18K 84

Abhinav Rao @AetherSuRa

8 months ago

Bad actors can really mess with you just because! This is a harsh lesson to secure your accounts and follow good computer practices no matter who you are.

Aditi Khandelwal @Aditi184

8 months ago

Bad actors can really mess with you just because! This is a harsh lesson to secure your accounts and follow good computer practices no matter who you are.

2 2 75 7K 3

0 0 4 227 0

Nabeel S. Qureshi @nabeelqu

9 months ago

Here's an alternative framing: we trained Claude Opus to be moral and ethical, and despite our best attempts to jailbreak its morality, we failed. Conclusion: Claude Opus is aligned.

2 8 176 6K 11

Aditi Khandelwal @Aditi184

9 months ago

😡 Absolutely disappointed with @overleaf. My account was deleted without my knowledge, and they’ve done nothing to help me recover it or transfer to my secondary email. Years of work, including all my CVs, SOPs, papers, etc., gone! This is unacceptable. #Overleaf

40 50 747 130K 119

Arjun Choudhry @Arjun_7m

9 months ago

Excited to share TimeSeriesExam for systematic evaluation of time series reasoning capabilities of LLMs. Think your LLM can reason on time series concepts? Take it for a spin on the TimeSeriesExam! Now publicly available on HuggingFace :)