This week, we showed how altering internal "features" in our AI, Claude, could change its behavior.
We found a feature that can make Claude focus intensely on the Golden Gate Bridge.
Now, for a limited time, you can chat with Golden Gate Claude: claude.ai
Occasional reminder that the United States could have been the world leader in 5G technology instead of China if we had just given *one guy* a green card when he needed one.
This person has a PhD in computer science from Johns Hopkins and is currently working at Meta on AI research.
And he just got denied an H-1B visa because we made him enter a lottery to get one.
Absolutely insane that this is how our immigration system works.
Today @ElenaMusi2 led the reading group discussion on LLMs' abilities to identify common ground, based on the “Views Are My Own, But Also Yours: Benchmarking Theory of Mind using Common Ground” paper by @adilsoubki, @murzakuj, Jordehi, @peterznlp, Markowska, Mirroshandel, @OwenRambow
🚀 Thrilled to unveil our latest research: 'Google Scholar is manipulatable'. We dive deep into the alarming reality of citation manipulation, showing how easily fake citations flood Scholar profiles. Our findings shed crucial light on academic integrity.
arxiv.org/abs/2402.04607
337 Followers 640 Following
Assistant Professor of CS, New York University Abu Dhabi.
Love networks, Internet, and Web. My hobby is building systems and performing network measurements.
116 Followers 192 Following
PhD candidate, @PACELab_SBU, @stonybrooku
BTech in EE from @IIT_Bhilai
Website: https://t.co/KrDSf8SUvo
PS: I am not much active on Twitter
223 Followers 972 Following
PhD candidate at @sbucompsc in @WWBProject working on computational psych; prev: research @Uber AI. You know me if you’re used to calling me dk.
1K Followers 779 Following
Assistant Professor in Psychology at Stony Brook University. I’m interested in how people interact with LLMs and the impact they might have on our psychology.
204K Followers 25 Following
Manus is the general AI agent that bridges minds and actions: it doesn't just think, it delivers results. Download our app: https://t.co/XSfjRhjdgo
12K Followers 3K Following
research @MIT_CSAIL @thinkymachines. working on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
29K Followers 806 Following
Mathematician (Distinguished Professor of #Math at @RutgersU). Here to learn about research, education, and community. Let’s build something together.
25K Followers 89 Following
A non-profit research lab focused on interpretability, alignment, and ethics of artificial intelligence.
Creators of GPT-J, GPT-NeoX, Pythia, and VQGAN-CLIP
2K Followers 455 Following
Asst Prof. @ UCSD | PI of LeM🍋N Lab | Former Postdoc at ETH Zürich, PhD @ NYU | computational linguistics, NLProc, CogSci, pragmatics | he/him 🏳️‍🌈
35K Followers 189 Following
Co-founder and CEO https://t.co/efv72CKpAG (@WaveFormsAI) - Ex @OpenAI GPT-4o/AVM Audio Research Lead - #Her #TARS - Ex @AIatMeta, @Polytechnique (X11)
30K Followers 123 Following
Mechanistic Interpretability lead DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!
27K Followers 296 Following
Professor of linguistics and professor of computer science at Stanford and author of the James Beard award finalist "The Language of Food"
54K Followers 0 Following
We are building a world class AI R&D company in Tokyo. We want to develop AI solutions for Japan’s needs, and democratize AI in Japan. https://t.co/1q07mb3TzE
5K Followers 717 Following
Bring GenAI and Knowledge Graph to enterprise systems. | Director of ML @Adobe Experience Platform | Previously @Apple @IBMResearch. Tweets are all mine.
222K Followers 1 Following
Updates for developers building with the OpenAI Platform and API • Service status: https://t.co/kZwnwdYqOS • Support: https://t.co/qCi6M5ESZU