Robert Kirk @_robertkirk

Research Scientist at @AISecurityInst; PhD Student @ucl_dark. LLMs, AI Safety, Generalisation; @Effect_altruism robertkirk.github.io Joined January 2020

381 Tweets 1K Followers 282 Following 604 Likes

Robert Kirk @_robertkirk

4 days ago

New blog! We @AISecurityInst partnered with @NCSC to write about an emerging practice I'm really excited about: Safeguard Bypass Bounty Programmes (SBBPs). Summary of what these are, why they are useful, & how to do them well 🧵

Robert Kirk @_robertkirk

a week ago

Since I started working on safeguards, we've seen substantial progress in defending certain hosted models, but less progress in measuring & managing misuse risks from open weight models. Three directions I want explored more, drawn from our @AISecurityInst post today 🧵

AI Security Institute @AISecurityInst

a week ago

🚨Open-weight AI models are becoming more powerful, now knocking on the door of today’s closed-weight frontier. This poses critical safety challenges – how can we prevent the misuse of models whose parameters are free to download online? 🧵

Jim Fan @DrJimFan

325K Followers 3K Following NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.

Edward Grefenstette @egrefen

42K Followers 865 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.

Laura Ruis @LauraRuis

6K Followers 753 Following PhD with @_rockt and @egrefen. Inc. postdoc with @jacobandreas @MIT_CSAIL. Anon feedback: https://t.co/sbebAl53tU

Roberta Raileanu @robertarail

9K Followers 2K Following Senior Staff Research Scientist @GoogleDeepMind & Honorary Lecturer @UCL. ex @Meta|@MSFTResearch|@NYU|@Princeton. Llama-3, Toolformer, Rainbow Teaming, MLGym.

Tim Rocktäschel @_rockt

39K Followers 2K Following Director and Open-Endedness Team Lead @GoogleDeepMind, Professor of AI @AI_UCL, PI @UCL_DARK, Fellow @ELLISforEurope.

Minqi Jiang @MinqiJiang

6K Followers 880 Following

Nathan Lambert @natolambert

56K Followers 853 Following Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc Contact via email. Writes @interconnectsai Wrote The RLHF Book Mountain runner

Bam4d @Bam4d

3K Followers 1K Following AI Scientist at https://t.co/SUcb0CBcb7. PhD in AI. Inference Wizardry. ex. @MetaAI @modl_ai

Michael Dennis @MichaelD1729

4K Followers 813 Following Open-Endedness RS @GoogleDeepMind. Building for an unspecifiable world | Unsupervised Environment Design, Game&Decision Theory, RL, AIS. prev @CHAI_Berkeley

Riley Goodside @goodside

150K Followers 3K Following former staff prompt engineer @scale_ai

Pasquale Minervini @PMinervini

9K Followers 5K Following Research in ML/NLP at the U of Edinburgh (tenured faculty @InfAtEd @EdinburghNLP), Co-Founder @Miniml_AI, @ELLISforEurope Scholar, https://t.co/5dUI3EFexo

Jesse Mu @jayelmnop

6K Followers 587 Following Computational linguistics @AnthropicAI

Miles Brundage @Miles_Brundage

62K Followers 12K Following AI policy researcher, wife guy in training, fan of cute animals and sci-fi, Substack writer, stealth-ish non-profit co-founder

Jack Parker-Holder @jparkerholder

9K Followers 779 Following Co-lead of Genie 3 @GoogleDeepMind & Honorary Associate Professor @UCL_DARK. Dad (👶🐶), CFC fan, BJJ. Views are my own :)

David Krueger @DavidSKrueger

18K Followers 4K Following AI professor. Deep Learning, AI alignment, ethics, policy, & safety. Formerly Cambridge, Mila, Oxford, DeepMind, ElementAI, UK AISI. AI is a really big deal.

Chris Lu @_chris_lu_

4K Followers 616 Following Research @OpenAI Prev: DPhil Student @UniofOxford, RS Intern @SakanaAILabs @DeepMind and RS @CovariantAI

Ethan Caballero is bu... @ethanCaballero

11K Followers 2K Following ML @Mila_Quebec ; previously @GoogleDeepMind

UCL DARK @UCL_DARK

4K Followers 197 Following UCL Deciding, Acting, and Reasoning with Knowledge (DARK) Lab at @AI_UCL led by @_rockt, @egrefen, @robertarail, and @jparkerholder.

akbir. @akbirkhan

2K Followers 972 Following

Zhengyao Jiang @zhengyaojiang

4K Followers 417 Following Cofounder & CEO @WecoAI. Automating hill climbing with AI-Driven Exploration (AIDE). PhD in Machine Learning @UCL_DARK. (Zheng=j-uhng, j as in job; yao=y-aoww)

Ekin @ekinomicss

207 Followers 293 Following borrowed stardust, technical staff @AISecurityInst, 👩‍💻🕊️🚴🏼‍♀️☕️🏔️🌳🐱🎬📚

Nell @Qootu18781

21 Followers 897 Following Life is made of small moments like this.

Maxim Von Tarnow (tes... @yaponamat

53 Followers 1K Following Behind every door that you shouldn't open lies something that is waiting for you.

Chandler Smith @ChandlerDSmith

151 Followers 802 Following Multi-Agent Safety Researcher | Cooperative AI Foundation | Technical Architect Fellow at IQT | LFC Fan

Rodrigo Alves Vieira @RodrigoAVieiraX

3 Followers 95 Following

Visual-Intelligence @VI_Journal_CSIG

124 Followers 1K Following Official journal of China Society of Image and Graphics (CSIG). The journal is published by Springer, sponsored by CSIG. E-ISSN 2731-9008.

Sam Lewis-Lim @samlewislim

9 Followers 230 Following PhD student doing NLP @ University of Sheffield.

James Aung @jjamesaung

142 Followers 632 Following preparedness

Matthew Henty @matthewhenty

296 Followers 2K Following

mRr3b00t @UK_Daniel_Card

113K Followers 8K Following Global Cyber Security Support

Tara Makara @queenie_sunday

424 Followers 3K Following CyberSec, Art & Nature Loving, Cheese Mourning Yam Yam Queenie Sunday | Flickr https://t.co/iu33I1cuM7

Daniel Cuthbert @dcuthbert

32K Followers 2K Following Documentary photographer, old creaky hacker. Co-author of @OWASP ASVS standard. Blackhat/Brucon Review Board & Co_chair UK Gov Cyber Security Advisory Board

Ollie ilott @Ollie_ilott

125 Followers 183 Following Director General for AI, UK Govt

Jon @ElfordJon

261 Followers 908 Following

Haydn Belfield @HaydnBelfield

5K Followers 2K Following Research Scientist (Frontier Planning) at @GoogleDeepMind. Research Affiliate @Cambridge_Uni @CSERCambridge & @LeverhulmeCFI. All views my own.

Alex Borshik @AlexBorshik

104 Followers 710 Following

Maksym Andriushchenko @maksym_andr

5K Followers 891 Following Faculty at @ELLISInst_Tue & @MPI_IS, leading the AI Safety and Alignment group. PhD from @EPFL supported by Google & OpenPhil PhD fellowships.

云创兽Ai @Flieber638646

1 Followers 93 Following 💸 tracking Nasdaq trends is my vibe, says this brave girl! open to insights. DM me about GDP growth! 💬 #Finance #Wealth

Merri @Merri9523

27 Followers 1K Following

Manfred Diaz @linuxpotter

557 Followers 1K Following Ph.D. Candidate @Mila_Quebec interested in AI/ML connections with economics, game theory, and social choice theory.

Ryan Remmel @ryan_remmel

26 Followers 684 Following

karanbania @bnnarakk

0 Followers 747 Following .

Grzegorz Pohorecki @gregpoho

7 Followers 150 Following

George Gor @ggomondi

1K Followers 2K Following Many things AI @thefuturesoc & @Cambridge_Uni. Fundani, sebenzani, nizimele✨Views my own

sarah @littIeramblings

3K Followers 738 Following AI worrier & houseplant enthusiast

Matthew Clarke @Matthew05049818

0 Followers 3K Following

Matt Kilcoyne @MRJKilcoyne

13K Followers 8K Following Policy. Tech. Data. AI. Trade. Housing. Migration. Caledonian, Cheshire/Clwyd border. British. Live, Laugh, Love!

Illy G. @illyG1111

114 Followers 417 Following Hacking SEO as Director for https://t.co/JJSjby3RSM. Speaking on the topics in SEO, crypto, AI. A queer techie + Coffee Geek ☕️

Sumaya Nur @SumayaNur_

2K Followers 665 Following Law| AI governance| Department of Science Innovation & Technology

New Scientists @awards67811

111 Followers 6K Following Research work, Innovation, exemplifying creativity, impact, and a vision for a better future. #nscawards #phd #researcher website: https://t.co/LlgGrYt7X8

🇺🇸Release The E... @mcnultydigital

442 Followers 8K Following Don't Be A Trump First Pathetic “Patriotic” Pedo Protector Defending Delulu Donald’s Denying, Distracting & Deflecting

waleedsial @JrWaleedsial

399 Followers 8K Following Senior Analyst @ GoldmanSachs - Alum @UT_Dallas, @FastNU_Official, @jindal_utdallas

vishnu prasad V @viznuvv

1 Followers 5K Following Minimum guy !!

Amirmahdi Sadeghzadeh @amsadeghzadeh

6 Followers 101 Following Assistant Prof. @ Sharif UT / Trustworthy and Secure AI / Blockchain.

不需要投資 @CcSet88qNh2rFD

1 Followers 28 Following Single, looking to meet a genuine friend, or a boyfriend

Models Matrices @MatricesLayers

162 Followers 3K Following

Martim Cruz @martimC11

487 Followers 616 Following Founding Data Scientist, quantifying AI risk @testudoinsure | 🇵🇹

Ayla Croft @aylacroft

3K Followers 1K Following ```move 37 ``` | samy is my hero | the world is forked | i groom & gaslight llms | #ff0000 team

Varsha Bansal @VarshaaBansal

6K Followers 5K Following Independent journalist. Writes on tech & society. Knight-Bagehot Fellow ‘25 at @columbia. Bylines: @TIME, @wired, @techreview, @AJEnglish, @restofworld

@[email protected]... @EdHenry_

2K Followers 1K Following Things I like : Mathematics, Machine Learning, Causality, Networks, and Philosophy.

g @georgedeath

29 Followers 1K Following He who controls the spice controls the universe.

Ben Murphy @benjaminmmurphy

163 Followers 826 Following Current Summer Fellow @GovAI_, generally JD Candidate at Harvard Law, previously eng. at @InflectionAI, @SubstackInc, SRE @Google, @BrownUniversity.

Yixiong Hao @Yixiong_Hao

396 Followers 1K Following cs @GeorgiaTech | strong preference for aligned AI | co-director @ https://t.co/pJCxjv7GWv | prev @startupxchange

AISecHub @AISecHub

4K Followers 4K Following 🚀 AISecHub | AI & Cybersecurity | Discussing AI-driven threats, securing AI systems, and sharing insights on emerging challenges 💡

Reg Saddler @zaibatsu

445K Followers 166K Following systems architect. Bridging strategy, #AIresearch and #cybersecurity for next-gen, human-aligned tech. Designing for impact, resilience, & trust.

Novy Vos, MD, PhD @NovyVos

111 Followers 2K Following Psychologist, Founder of CASUAL LTD., Philanthropist by UNICEF and Infinite Peace NGO

someone random @serackerardy

278 Followers 5K Following dev

Journalases @journalases

29 Followers 502 Following Official account of ASES academic journals. Peer-reviewed, open access, multidisciplinary publications and calls for papers. Publishing in 7 fields

!.! @xypyth

46 Followers 4K Following

Neet Feudal Lord of R... @NeetFeudalLord

1K Followers 8K Following Just a young man dreaming of robots serving under me while I do nothing.

Google DeepMind @GoogleDeepMind

1.2M Followers 279 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Andrej Karpathy @karpathy

1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.

Sergey Levine @svlevine

108K Followers 133 Following Associate Professor at UC Berkeley Co-founder, Physical Intelligence

Edward Grefenstette @egrefen

42K Followers 865 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.

Laura Ruis @LauraRuis

6K Followers 753 Following PhD with @_rockt and @egrefen. Inc. postdoc with @jacobandreas @MIT_CSAIL. Anon feedback: https://t.co/sbebAl53tU

Natasha Jaques @natashajaques

30K Followers 1K Following Assistant Professor @uwcse and Staff Research Scientist at @GoogleAI. Let's get off this app: https://t.co/jbH2oAjbPN

Roberta Raileanu @robertarail

9K Followers 2K Following Senior Staff Research Scientist @GoogleDeepMind & Honorary Lecturer @UCL. ex @Meta|@MSFTResearch|@NYU|@Princeton. Llama-3, Toolformer, Rainbow Teaming, MLGym.

Tim Rocktäschel @_rockt

39K Followers 2K Following Director and Open-Endedness Team Lead @GoogleDeepMind, Professor of AI @AI_UCL, PI @UCL_DARK, Fellow @ELLISforEurope.

Minqi Jiang @MinqiJiang

6K Followers 880 Following

Eric Jang @ericjang11

103K Followers 4K Following AI at @1x_tech

Anthropic @AnthropicAI

637K Followers 35 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.

Nathan Lambert @natolambert

56K Followers 853 Following Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc Contact via email. Writes @interconnectsai Wrote The RLHF Book Mountain runner

Bam4d @Bam4d

3K Followers 1K Following AI Scientist at https://t.co/SUcb0CBcb7. PhD in AI. Inference Wizardry. ex. @MetaAI @modl_ai

Michael Dennis @MichaelD1729

4K Followers 813 Following Open-Endedness RS @GoogleDeepMind. Building for an unspecifiable world | Unsupervised Environment Design, Game&Decision Theory, RL, AIS. prev @CHAI_Berkeley

Neel Nanda @NeelNanda5

30K Followers 123 Following Mechanistic Interpretability lead DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!

Riley Goodside @goodside

150K Followers 3K Following former staff prompt engineer @scale_ai

Felix Hill @FelixHill84

12K Followers 745 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's

Jesse Mu @jayelmnop

6K Followers 587 Following Computational linguistics @AnthropicAI

Marc G. Bellemare @marcgbellemare

16K Followers 349 Following CSO & co-founder, Reliant AI. Ex RL research lead at Google Brain, DeepMind. Known for Atari 2600 RL benchmark, Distributional RL (MIT Press 2023).

Miles Brundage @Miles_Brundage

62K Followers 12K Following AI policy researcher, wife guy in training, fan of cute animals and sci-fi, Substack writer, stealth-ish non-profit co-founder

Jai Patel @jaipatelAISI

29 Followers 19 Following Safeguards Analysis Team, @AISecurityInst

Pebble @Pebble

75K Followers 530 Following Wearables with brains for people with heart. Turn tiny moments of awesome into the best times ever. Tell the world how you #MakePebbleYours ❤️

Boaz Barak @boazbaraktcs

24K Followers 587 Following Computer Scientist. See also https://t.co/EXWR5k634w . @harvard @openai opinions my own.

Reuben Adams @ReubenJAdams

41 Followers 136 Following PhD student in AI at UCL. Statistical learning theory.

Ilia Shumailov🦔 @iliaishacked

3K Followers 792 Following Now: stealth, Past: {Senior Scientist @GoogleDeepMind, JRF @ChCh_Oxford @UniofOxford, Fellow @VectorInst, PhD @Cambridge_Uni}

Simon Willison @simonw

115K Followers 6K Following Creator @datasetteproj, co-creator Django. PSF board. Hangs out with @natbat. He/Him. Mastodon: https://t.co/t0MrmnJW0K Bsky: https://t.co/OnWIyhX4CH

Avi Schwarzschild @A_v_i__S

896 Followers 248 Following Trying to learn about deep learning faster than deep learning can learn about me.

Matthew Yglesias @mattyglesias

548K Followers 2K Following Slow Boring, cohosting https://t.co/wxUj3JFSFf, Bloomberg columnist

aidan ewart @aidanprattewart

458 Followers 520 Following ai safety lukewarm takes | maths undergrad @ UoB

Micah Carroll @MicahCarroll

1K Followers 689 Following AI PhD student @berkeley_ai /w @ancadianadragan & Stuart Russell. Working on AI safety ⊃ preference changes/AI manipulation.

Joshua Clymer @joshua_clymer

2K Followers 113 Following Turtle hatchling trying to make it to the ocean. I work at Redwood Research.

Benjamin Hilton @benjamin_hilton

3K Followers 857 Following Head of Alignment at the UK AI Security Institute (AISI). Semi-informed about economics, physics and governments. views my own

Arthur Conmy @ArthurConmy

4K Followers 1K Following Aspiring 10x reverse engineer @GoogleDeepMind

Will Kirby @wk1rby

217 Followers 536 Following UK AISI's alignment team. Views my own. sand god doula.

Lorenz Breidenbach @ethlorenz

2K Followers 466 Following Yak Shaver and Security Researcher. Head of Research&Development at Chainlink Labs. Formerly at 🇨🇭 ETH Zürich, 🗽 Cornell Tech,⛓️ IC3.

Ryan Greenblatt @RyanPGreenblatt

6K Followers 4 Following Chief scientist at Redwood Research (@redwood_ai), focused on technical AI safety research to reduce risks from rogue AIs

Asa Cooper Stickland @AsaCoopStick

1K Followers 905 Following I test language models @ the UK AI Security Institute

Tom Davidson @TomDavidsonX

1K Followers 237 Following Senior Research Fellow @forethought_org Understanding the intelligence explosion and how to prepare

Steven Adler @sjgadler

9K Followers 753 Following Ex-OpenAI safety researcher (danger evals & AGI readiness), https://t.co/XtUTLK3jEo. Likes maximizing benefits and minimizing risks of AI

Erik Jenner @jenner_erik

918 Followers 152 Following Research scientist @ Google DeepMind working on AGI safety & alignment

Yi Xu @_yixu

513 Followers 423 Following AI researcher, interested in LLMs and reinforcement learning | Previously @UCL_DARK, @imperialcollege, @UniMelb

Logan Graham @logangraham

7K Followers 6K Following make things radically good 🌎 @anthropicai | give me feedback: https://t.co/R1OyioKMXy

Michal Bravansky @michalbravansky

169 Followers 1K Following @verifee @ucl

Haydn Belfield @HaydnBelfield

5K Followers 2K Following Research Scientist (Frontier Planning) at @GoogleDeepMind. Research Affiliate @Cambridge_Uni @CSERCambridge & @LeverhulmeCFI. All views my own.

Ryan Kidd @ryan_kidd44

2K Followers 1K Following Co-Executive Director @MATSprogram, Co-Founder @LondonSafeAI, Regrantor @Manifund | PhD in physics | Accelerate AI alignment + build a better future for all

Max Nadeau @MaxNadeau_

1K Followers 521 Following Advancing AI honesty, control, safety at @open_phil. Prev Harvard AISST (https://t.co/xMMztyYR3O), Harvard '23.

Lorenz Wolf @lorenz_wlf

56 Followers 213 Following PhD student in Foundational AI @ucl @ai_ucl @uclcs Enrichment Fellow @turinginst 2x ML Research Intern at Apple working on Differential Privacy

Evžen Wybitul @evzen_wy

262 Followers 762 Following Incoming AI safety and technical AI governance DPhil @UniofOxford • MSc in AI at ETH Zurich • 2x @MATSprogram • Talos AI Governance Fellowship • 🇪🇺🇨🇿

Marie Davidsen Buhl @MarieBassBuhl

239 Followers 96 Following Research Scientist @AISecurityInst| AI Policy Researcher @GovAI_ | Frontier AI Safety Cases

Jasmine @j_asminewang

6K Followers 1K Following alignment @OpenAI. past @AISecurityInst @verses_xyz @kernel_magazine @readtrellis @copysmith_ai

Dylan HadfieldMenell @dhadfieldmenell

4K Followers 2K Following Associate Prof @MITEECS working on value (mis)alignment in AI systems; @dhadfieldmenell@bsky.social; he/him

Hyrum Anderson @drhyrum

3K Followers 1K Following CTO at Robust Intelligence. Formerly, Microsoft, Endgame/Elastic, Mandiant/FireEye, Sandia & MIT Lincoln Labs. 'He who forgives ends the quarrel'

Ben Edelman @EdelmanBen

398 Followers 50 Following Science & governance of AI Currently: Agent Security lead @ U.S. Center for AI Standards and Innovation

Chris Arnade 🐢🐱... @Chris_arnade

92K Followers 3K Following Walking the world, one city at a time. I like turtles, cats, & buses. Subscribe to my Substack: https://t.co/j6mE4TVfBl

Séb Krier @sebkrier

12K Followers 7K Following 🪼 AGI policy dev lead @GoogleDeepMind | rekkid junkie, dimensional glider, deep ArXiv dweller, interstellar fugitive, uncertain | 🛸

David Chanin @chanindav

100 Followers 202 Following

Daniel Tan @DanielCHTan97

335 Followers 405 Following AI safety researcher. Interested in developing a science of how LLMs generalise, and understanding relevant safety risks.

Naomi Saphra @nsaphra

10K Followers 1K Following Waiting on a robot body. All opinions are universal and held by both employers and family. Now a dedicated grok hate account. Accepting ML/NLP PhD students.

Yo Shavit @yonashav

7K Followers 957 Following policy for v smart things @openai. Past: CS PhD @HarvardSEAS/@SchmidtFutures/@MIT_CSAIL. Tweets my own; on my head be it.

Rootclaim @Rootclaim

5K Followers 167 Following Settle controversies using real-world data and math. Calculating reality the way no human can.

Yarin @yaringal

41K Followers 245 Following Professor of Machine Learning, University of Oxford @OATML_Oxford Group Leader Director of Research at the UK govt's AI Security Institute (AISI)

Matt Clifford @matthewclifford

33K Followers 2K Following Co-founder @join_ef; Chair @ARIA_Research; UK AI

michael vassar @HiFromMichaelV

5K Followers 169 Following ZOG in exile. The most ironic outcome is the most likely. Reducing the irony is my job. There is no antimemetics division

Ben Landau-Taylor @benlandautaylor

13K Followers 25 Following Investigating power, society, and industry. Please don't get your ought all over my is.

Aella @Aella_Girl

239K Followers 394 Following survey artist, too earnest. https://t.co/IcEgPhVD3o

Transluce @TransluceAI

8K Followers 15 Following Open and scalable technology for understanding AI systems.

Summer Yue @summeryue0

6K Followers 365 Following Safety and alignment at Meta Superintelligence. Prev: VP of Research at Scale AI, research at Google DeepMind / Brain (Gemini, LaMDA, RL / TFAgents, AlphaChip).

derek guy @dieworkwear

1.4M Followers 958 Following Menswear writer. Editor at @putthison. Creator of @RLGoesHard. Bylines at The New York Times, The Financial Times, Politico, Esquire, and Mr. Porter

mrinank 🌳 @MrinankSharma

2K Followers 556 Following anthropic researcher, poet, flautist, DJ ⭐️ everything has to do with loving and not loving, rumi
