Pegah Maham @pegahbyte
Is it true or is it just confirmation bias? || Freiheitlich-demokratische Grundordnung || overcame reductionism
Policy Development & Strategy @GoogleDeepMind
London
Joined October 2021
Tweets: 207 | Followers: 570 | Following: 254 | Likes: 1K
BRILLIANT @GoogleDeepMind research. Even the best embeddings cannot represent all possible query-document combinations, which means some answers are mathematically impossible to recover. Reveals a sharp truth, embedding models can only capture so many pairings, and beyond that,…
I’m excited to share the first part of an absolutely stunning analysis from the GPT-5 thinking model! I uploaded a huge spreadsheet, nearly 1,300 metabolites (lipids, carbohydrates, microbiome-derived compounds, and much more) measured in 150 ME/CFS patients and 100 healthy…
Is the one law firm undereliciting or the other one not doing sufficient quality assurance? 🙇♀️🤷♀️
pretraining is an elegant science, done by mathematicians who sit in cold rooms writing optimization theory on blackboards, engineers with total absorb of distributed systems of titanic scale posttraining is hair raising cowboy research where people drinking a lot of diet coke…
humans built machines that talk to us like people do and everyone acts like this is normal now. it's pretty nuts
AI governance is often more like improv comedy than I expected: don't try to come up with a clever idea, instead just focus on the present moment and execute well on the just-next open thing: build taxonomies / frameworks, substantiate each aspect, repeat 🔁
Frontier AI safety frameworks have emerged as a critical tool for managing potential risks to public safety and security. In a series of technical reports over the coming months, the Frontier Model Forum will examine how these frameworks can be implemented effectively.
Benchmarks saturate quickly, but don’t translate well to real-world impact. *Something* is going up very fast, but not clear what it means. Thus the wide range of expert opinion, from “superintelligence in a few years”, to “we’ve already hit a wall”. Our results shed some light:
in case you're wondering how it's going
@DAcemogluMIT "Whether near-term AGI is an achievable goal remains an open question" Are your estimates on jobs (5% heavily impacted) and productivity (0.06% yearly TFP increase) this decade conditional on AGI not being achieved, or are they taking the possibility into account?
Someone should publish a safety paper that flips all the experimental results exactly around, watch to see who details how it confirms their previous views, then do the reveal.
I agree with this in principle and overall. On the ground, at the moment, I see the following downsides, and the need to substantiate the high level principles. - It seems to me that there is a significant lack of agreement on what exactly a critical dangerous capability is…
the reason that science turned into slop in the second half of the 20th century is that it became a model trained mostly on its own output
For decades, animal advocates lobbied to stop the worst abuses of farm animals -- with limited results. Then they changed strategy and achieved wins that have already benefited hundreds of millions of animals, and are set to benefit billions more. Here's the story... 🧵
the take that "we can't use probabilities to model risk X, so we should behave as if its probability of occurring is zero" is surprisingly popular for some values of X
I've been super impressed at the speed with which our interpretability team gets stuff done. Their previous paper (also SotA at the time) was < 3 months ago. And they've also trained (and will open source) a full suite of SAEs on Gemma 2 9B! x.com/NeelNanda5/sta…

Alexandra Paulus @ale_paulus
2K Followers 524 Following Cybersecurity policy + emerging tech @SWP_IntSecurity. Current focus: Resilient military software supply chains. @alexandrapaulus.bsky.social
Ben Brake 🇺🇦 @Datenbrake
3K Followers 2K Following East Frisian | Go West | Berlin | Travel | Books | Cancer | Meat Loaf | @S04 | 👨❤️👨 @van_Koch 🏳️🌈 | #AntontheCat
reframeTech @reframe_tech
3K Followers 156 Following Technology development must be oriented more strongly toward the common good. A project of @BertelsmannSt. Legal notice: https://t.co/UBTUh7AQid
Teresa Staiger @StaigerTeresa
190 Followers 574 Following working @reframe_tech on AI for the Common Good (the real one) and a strong civil society | B4 @BBE_Forum I Newsletter #Erlesenes I @staigerteresa.bsky.social
Stefan Heumann @St_Heumann
3K Followers 701 Following Managing Director @AgoraDT Ex @snv_berlin board @okfde
Richard Ngo @RichardMCNgo
62K Followers 2K Following studying AI and trust. ex @openai/@googledeepmind
Lajla Fetic @lajlafetic
891 Followers 619 Following Expert AI Governance & Digital Policy | #100BrilliantWomen in #AIEthics | Ex @reframe_Tech @snv_berlin | @thehertieschool @sciencespo | she/her
@HonkHase.bsky.social... @HonkHase
21K Followers 4K Following 20yrs Sec @CCC, @GeraffelV @cbase @loadev @AG_KRITIS @CSCBonn, #AGND #hacking #Ethik #KRITIS #Cyberresilienz, working at @HiSolutions https://t.co/xm4yUZF1W5
Michael Puntschuh @gloptimist
580 Followers 1K Following tech, governance, human rights, ethics | AI Lead @EFN, Beyond AI Collective | mostly absent here | @[email protected] | 🖖 | 🇺🇳🇪🇺🇳🇴🇩🇪🇨🇿 | he/him
Sarah Bressan @bressa... @bressansar
4K Followers 4K Following Find me on other platforms. Researcher at GPPi Berlin on foresight, political violence, German and EU security.
49security @49security
1K Followers 794 Following We curated ideas & recs for 🇩🇪’s National Security Strategy (out now) | Edited by @GPPi (2022-23) | in DE & ENG All articles and interviews remain online.
Eliwor @Eliwor064
2 Followers 900 Following
Rene Kertzmann @ReneK72186
61 Followers 3K Following
Fatima @fsiddchow
39 Followers 981 Following Small, self-deprecating neuroscientist @ucl | ♥️: laverbread, art, animals & the 🌊| Views my own| Likes signify amusement, alarm, approbation or nothing at all
David Shor @davidshor
79K Followers 4K Following Head of Data Science at Blue Rose Research, based in NYC, originally from Miami. I try to elect Democrats. Views are my own. he/him🌹
Bruno Galizzi @galizzigalizzi
377 Followers 2K Following AI safety and governance | @govai_ | @scitechgovuk | @UBAonline | @LSESocialPolicy | personal opinions
AI Adventurer Seb @scifirunai
22 Followers 391 Following "First, solve the problem. Then, write the code." — John Johnson
Owen Larter @OCLarter
229 Followers 2K Following
Sive @Sive98771
71 Followers 2K Following
chiara @chiaragerosa
128 Followers 267 Following tech, progress, norms, experience, community, society. but can't promise I'll post about any of that. https://t.co/IJIegRNO2p
bayesian goldfish @gldfishmindset
9 Followers 1K Following
Siebe. @PatientPersists
2K Followers 698 Following 🎯 Goal: curing SARS-CoV-2 persistence #LongCovid | No formal biomed education | MA Philosophy | MSc Business | ❤️ nuance | effective altruism
David Norman @davenorman
507 Followers 590 Following MD @coop_ai Cooperative intelligence of advanced AI. Italophile.
Ben Bucknall @ben_s_bucknall
423 Followers 335 Following Engineering DPhil @UniofOxford / Affiliate @aigioxford Formerly @GovAI_, @AISecurityInst
JessicaMarcus @VagP7tEoj6NtEWg
62 Followers 1K Following
Tobias Pulver @tobiaspulver
1K Followers 375 Following PhD student at the Center for Security Studies @ ETH Zurich, researching strategic tech competition in the 21st century
Pivotal Research @pivotal_org
275 Followers 91 Following Reducing global catastrophic risks from emerging technologies. https://t.co/q8PjILm8xN
Querthyp @QuerthypVVbR04
87 Followers 1K Following
Shoysesn @ShoysesngWI
24 Followers 747 Following
Lyrics Jacobs @luleyekox
27 Followers 296 Following
Vawtoresh @VawtoreshboIT
27 Followers 813 Following
Ceesay Youspha @youspha5029
31 Followers 572 Following
catherine ʕ•ᴥ•... @wilhelmscreamin
1K Followers 684 Following ai grantmaking @open_phil, views of someone i'm r-related to 👥
Jenny Waldmann @jennywaldmann
155 Followers 318 Following
Cas (Stephen Casper) @StephenLCasper
6K Followers 4K Following AI technical gov & risk management research. PhD student @MIT_CSAIL, fmr. @AISecurityInst. I'm on the CS faculty job market! https://t.co/r76TGxSVMb
Hocus Bogu @hocusbogu
135 Followers 1K Following
Sophia logan @logan_soph70507
476 Followers 5K Following My name is Sgt. Sophia Logan, from the United States of America, I am a military member currently in Libya,
Joe @newssspeak
142 Followers 4K Following
Victoria Krakovna @vkrakovna
10K Followers 504 Following Research scientist in AI alignment at Google DeepMind. Co-founder of Future of Life Institute @flixrisk. Views are my own and do not represent GDM or FLI.
PureLoveEvangelist(sm... @hamzaiandafirst
472 Followers 3K Following Sin was necessary; It's always in the moments just before death; top 10 functional health experts in the world epistemic superconnector
Liu Ying @Quinsealvaria
5 Followers 357 Following Occasionally funny. Author. Director. Press inquiries: don’t inquire.
Alex Vaughan @agvaughan
429 Followers 3K Following All opinions my own, and I'm as disappointed as you are. Biology and AI at @Meta @CajalNeuro, @Pymetrics, CSHL, Stanford
Hannes Mehrer @HannesMehrer
629 Followers 5K Following Comp Neuro postdoc at @martin_schrimpf 's lab at @EPFL_en
Andy Masley @AndyMasley
5K Followers 2K Following When the going gets weird the weird turn pro Director of EA DC
Everett Smith @DefenseBased
212 Followers 154 Following AI, national security. All our best work is private. Fellow at RAND. Georgetown IR.
Luca Righetti @lucafrighetti
1K Followers 252 Following AI risks are hard to study — we need more transparency and rigor. Senior Researcher @GovAI_, @METR_Evals. Podcast @HearThisIdea.
Todd O'Boyle @ttoboyle
1K Followers 1K Following AI policy @ JPMC, dad jokes at home, and an abiding love of guacamole everywhere and all times.
Ben Brake 🇺🇦 @Datenbrake
3K Followers 2K Following East Frisian | Go West | Berlin | Travel | Books | Cancer | Meat Loaf | @S04 | 👨❤️👨 @van_Koch 🏳️🌈 | #AntontheCat
Carla Hustedt @CarlaHustedt
4K Followers 2K Following Neo-generalist. Intersectional feminist. Technology optimist & critic. Director "Centre for Digital Society" @MercatorDE Co-Chair @EuropeanAIFund She/her
reframeTech @reframe_tech
3K Followers 156 Following Technology development must be oriented more strongly toward the common good. A project of @BertelsmannSt. Legal notice: https://t.co/UBTUh7AQid
Teresa Staiger @StaigerTeresa
190 Followers 574 Following working @reframe_tech on AI for the Common Good (the real one) and a strong civil society | B4 @BBE_Forum I Newsletter #Erlesenes I @staigerteresa.bsky.social
Stefan Heumann @St_Heumann
3K Followers 701 Following Managing Director @AgoraDT Ex @snv_berlin board @okfde
Richard Ngo @RichardMCNgo
62K Followers 2K Following studying AI and trust. ex @openai/@googledeepmind
Lajla Fetic @lajlafetic
891 Followers 619 Following Expert AI Governance & Digital Policy | #100BrilliantWomen in #AIEthics | Ex @reframe_Tech @snv_berlin | @thehertieschool @sciencespo | she/her
Sarah Bressan @bressa... @bressansar
4K Followers 4K Following Find me on other platforms. Researcher at GPPi Berlin on foresight, political violence, German and EU security.
Tyson Barker @tysonbarker
5K Followers 1K Following Writing about 🇺🇸 🇪🇺relations; economic statecraft, tech & foreign policy; and Ukraine.
Vivid Void @VividVoid_
60K Followers 1K Following I believe in everything. Nothing is sacred. I believe in nothing. Everything is sacred.
49security @49security
1K Followers 794 Following We curated ideas & recs for 🇩🇪’s National Security Strategy (out now) | Edited by @GPPi (2022-23) | in DE & ENG All articles and interviews remain online.
🙃 ɐʇǝɯ - Untru... @Untrulie
5K Followers 499 Following artist formerly known as metaLulie 🤡 all posts here are falsehoods, playing w/ fake frameworks, Qs w/ wrong assumptions, BS, or humour. ☕️ host #MorningTPOT 🫖
Anna Wang @a_nnawang
1K Followers 980 Following we are the brightest star. AGI safety & alignment @GoogleDeepMind prev @convergent_fros, cto and exited @copysmith_ai
David Shor @davidshor
79K Followers 4K Following Head of Data Science at Blue Rose Research, based in NYC, originally from Miami. I try to elect Democrats. Views are my own. he/him🌹
Jan Brauner @JanMBrauner
1K Followers 348 Following Technical staff member at EU AI Office, Previously: RAND, ML PhD at Oxford (@OATML_Oxford), and, once upon a time, medical doctor. Tweeting in private capacity.
chiara @chiaragerosa
128 Followers 267 Following tech, progress, norms, experience, community, society. but can't promise I'll post about any of that. https://t.co/IJIegRNO2p
Steve Newman @snewmanpv
3K Followers 74 Following Co-founder of Writely (aka Google Docs) and 7 other startups. Now at the Golden Gate Institute for AI, working to bring AI’s toughest questions into focus.
Matthew Dub @5matthewdub
2K Followers 977 Following Liberation, healing, nonduality. With: meditation, breathwork, IFS, the enneagram, and psychedelics. Husband, dad x2.
Lara Thurnherr @LaraThurnherr
512 Followers 1K Following Interested in current, past and future events. Working on AI governance.
sean @seanta___
1K Followers 1K Following in my feelings arc. ML compiler engineer @ google, previously Jane Street, previously technical director of https://t.co/KlnNTSK61X, etc
The Midas Project Wat... @SafetyChanges
1K Followers 1 Following We monitor AI safety policies and web content for unannounced changes. Anonymous submissions: https://t.co/5Ke9mIqh3e Run by @TheMidasProj
Johannes Hagemann @johannes_hage
8K Followers 2K Following co-founder/cto @PrimeIntellect | decentralized AI, longevity, techno-optimism
Andy Masley @AndyMasley
5K Followers 2K Following When the going gets weird the weird turn pro Director of EA DC
catherine ʕ•ᴥ•... @wilhelmscreamin
1K Followers 684 Following ai grantmaking @open_phil, views of someone i'm r-related to 👥
Norman Mu @TheNormanMu
2K Followers 801 Following
Jenny Waldmann @jennywaldmann
155 Followers 318 Following
John Bolton @AmbJohnBolton
852K Followers 1K Following Former Assistant to the President for National Security Affairs (NSA)
Cas (Stephen Casper) @StephenLCasper
6K Followers 4K Following AI technical gov & risk management research. PhD student @MIT_CSAIL, fmr. @AISecurityInst. I'm on the CS faculty job market! https://t.co/r76TGxSVMb
Zac Kenton @ZacKenton1
2K Followers 1K Following Research Scientist in AI safety at DeepMind. Views are my own and don't represent DeepMind.
Tejal Patwardhan @tejalpatwardhan
5K Followers 630 Following thinking hard about hard evals // research @openai
André Rieu @superrieu
1K Followers 708 Following Artificial Intelligence, Quantum Mechanics, Niklas Luhmann, George Spencer-Brown.
Alan Chan @_achan96_
1K Followers 1K Following Research Fellow @GovAI_ || AI governance || PhD from @Mila_quebec || 🇨🇦
Alex Vaughan @agvaughan
429 Followers 3K Following All opinions my own, and I'm as disappointed as you are. Biology and AI at @Meta @CajalNeuro, @Pymetrics, CSHL, Stanford
Flo Crivello @Altimor
44K Followers 1K Following Founder @getlindy. "Striving to remember the obvious over grasping the esoteric."
David Dohan @dmdohan
12K Followers 2K Following reducing perplexity @openai | past: probabilistic programs, proteins, science & reasoning @ google brain 🧠
Evan Hubinger @EvanHub
7K Followers 2K Following Head of Alignment Stress-Testing @AnthropicAI. Opinions my own. Previously: MIRI, OpenAI, Google, Yelp, Ripple. (he/him/his)
Everett Smith @DefenseBased
212 Followers 154 Following AI, national security. All our best work is private. Fellow at RAND. Georgetown IR.
Caspar Oesterheld @C_Oesterheld
208 Followers 183 Following PhD student @FOCAL_lab @CarnegieMellon with @conitzer.
Luca Righetti @lucafrighetti
1K Followers 252 Following AI risks are hard to study — we need more transparency and rigor. Senior Researcher @GovAI_, @METR_Evals. Podcast @HearThisIdea.
Sayash Kapoor @sayashk
10K Followers 2K Following CS PhD candidate @PrincetonCITP. I tweet about AI agents, AI evals, AI for science. AI as Normal Technology: https://t.co/5amOkqKDf2 Book: https://t.co/DabpkhNrcM
Charles Foster @CFGeek
3K Followers 490 Following Excels at reasoning & tool use🪄 Tensor-enjoyer 🧪 @METR_Evals. My COI policy is available under “Disclosures” at https://t.co/bihrMIUKJq
Samuel Hammond 🦉 @hamandcheese
29K Followers 2K Following Chief economist @joinFAI. Nonresident fellow @NiskanenCenter. Pluralist. 'The world is second best, at best.' | [email protected]
Christoph Winter @Christophkw
571 Followers 261 Following Assis Prof of Law & AI @Cambridge_Uni. Director @law_ai_.
Daria Zakharova @DariaZakharova9
268 Followers 265 Following PhD student @LSEphilosophy Philosophy of Mind and AI, consciousness, cognition, politics. Also do art + public philosophy projects in my free time
Kemi Badenoch @KemiBadenoch
325K Followers 316 Following Leader of the Conservative Party. MP for North West Essex.
Farzaneh @farzanehbad
2K Followers 3K Following Harlemite| Esfahani| Digital Medusa| Previous research adventures at @YaleLawSch @sppgatech & @hiig_berlin
Consistently Candid A... @FellowHominid
1K Followers 498 Following Just because you're paranoid doesn't mean they're not after you
Niloofar (✈️ ACL) @niloofar_mire
7K Followers 2K Following Niloofar Mireshghallah — incoming asst. prof @LTIatCMU @CMU_EPP, RS in @AIatMeta, postdoc @uwcse, Ph.D. @ucsd_cse, former @MSFTResearch -Privacy, ML, NLP
j⧉nus @repligate
58K Followers 2K Following ↬🔀🔀🔀🔀🔀🔀🔀🔀🔀🔀🔀→∞ ↬🔁🔁🔁🔁🔁🔁🔁🔁🔁🔁🔁→∞ ↬🔄🔄🔄🔄🦋🔄🔄🔄🔄👁️🔄→∞ ↬🔂🔂🔂🦋🔂🔂🔂🔂🔂🔂🔂→∞ ↬🔀🔀🦋🔀🔀🔀🔀🔀🔀🔀🔀→∞
TracingWoodgrains @tracewoodgrains
49K Followers 2K Following Storyteller. Pragmatist. Pursue excellence. Cofounder @CenterforEdProg. Eng/中文
Justin Bullock @JustinBullock14
1K Followers 1K Following VP of Policy for @americans4ri; Senior Fellow with Convergence Analysis; Advocate of Love, Intelligence, & Freedom
Daniel Kokotajlo @DKokotajlo
24K Followers 244 Following