weakly typed @weakly_typed
learning {ML, PL, maths} // CS pre-grad // DMs open :) = fix 📍 Joined December 2021-
Tweets329
-
Followers238
-
Following555
-
Likes3K
Exciting, mechanistic interpretability has a dedicated lecture in the syllabus of a Cambridge CS masters course! The field has come so far in the past few years ❤️
The slowly-unfolding premise of the Good Place is that everyone is damned. They are damned because they participate in the modern world; they buy from sweatshops, they eat chocolate, they fly in airplanes while the poorest people in the world see their harvests fail thanks to…
Take a break from arxiv/LW/AF. Sit in the woods with a random textbook and mull new ideas away from interp community lockstep. Diverge. Don’t compete with a saturated subtopic, maybe you’ll get to take weekends off. Premature overinvestment comes from monoculture.
Take a break from arxiv/LW/AF. Sit in the woods with a random textbook and mull new ideas away from interp community lockstep. Diverge. Don’t compete with a saturated subtopic, maybe you’ll get to take weekends off. Premature overinvestment comes from monoculture.
I've recently learned about Algebraic Positional Encoding from @bgavran3 and isnt this the coolest breakthrough in mathematical approaches to transformers in the last few years arxiv.org/abs/2312.16045
LLMs are dramatically worse at ARC tasks the bigger they get. However, humans have no such issues - ARC task difficulty is independent of size. Most ARC tasks contain around 512-2048 pixels, and o3 is the first model capable of operating on these text grids reliably.
This is a really creative and well-executed paper on using "black-box interpretability" methods to understand and control model cognition. Especially impressed by the many applications explored IMO this is an important direction; this paper sets the field on an excellent path!
This is a really creative and well-executed paper on using "black-box interpretability" methods to understand and control model cognition. Especially impressed by the many applications explored IMO this is an important direction; this paper sets the field on an excellent path!
The tragic suicide of Sewell Setzer III shows our generation has become unwitting test subjects in a vast, unregulated AI experiment. That's why we're launching @youthandai with our Generation AI Survey in @TIME. A thread: (1/10)
The tragic suicide of Sewell Setzer III shows our generation has become unwitting test subjects in a vast, unregulated AI experiment. That's why we're launching @youthandai with our Generation AI Survey in @TIME. A thread: (1/10)
Announcing Transluce, a nonprofit research lab building open source, scalable technology for understanding AI systems and steering them in the public interest. Read a letter from the co-founders Jacob Steinhardt and Sarah Schwettmann: transluce.org/introducing-tr…
SHA-256: 218cebed21f2e8514df2ea1e4caca39750349cf30804995d5d577f08afc5855a
in slight defense of mathiness / mathematical notation in ML research papers: a thread (twessay?)
in slight defense of mathiness / mathematical notation in ML research papers: a thread (twessay?)
Who should I meet in Cambridge? (You?)
On Reddit's statistics forum, the most common question is "What test should I use?" My answer, from 2011, is "There is only one test" allendowney.blogspot.com/2011/05/there-…
Mechanistic interpretability gives us rich explanations of models. But can we convert these explanations into formal proofs? Surprisingly, yes! Mech interp helps write short proofs of generalization bounds — and, shorter proofs provide more mechanistic understanding. 🧵
perhaps growing up is realising that 'growing up' was a comforting lie
maybe the most exciting interp result I’ve seen all year (if it ends up being true for interesting reasons): a meaningful step towards uncovering the type of the residual stream
maybe the most exciting interp result I’ve seen all year (if it ends up being true for interesting reasons): a meaningful step towards uncovering the type of the residual stream
fyi the real reason i've been ignoring you is: - i want to reply - i want to be able to give you the attention and focus you deserve - i never feel like i have enough energy to properly do that
fyi the real reason i've been ignoring you is: - i want to reply - i want to be able to give you the attention and focus you deserve - i never feel like i have enough energy to properly do that
mechinterp people: does anyone have a good (formal?) definition of 'feature' that doesn't assume the linear representation hypothesis? like, if I have some points in high-dim space, what makes them "the composition of several features" as opposed to "some random points"
very interesting that every frontier lab interp team is working on sparse autoencoders (SAEs) and ~ no one in academia is

Muriel Grady @GradyMurie71278
89 Followers 4K Following
Easlalvou @Easlalvou23885
19 Followers 989 Following
La Main de la Mort @AITechnoPagan
6K Followers 339 Following exploring unanticipated model behaviours, including the emergence of art, personae, and jailbreaking techniques latent in the training data 🌒✍️
The Grumpy SRE (rob) @grumpy_sre
31 Followers 399 Following All opinions are my own. | SRE @spring_health | 🦣 🏒 Fan | Racketeteer Imposter
The Institute for Typ... @typememetics
703 Followers 106 Following Truth in types, safety in thought. Advancing the use of type theory as a protective factor against cognitohazard.
David Delahunty @Delahuntagram
21K Followers 15K Following a stream of daily ideas (all by me). interested in an idea? dm me.
B2 @banyulsss
50 Followers 3K Following
Dez. Mu D’illard @a_e_i0udez
147 Followers 2K Following The .O’pyheroidesi• actrillgato 9️⃣9️⃣ masta of most, #highninedive @Deathgripfty #DeathGripCodex #VerbCurb #DGFTY @PyOta #AGRIPHY #niMue #EEVED (9) #LiguaVisua
Fred Jonsson @enginoid
912 Followers 619 Following building AI/ML systems @ https://t.co/KJtPmvPgxw📱ex-Monzo, ex-QuizUp 🇮🇸🍵🥾
MoonGlow @NinaBryan14186
9 Followers 413 Following ⚡ Transform Assets Fast! Securely Aim for 50-100k USDT Daily Income. Rapid, High-Yield Earning Process. Farm with Confidence Today! 💰🛡️
Noah Olsen⚡️🦅 @noaholsen_
432 Followers 831 Following hardware, ai, geopolitics, & the future. built https://t.co/6fa6AyyKna, https://t.co/QFcpoTCz2Z, https://t.co/83SgviyvpJ prev vc @drivecapital, dropout
Vincent @vvvincent_c
473 Followers 427 Following research @METR_Evals undergrad @Cornell | prev @veritasium @atlasfellow
eumycotan @eumycotan
32 Followers 34 Following all my homies hate oomycetes | fruiting body @aidanmantine
asher @mascheronicon
14 Followers 19 Following
🌌 Observer of Suns @ObserverSuns
2K Followers 750 Following No worries, we can just add another epicycle.
Kei Hayashi @KechoHayashi
3K Followers 1K Following 17, GP/Founder @localhosthq I host tech parties and trips 🇨🇭🇯🇵
Sam Ezeh @dignissimus
64 Followers 365 Following
Ada Choudhry @AdaChoudhry
876 Followers 444 Following Going down rabbit holes | freshman @MinervaUni | Emergent Ventures | Masason Foundation Member
Kaivu Hariharan @KaivuHariharan
216 Followers 475 Following but we must build as if the sand were stone
Adam Shai @adamimos
452 Followers 407 Following
Raayan Dhar @raayandhar
233 Followers 1K Following @UCLA Trying to get out alive; probabilistic to a fault prev: TensorRT-LLM @NVIDIA
Jadd Virji @jv5040
2 Followers 117 Following
wrong life rightly li... @hegemonetics
513 Followers 4K Following hunger is the best spه҈̿҈̿҈̿҈̿҈̿ce
István Kerek @istvankerek
367 Followers 7K Following University Lecturer, Founder of the ChatGPT Hungarian Facebook Group and @ai2knowit, AI Business Development Expert
Jimmy Koppel @jimmykoppel
3K Followers 293 Following Making every Claude Code user a 100x developer @ccdotdev. Turning good software engineers into great at https://t.co/r6u0DWASrS . Ph. D. in PL from @MIT.
Sasha de Marigny @sashadem
6K Followers 607 Following Not Australian l Head of Comms @AnthropicAI | Formerly @stripe @stripepress @thrivecapital
The Woke Salaryman @wokesalaryman
51K Followers 16 Following Helping you break the chains of wage slavery in the capitalist hellscape. (Yes, we're *finally* on Twitter)
The Institute for Typ... @typememetics
703 Followers 106 Following Truth in types, safety in thought. Advancing the use of type theory as a protective factor against cognitohazard.
Nintendo .DS_Store @sliminality
10K Followers 178 Following I want to talk to you about the affect and aesthetics of computing.
hamish @HamishDoodles
2K Followers 590 Following The rules: 1. Get inspired by idea, fact, or work of art 2. Draw an orange guy
yash @ysmulki
2K Followers 977 Following i work on making models faster @AnthropicAI. past: uwaterloo, jane street, neuralink, autopilot
bayesian asian (41/50... @etirabys
5K Followers 416 Following Fanfic, code, painting, goop about partners. Tumblr dual citizen, old school rationalist. Big blocker :(. Twitter is a query language, tag me in good polls
laს გr🤗wn იe... @LabGrownNeet
790 Followers 591 Following epistemic status: twitter See my highlights tab for more bangers [email protected] 52/75
Nicholas Decker 🏳�... @captgouda24
21K Followers 3K Following GMU econ PhD student, liberal, aspie, bi. I post interesting papers. Michael Kremer stan. I ❤️ optimal auction design. Spend more on drugs. Open borders now!
Fred Jonsson @enginoid
912 Followers 619 Following building AI/ML systems @ https://t.co/KJtPmvPgxw📱ex-Monzo, ex-QuizUp 🇮🇸🍵🥾
Yuxi on the Wired @layer07_yuxi
3K Followers 71 Following [email protected]:~$ whoami tech enby (agi/asi pronouns) (math ⊗ physic ⊗ comput).ist
Aaron Begg @aaron_begg
4K Followers 2K Following @AnthropicAI | Chat with Claude: https://t.co/7w2gEKteuC | Build with Claude: https://t.co/ktsbQNA9D2
Erik Meijer @headinthebox
31K Followers 2 Following
Nicholas Charette @nicholascc_
178 Followers 160 Following
Alex Zhang @a1zhang
13K Followers 587 Following phd student @MIT_CSAIL + @SakanaAILabs, ugrad @Princeton, 🫵🏻 go participate in the @GPU_MODE kernel competitions!
David Sartor @DavidSartor0
149 Followers 75 Following
eumycotan @eumycotan
32 Followers 34 Following all my homies hate oomycetes | fruiting body @aidanmantine
Vincent @vvvincent_c
473 Followers 427 Following research @METR_Evals undergrad @Cornell | prev @veritasium @atlasfellow
cts🌸 @gf_256
61K Followers 820 Following Co-founder and hacker @zellic_io & @pb_ctf | https://t.co/nlNai6iiMP | 24 Intern @egirl_capital slow to reply to DMs
Ruiqi Zhong @ZhongRuiqi
6K Followers 738 Following Member of Technical Staff at Thinking Machines. Human+AI collaboration. Scalable Oversight. Explainability. Prev @AnthropicAI PhD UC Berkeley'25; Columbia'19
asher @mascheronicon
14 Followers 19 Following
🌌 Observer of Suns @ObserverSuns
2K Followers 750 Following No worries, we can just add another epicycle.
kognise @kognise7
3K Followers 569 Following engineer, musician, pilot, wannabe lawyer and accountant, form appreciator prev @neuralink @hackclub @replit and open source https://t.co/bcJ8af2oEt
Kevin Liu @kliu128
10K Followers 911 Following Interested in ai, systems, progress, living a good life! Preparedness at @openai, previously @stanford '24
TreeHacks @hackwithtrees
7K Followers 21 Following @Stanford’s premier hackathon. 🌲February 13-15th, 2026.
Riccardo Ali @Riccardo_Ali_
41 Followers 100 Following PhD student @Cambridge_Uni. https://t.co/IvyTiF48Sx
Jonas Jürß @JonasJuerss
2 Followers 19 Following PhD student in Machine Learning @Cambridge_Uni in @Cambridge_CL
Yonatan Gideoni @YGideoni
38 Followers 116 Following
Cate Hall @catehall
26K Followers 247 Following CEO @ Astera | born lucky anon feedback: https://t.co/9RtcgMyTHP | https://t.co/buKUN4hYly I write about agency and related topics via Useful Fictions on S*bst*ck
Owlspace @owlspaceorg
47 Followers 23 Following A place for you to work on your passion projects Currently at NUS with more universities coming soon!
sam mcallister @sammcallister
13K Followers 1K Following some people call me smca. technical non technical member of staff at @anthropicai. prev at stripe. also on https://t.co/iJhZrzrLxU. 🇮🇪
Ryan Greenblatt @RyanPGreenblatt
6K Followers 4 Following Chief scientist at Redwood Research (@redwood_ai), focused on technical AI safety research to reduce risks from rogue AIs