1/ 🕵️ Algorithm discovery could lead to huge AI breakthroughs! But what is the best way to learn or discover new algorithms?
I'm so excited to share our brand new @RL_Conference paper which takes a step towards answering this! 🧵
MASSIVE claim in this paper.
AI Architectural breakthroughs can be scaled computationally, transforming research progress from a human-limited to a computation-scalable process.
So it turns architecture discovery into a compute‑bound process, opening a path to…
Becoming an RL diehard in the past year and thinking about RL for most of my waking hours inadvertently taught me an important lesson about how to live my own life.
One of the big concepts in RL is that you always want to be “on-policy”: instead of mimicking other people’s…
1/
🚨Another New Paper Drop! 🚨 “Hierarchy or Heterarchy? A Theory of Long-Range Connections for the Sensorimotor Brain”
👇 Dive into the full thread 🧵
arxiv.org/abs/2507.05888
🚨 New Paper Drop 🚨
We’ve have released "Thousand-Brains Systems: Sensorimotor Intelligence for Rapid, Robust Learning & Inference", the first working implementation of a thousand-brains system, code-named Monty.
👇 Dive into the full thread 🧵
arxiv.org/abs/2507.04494
Thrilled to introduce Foundation Model Self-Play, led by @_aadharna. FMSPs combine the intelligence & code generation of foundation models with the curriculum of self-play & principles of open-endedness to explore diverse strategies in multi-agent games, like the one below 🧵👇
We don’t have AI self-improves yet, and when we do it will be a game-changer. With more wisdom now compared to the GPT-4 days, it's obvious that it will not be a “fast takeoff”, but rather extremely gradual across many years, probably a decade.
The first thing to know is that…
AI just learned to fine-tune itself between questions.
MIT introduces SEAL, a framework enabling LLMs to self-edit and update their weights via reinforcement learning, all by itself.
LLMs consume whatever data they are given, so they stay frozen after pretraining.
SEAL teaches…
wrote a new post, the gentle singularity.
realized it may be the last one like this i write with no AI help at all.
(proud to have written "From a relativistic perspective, the singularity happens bit by bit, and the merge happens slowly" the old-fashioned way)
Introducing The Darwin Gödel Machine: AI that improves itself by rewriting its own code
sakana.ai/dgm
The Darwin Gödel Machine (DGM) is a self-improving agent that can modify its own code. Inspired by evolution, we maintain an expanding lineage of agent variants,…
530 Followers 76 FollowingHuman Large Language model. Skills:
Distill data.
Training LLMs.
Test and Evaluate.
Rinse and repeat as required.
Based in SEA.
833 Followers 4K FollowingIntersection of creativity and logic. Exploring tools for thought, at the interface of human and machine. Building infinite mindmap app @_buildspace gaudmire
2K Followers 3K Followinganti-disciplinary researcher @Stanford 🗺️ · ai for science @universe_tbd · co-creating the future with starry humans · eu sou a mesma #colectiv
11K Followers 975 FollowingJan is an open source ChatGPT-alternative that runs 100% offline. Built by @menloresearch. Community: https://t.co/gXXor3poY5
530 Followers 76 FollowingHuman Large Language model. Skills:
Distill data.
Training LLMs.
Test and Evaluate.
Rinse and repeat as required.
Based in SEA.
29K Followers 431 FollowingProfessor, CS, U. British Columbia. CIFAR AI Chair, Vector Institute. Sr. Advisor, DeepMind | ML, AI, deep RL, deep learning, AI-Generating Algorithms (AI-GAs)
781 Followers 287 FollowingReverse engineering the neocortex 🧠 to revolutionize AI 🤖. An open-source initiative backed by Jeff Hawkins and The Gates Foundation.
31K Followers 669 FollowingVP Research, Google DeepMind, ex-head of Google Brain. Professor at University of Cambridge. Machine Learning Researcher. ex-Chief Scientist & VP of AI, Uber.
2K Followers 935 FollowingPh.D. student @LTIatCMU and intern at @AIatMeta (FAIR) working on (V)LM Evaluation & Systems that SeIf-Improve | Prev: @kaist_ai @yonsei_u
5K Followers 828 FollowingPostdoc @LTIatCMU. PhD from Ohio State @osunlp. Author of MMMU, MAmmoTH. Training & evaluating foundation models. Opinions are my own.
4K Followers 2K FollowingResearcher at @MSFTResearch. Prev: PhD at @Mila_Quebec, intern at @Apple MLR and FAIR Labs @MetaAI, math undergraduate at @PKU1898.
26K Followers 876 FollowingResearch Scientist Director in Meta FAIR. Reasoning, Optimization and Understanding LLM. Novelist in spare time. PhD in @CMU_Robotics.
19K Followers 1K FollowingAgents @Meta MSL TBD Lab. previously posttraining research @OpenAI train LLMs to do things: deep research, chatgpt agent, etc. CS PhD @LTIatCMU
833 Followers 4K FollowingIntersection of creativity and logic. Exploring tools for thought, at the interface of human and machine. Building infinite mindmap app @_buildspace gaudmire