The proprietary frontier models of today are ephemeral artifacts. Essentially very expensive sandcastles. Destined to be washed away by the rising tide of open source replication (first) and algorithmic disruption (later).
New blog post about asymmetry of verification and "verifier's law": jasonwei.net/blog/asymmetry…
Asymmetry of verification–the idea that some tasks are much easier to verify than to solve–is becoming an important idea as we have RL that finally works generally.
Great examples of…
New Anthropic research: Persona vectors.
Language models sometimes go haywire and slip into weird and unsettling personas. Why? In a new paper, we find “persona vectors"—neural activity patterns controlling traits like evil, sycophancy, or hallucination.
Inference-Time Scaling and Collective Intelligence for Frontier AI
sakana.ai/ab-mcts/
We developed AB-MCTS, a new inference-time scaling algorithm that enables multiple frontier AI models to cooperate, achieving promising initial results on the ARC-AGI-2 benchmark.…
Is LLM use finally making me less capable?
I started using LLMs three years ago for text and code gen. Now, I use several of them, for a ton more things.
In fact, I feel like I use them for a huge fraction of the cognitive tasks that I perform that can be described in text.…
Truly exciting achievements - current frontier AI models would be probably considered AGI 10 years ago, but AI goalposts always keep moving, and critics always downplay the achievements and emphasize imperfections (same old, same old :)
Truly exciting achievements - current frontier AI models would be probably considered AGI 10 years ago, but AI goalposts always keep moving, and critics always downplay the achievements and emphasize imperfections (same old, same old :)
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
After iterating hundreds of prompts to trigger blackmail in Claude, I was shocked to see these prompts elicit blackmail in every other frontier model too.
We identified two distinct factors that are each sufficient to cause agentic misalignment:
1. The developers and the agent…
After iterating hundreds of prompts to trigger blackmail in Claude, I was shocked to see these prompts elicit blackmail in every other frontier model too.
We identified two distinct factors that are each sufficient to cause agentic misalignment:
1. The developers and the agent…
There are traditionally two types of research: problem-driven research and method-driven research. As we’ve seen with large language models and now AlphaEvolve, it should be very clear now that total method-driven research is a huge opportunity.
Problem-driven research is nice…
Introducing The Darwin Gödel Machine: AI that improves itself by rewriting its own code
sakana.ai/dgm
The Darwin Gödel Machine (DGM) is a self-improving agent that can modify its own code. Inspired by evolution, we maintain an expanding lineage of agent variants,…
0 Followers 149 FollowingThis steady daily income has doubled my wealth. I easily make $200 a day.
Login: black89
Password: bk2882
Website : https://t.co/zRDcNxcOWs
79 Followers 382 FollowingBuilding in AI Safety
Prev: ML research @Leap_Labs, deception evals @LASRlabs, comp neuroscience @salkinstitute, engineer @RocketLab
4K Followers 8K FollowingA UK-based campaign group that works to regulate and achieve a moratorium on AI to protect humans, whoever and wherever they are. 🔌
144 Followers 2K FollowingNous sommes une agence de solutions digitales innovante . Nous accompagnons les entreprises et particuliers dans leur transformations digitales.
1K Followers 6K FollowingDeveloping 100-400 MW powered land sites in Estonia, the EU's tech unicorn hub: 0 % corp tax, competitive power, fast TSO links — live power Apr 2026
2K Followers 319 Following🇫🇷 1ère communauté Hytale de France, toutes les news du jeu sont ici ! 🇫🇷 ---- Notre Discord https://t.co/6A5OgvrhcX ---- Non affilié à @Hytale
201 Followers 180 FollowingA PhD in multi-agent RL at ETH Zurich and a chess enthusiast (2585 Elo @Chesscom) who developed an LM @GoogleDeepMind capable of playing the game (3200 Elo).
79 Followers 382 FollowingBuilding in AI Safety
Prev: ML research @Leap_Labs, deception evals @LASRlabs, comp neuroscience @salkinstitute, engineer @RocketLab
2K Followers 739 Followingteaching robots to see by day, learning from nature by night. in search of elegant solutions to the metaproblem. infinitely curious.
9K Followers 102 FollowingMember of Technical Staff at Anthropic AlphaGo, AlphaZero, MuZero, AlphaCode, AlphaTensor, AlphaProof Gemini RL Prev Principal Research Engineer at DeepMind
355 Followers 384 FollowingPhD student @UBC_CS. Interested in reinforcement learning, generative models, open-endedness, and the intersection of games and machine learning.
514 Followers 1K FollowingAdvisor @80000Hours /errors, opinions, shitakes 🍄 here are my own
💁🏾♂️🙋🏼♀️Apply! https://t.co/s8PBT1pUi8
🔸Help! https://t.co/8Gibe0FpMf
134K Followers 1K FollowingTeaching Self-Care of what, why, and how to care for yourself when medical system pharma isn’t resolving personal health issues | 100%Naturals Cofounder
802 Followers 808 FollowingPhD-ing @cuhksz. Open-endedness, Bayesian optimization, foundation models. Building AI systems to automate algorithm discovery & engineering design.
5K Followers 7 FollowingInteractive AI explainers.
Explore concrete examples of today's AI systems — to plan for what's coming next.
A project of @sage_future_
47 Followers 194 FollowingAn official account for the COLM 2025 Workshop on LLM for Scientific Discovery: Reasoning, Assistance, and Collaboration (LM4SCI)
54K Followers 12 FollowingBuild and share machine learning apps in 3 lines of Python. Part of the @Huggingface family 🤗.
DMs are open for sharing your gradio app with us for promotion!