Geoffrey Angus @GeoffreyAngus

Building stuff. Formerly @Google, @Stanford.
San Francisco, CA
Joined November 2015

Tweets: 138
Followers: 191
Following: 353
Likes: 6K

shreya rajpal @ShreyaR

3 weeks ago

Introducing ❄️ @snowglobe_so, the simulation engine for AI chatbots. Magically simulate the behavior of your users to test and improve your chatbots. Find failures before your users do.

118 85 1K 543K 630

will brown @willccbb

2 months ago

can't stop thinking about this one. insanely elegant, seems insanely powerful

26 57 848 100K 945

Announcing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models. Built in…

9 81 496 263K 329

Pierce Freeman @piercefreeman

2 months ago

Text diffusion models might be the most unintuitive architecture around. Like: let's start randomly filling in words in a paragraph and iterate enough times to get something sensible. But now that google's gemini diffusion is near sota, I think we need to take them seriously.

2 3 5 644 4

Dylan Patel @dylan522p

2 months ago

The Nvidia Tensor Core is the most important evolution of computer architecture in the last decade. We explain why / how it's evolved. Shout out to collaborators @bfspector @tri_dao @colfaxintl @charles_irl @ia_buck Neil Movva Jonah Alben, esp @simonguozirui for the cutest cover pic.

SemiAnalysis @SemiAnalysis_

2 months ago

4 31 244 83K 145

8 23 324 49K 134

carsonfarmer @carsonfarmer

3 months ago

This looks super cool. Our own research team was exploring similar ideas for building an internal corpus of context for our content generation tasks. Now we just got a huge head start on it!

Sabri Eyuboglu @EyubogluSabri

3 months ago

13 73 306 66K 223

16 3 20 1K 0

James Zou @james_y_zou

3 months ago

Excited to introduce #CollabLLM -- a method to train LLMs to collaborate better w/ humans! Selected as #icml2025 oral (top 1%)🏅 New multi-turn training objective + user simulator👇

Shirley Wu @ShirleyYXWu

3 months ago

8 62 203 63K 103

6 11 52 7K 26

Jordan Juravsky @jordanjuravsky

3 months ago

Cartridges, powered by Tokasaurus! 🤝⚡️🦖

Geoffrey Angus @GeoffreyAngus

3 months ago

1 11 44 6K 11

0 3 11 1K 0

Sabri Eyuboglu @EyubogluSabri

3 months ago

An advantage of training a cache/prefix (as opposed to a LoRA adapter) is that we can serve per-user cartridges using the same optimizations and kernels that inference engines already use for per-user KV caches. @GeoffreyAngus just integrated cartridges into Tokasaurus (a…

Geoffrey Angus @GeoffreyAngus

3 months ago

1 11 44 6K 11

1 5 18 2K 4

Geoffrey Angus @GeoffreyAngus

3 months ago

what is happening

0 0 0 85 0

Geoffrey Angus @GeoffreyAngus

3 months ago

wandb pls

1 2 8 658 0

Vipul Ved Prakash @vipulved

3 months ago

.@togethercompute API has the fastest DeepSeek v3 endpoint (2x faster than next best API endpoint) and almost 5x faster than DeepSeek API. See how to use it directly with @cline to make all your Cline workflows snappier!

Together AI @togethercompute

3 months ago

3 4 20 15K 5

1 2 7 5K 0

Sabri Eyuboglu @EyubogluSabri

3 months ago

When we put lots of text (eg a code repo) into LLM context, cost soars b/c of the KV cache’s size. What if we trained a smaller KV cache for our documents offline? Using a test-time training recipe we call self-study, we find that this can reduce cache memory on avg 39x…
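
To see why long contexts get expensive, here is a rough back-of-the-envelope sketch of KV cache size in Python. The formula (keys plus values, per layer, per KV head, per token) is standard for transformer decoders. The model shape and the 100,000-token context are hypothetical numbers picked for illustration, not figures from the tweet, and `kv_cache_bytes` is a made-up helper name. Only the 39x factor comes from the tweet, where it is an average.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, bytes_per_elem=2):
    """Bytes needed to cache keys and values for one sequence."""
    # Factor of 2: one tensor for keys and one for values in every layer.
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * bytes_per_elem


# Hypothetical model shape (assumption): 32 layers, 8 KV heads,
# head dimension 128, 16-bit values, 100,000 tokens of context.
full = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128, seq_len=100_000)

# The tweet reports an average 39x reduction in cache memory.
compressed = full / 39

print(f"full cache: {full / 2**30:.2f} GiB")
print(f"39x smaller: {compressed / 2**20:.0f} MiB")
```

The cache grows linearly with context length, so every document kept in context is paid for per user and per request. A smaller cache trained offline would shrink that recurring cost.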