Daniel Pirs @axldpi

Stuttgart, Germany Joined September 2013

Tweets: 478
Followers: 31
Following: 132
Likes: 4K

Andrej Karpathy @karpathy

2 years ago

# On the "hallucination problem" I always struggle a bit when I'm asked about the "hallucination problem" in LLMs. Because, in some sense, hallucination is all LLMs do. They are dream machines. We direct their dreams with prompts. The prompts start the dream, and based on the…

715 3K 15K 2.3M 5K

Hamel Husain @HamelHusain

a month ago

TOC for the open book "Beyond Naive RAG: Practical Advanced Methods" from our RAG series. This condenses 5 hours of instruction into something you can read in ~30 minutes. Link: maven.com/p/945082/beyon… @bclavie @beirmug @orionweller @antoine_chaffin @BEBischof

12 93 792 434K 1K

Shreya Shankar @sh_reya

a month ago

One of the things we do in the beginning of the evals course is show students real system prompts. Most are mind boggled & ask why these prompts are so long, how did OpenAI/Gemini/Claude/Manus/etc arrive at these prompts, etc. It would be awesome to have an answer. Currently we…

Simon Willison @simonw

a month ago

26 46 902 82K 192

9 13 208 32K 145

Simon Willison @simonw

3 months ago

I like this take by @KentBeck on how AI-assisted programming changes the balance of which skills are most important

18 155 1K 138K 770

Kat Woods ⏸️ 🔶 @Kat__Woods

3 months ago

I hate it when people just read the titles of papers and think they understand the results. The "Illusion of Thinking" paper does 𝘯𝘰𝘵 say LLMs don't reason. It says current “large reasoning models” (LRMs) 𝘥𝘰 reason—just not with 100% accuracy, and not on very hard…

69 62 823 76K 252

Martin Nebelong @MartinNebelong

3 months ago

Generating 360 footage in Veo3 and watching it on my VR headset feels like scifi 😍 Just add (360° to your prompt)

112 209 2K 174K 1K

langfuse.com @langfuse

3 months ago

Biggest Langfuse update yet: We're open sourcing ALL product features under the MIT license! ✅ LLM-as-a-Judge Evaluations ✅ Annotation Queues ✅ Prompt Experiments ✅ Playground ✅ And more... We wrote a bit about why we are making this change on our blog 👇

12 51 234 47K 90

Jonathan Gorard @getjonwithit

3 months ago

Calling c the "speed of light" completely misses the point. Rather, c is the "spacetime exchange rate": how many units of space you can exchange for one unit of time. In actuality, everything travels at the "speed of light", just not necessarily through space alone... (1/4)

675 2K 20K 1.7M 11K

Jack Morris @jxmnop

4 months ago

excited to finally share on arxiv what we've known for a while now: All Embedding Models Learn The Same Thing. embeddings from different models are SO similar that we can map between them based on structure alone, without *any* paired data. feels like magic, but it's real: 🧵

Jack Morris @jxmnop

6 months ago

35 26 967 871K 288

125 619 6K 904K 5K

Simon Willison @simonw

4 months ago

Here's the full workshop handout plus annotated slides from "Building software on top of Large Language Models", a three hour tutorial I presented yesterday at PyCon US #PyConUS simonwillison.net/2025/May/15/bu…

13 104 666 42K 777

Shreya Shankar @sh_reya

4 months ago

This is a really good question. In my experience, domain experts' reluctance to write prompts boils down to - not knowing how to write a good prompt in the first place (it's not necessarily as simple as instructing a human expert/coworker to do the task) - not having the…

verrsane @verrsane

4 months ago

22 0 48 21K 13

3 20 147 20K 152

Simon Willison @simonw

4 months ago

A feature I would love to see from every single hosted API vendor is some kind of special case where if you prompt "what model ID are you?" it replies with a definitely-not-hallucinated stable version identifier (If model vendors are going to start switching date-based aliases…

Simon Willison @simonw

4 months ago

5 3 60 39K 7

24 15 310 35K 33

Benjamin Lang (eu/acc 🇪🇺) @defyconstraints

4 months ago

@daveg Client-side front ends to a new compute paradigm

1 1 2 52 0

David Galbraith @daveg

4 months ago

The most significant events during my working lifetime were October 11 1993 (beta of Mosaic web browser for Mac released) and November 30th 2022 (ChatGPT released).

1 1 6 895 0

Ethan Mollick @emollick

4 months ago

One thing the GPT-4o personality issue demonstrates is that treating AI like every other online product by maximizing for engagement & likeability will have unintended consequences that could cause real problems, both for the usefulness of the models & for the people using them,

45 62 892 53K 110

Simon Willison @simonw

4 months ago

Exploring Promptfoo via Dave Guarino’s SNAP evals simonwillison.net/2025/Apr/24/ex…

5 5 31 7K 25

John Carmack @ID_AA_Carmack

5 months ago

@rubyrangerr I think you are misunderstanding what this tech demo actually is, but I will engage with what I think your gripe is — AI tooling trivializing the skillsets of programmers, artists, and designers. My first games involved hand assembling machine code and turning graph paper…

207 1K 8K 1.2M 2K

BURKOV @burkov

5 months ago

Many people, myself included, didn't try to build a product around a language model because during the time you would work on a business-specific dataset, a larger generalist model will be released that will be as good for your business tasks as your smaller specialized model.…

28 65 624 41K 451

Dimitri @flowmitry

5 months ago

Today I tested 6 prompt management systems to store prompts in one place and update them without changing my product's source code. Only 1/6 functioned smoothly and didn't have bugs on my journey. Kudos to the @langfuse team; it is looking promising!

1 3 4 844 0

Simon Willison @simonw

5 months ago

I really wish @OpenAI would give the new image generation feature in GPT-4o a usable name. Are we really expected to say "I made this using GPT-4o image generation"? (That's also pretty unclear given that GPT-4o in ChatGPT used to be able to generate images using DALL-E instead)