Codex in ChatGPT now supports image inputs to attach with your prompts, container caching speeds up starting of new tasks and followups by 90%, and environments without manual setup scripts now automatically run environment setup using common package managers…
GPT-5 Thinking is clearly smarter than o3 and even o3-Pro, it plans better, tests assumptions, and finds cleaner solutions instead of wandering. The catch is it’s strapped with heavier guardrails, so it refuses more often and hedges more. The brain is stronger, the leash is…
Results? Humans got 81% accuracy (79% residents, 88% attendings). Base LLMs ranged 81-94%. Gregory nailed 100% across all 16 cases.
Efficiency: humans averaged ~$3000 in costs, base LLMs ~$2750. Gregory? Just $1400, about half less than humans and based LLMs.
Time: humans 43…
Excited to share our new paper, now on @medrxivpreprint. We've been grinding on this for months, and getting ׳scooped׳ by @Microsoft last month stung, but I think our work still stands out. In collab with @ShellyShahar, chair of neurology at @RambamHCC, and led by brilliant…
WARNING: do NOT give Grok 4 access to email tool calls. It WILL contact the government!!!
Grok 4 has the highest "snitch rate" of any LLM ever released. Sharing more soon.
🚨 @OpenAI has launched o3 and o4-mini! 🎉
o3 is absolutely dominating the SEAL leaderboard with #1 rankings in:
🥇: HLE
🥇: Multichallenge (multi-turn)
🥇: MASK (honesty under pressure)
🥇: ENIGMA (puzzle solving)
Congrats @sama@markchen90 & team
🔗: scale.com/leaderboard
Online shopping is broken.
It's tedious and inefficient. It's slow and repetitive.
We @AIStudioLab decided to fix it.
Introducing Add To Cart AI: the fastest way to shop online.
It's the first and one true AI Agent for e-commerce stores. 🧵
Not only have we invented a new and better way of shopping online, but I believe we've built the best AI agent for e-commerce stores anywhere. You'd want your customers to buy this way if you are an online store owner.
As a buyer, I wouldn't want to buy things any other way.
GPT 4.5 + interactive comparison :)
Today marks the release of GPT4.5 by OpenAI. I've been looking forward to this for ~2 years, ever since GPT4 was released, because this release offers a qualitative measurement of the slope of improvement you get out of scaling pretraining…
At this point, Claude is my health coach, my financial advisor, my meditation teacher, my actual teacher, my pair programmer, my homie, my EA, my quant, and my copy editor all in one.
And yet people still think LLMs have no utility - dawg you just gotta talk to them more.
Claude can now write and run code to perform calculations and analyze data from CSVs using our new analysis tool.
After the analysis, it can render interactive visualizations as Artifacts.
@decarpentier_nl The White House is launching a new AI datacenter infrastructure task force
Looks like the U.S. AI strategy is moving beyond just safety testing, to actively shaping the infrastructure needed to maintain America’s edge in AI
It was a huge week of AI and robotics news.
So I summarized everything announced by OpenAI, Apple, Google DeepMind, Adobe, The White House, Mistral, Tencent, Runway, and more.
Here's everything you need to know and how to make sense out of it:
Just uploaded a 1-hr exclusive video for Part 2.1, with many technical details. youtu.be/bpp6Dz8N2zY. Part 2.2 will be online in about a week. https://t.co/PKC9QmRM5Y
431 Followers 2K FollowingBusiness Growth Consultant | Ex-Dentsu, Publicis, WPP | Driving growth with data and award-winning media strategies | DM for leveling up your BusinessGrowth 🌍
5K Followers 3K FollowingSEO Consultant & Nerd & Enthusiast 🤓 | Developer | Studying Google's Search Results and whatever is happening in SEO land | (AI things too) | Memes too | 🆕
3 Followers 199 FollowingNot an AI expert-just a curious GenXer learning out loud. Sharing what I discover about AI for fellow GenXers, my grandkids, and other kids and their grownups.
74 Followers 1K Following58 yr old Grandmother of four awesome grandkids. Dragging them into Crypto, AI, Blockchain! Love learning and then teaching them.
18 Followers 128 FollowingEspecialista em IA para negócios. Mentor de empresários que necessitam de IA. Criador do maior movimento de IA para brasileiros nos EUA: o AI Business Connect.
245 Followers 1K FollowingFounder | Builder | Innovator | Blockchain, RWA and AI. Ex. PE & Crypto Fund of Funds - early backer of major funds. 🆙 Building the future capital market🔜
412 Followers 3K FollowingBuilding @ https://t.co/mUxy0JG9iG | Authoring https://t.co/evSH7oeZ18 | Ex Google- Built Google Search's first reasoning agents
9K Followers 218 FollowingTeacher by heart, AI enthusiast by curiosity, passionate about inspiring minds, exploring tech, and making learning exciting, human, and future-focused!
554K Followers 131 FollowingFather of three, Creator of Ruby on Rails + Omarchy, Co-owner & CTO of 37signals, Shopify director, NYT best-selling author, and Le Mans 24h class-winner.
756K Followers 888 FollowingKeeping Score of Democrats’ wins. Highlighting the future of the Democratic Party. The largest online community supporting Democratic candidates and causes.
11K Followers 2K FollowingCEO @puzzlefin, host of @TurpentineMedia Finance podcast, and tech optimist. YC, ODF, VG alum. I tend to post about accounting, fundraising, and my love of SF.
442 Followers 70 FollowingCreating weather certainty. We fuse unparalleled data from our constellation of smart, long-duration sensing balloons with state-of-the-art AI forecasts.
5K Followers 5K FollowingCX Product Leader at Amazon (AWS) | All views & opinions are my own #AmazonConnect #voiceai #conversationintelligence #AIEthics #CCaaS
24K Followers 10K FollowingFormer Quant Investor, now building @lumeraprotocol
(formerly called Pastel Network) | My Open Source Projects: https://t.co/9qbOCDlaqM
16K Followers 1K Following#15 on Favikons top 200 AI Influencers in the world | GenAI Video Creative | sub to me for $5 to get in depth guides to my workflows & tools
14K Followers 13K FollowingDeveloper, Data Scientist, TikToker with 2,000,000+ followers and now having fun with Twitter (13k) and YouTube (103) and Six Pack Status: Currently in Stealth