Introducing Grok Code Fast 1, a speedy and economical reasoning model that excels at agentic coding.
Now available for free on GitHub Copilot, Cursor, Cline, Kilo Code, Roo Code, opencode, and Windsurf.
x.ai/news/grok-code…
Gemma 3 270M! Great to see another awesome, small open-weight LLM for local tinkering.
Here's a side-by-side comparison with Qwen3. Biggest surprise that it only has 4 attention heads! https://t.co/Iy7O0DsQGu
Introducing GLM-4.5V: a breakthrough in open-source visual reasoning
GLM-4.5V delivers state-of-the-art performance among open-source models in its size class, dominating across 41 benchmarks.
Built on the GLM-4.5-Air base model, GLM-4.5V inherits proven techniques from…
Most web agents still click around blindly because they never store real knowledge about page parts or user goals.
This work builds Web-CogReasoner, an agent that learns in 3 clear rounds: memorizing facts, grasping concepts, then practicing procedures, and thinks through that stack…
gpt-oss is out!
we made an open model that performs at the level of o4-mini and runs on a high-end laptop (WTF!!)
(and a smaller one that runs on a phone).
super proud of the team; big triumph of technology.
This paper didn’t go viral but it should have.
A tiny AI model called HRM just beat Claude 3.5 and Gemini.
It doesn’t even use tokens.
They said it was just a research preview.
But it might be the first real shot at AGI.
Here’s what really happened and why OpenAI should be…
🚀 Qwen3-30B-A3B Small Update: Smarter, faster, and local deployment-friendly.
✨ Key Enhancements:
✅ Enhanced reasoning, coding, and math skills
✅ Broader multilingual knowledge
✅ Improved long-context understanding (up to 256K tokens)
✅ Better alignment with user intent…
Introducing GLM-4.5 and GLM-4.5 Air: new flagship models designed to unify frontier reasoning, coding, and agentic capabilities.
GLM-4.5: 355B total / 32B active parameters
GLM-4.5-Air: 106B total / 12B active parameters
API Pricing (per 1M tokens):
GLM-4.5: $0.6 Input / $2.2…
🚨 AI models just invented better, novel AI models.
Chinese researchers fed all LLM research into a model and it discovered 106 novel AI model architectures that converge to lower loss with better benchmarks.
ASI-Arch is one of the coolest AI papers this year. En route to AGI.
Proud to introduce Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant RL algorithm that powers the large-scale RL training of the latest Qwen3 models (Instruct, Coder, Thinking) 🚀
📄 huggingface.co/papers/2507.18…
🚀 We’re excited to introduce Qwen3-235B-A22B-Thinking-2507 — our most advanced reasoning model yet!
Over the past 3 months, we’ve significantly scaled and enhanced the thinking capability of Qwen3, achieving:
✅ Improved performance in logical reasoning, math, science & coding…
Deep Research Agents with Test-Time Diffusion
Google keeps pushing on diffusion.
This time, they apply diffusion to deep research agents, specifically the report generation process.
It achieves a 69.1% win rate vs. OpenAI Deep Research on long-form research.
My notes:
>>> Qwen3-Coder is here! ✅
We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…
models like Kimi, DeepSeek and Qwen will cost the closed AI labs BILLIONS of dollars.
that's why nobody is talking about them.
despite these LLMs absolutely crushing all of the benchmarks.
Claude 4 Opus is literally *100x* more expensive than Kimi K2
yet both models have…
Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507!
After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing…
230k GPUs, including 30k GB200s, are operational for training Grok @xai in a single supercluster called Colossus 1 (inference is done by our cloud providers).
At Colossus 2, the first batch of 550k GB200s & GB300s, also for training, starts going online in a few weeks.
As Jensen…