mk @mankit_ye62017

CS/AI undergraduate student,AI AGENT,18 & Learning & Growing & Love coding Joined December 2024

Tweets

74
Followers

19
Following

187
Likes

59

xAI @xai

a week ago

Introducing Grok Code Fast 1, a speedy and economical reasoning model that excels at agentic coding. Now available for free on GitHub Copilot, Cursor, Cline, Kilo Code, Roo Code, opencode, and Windsurf. x.ai/news/grok-code…

869 2K 15K 18.5M 3K

Sebastian Raschka @rasbt

3 weeks ago

Gemma 3 270M! Great to see another awesome, small open-weight LLM for local tinkering. Here's a side-by-side comparison with Qwen3. Biggest surprise that it only has 4 attention heads!

Omar Sanseviero @osanseviero

3 weeks ago

Gemma 3 270M! Great to see another awesome, small open-weight LLM for local tinkering. Here's a side-by-side comparison with Qwen3. Biggest surprise that it only has 4 attention heads! https://t.co/Iy7O0DsQGu

124 334 3K 717K 1K

Download Image

26 200 1K 465K 789

Download Image

Z.ai @Zai_org

4 weeks ago

Introducing GLM-4.5V: a breakthrough in open-source visual reasoning GLM-4.5V delivers state-of-the-art performance among open-source models in its size class, dominating across 41 benchmarks. Built on the GLM-4.5-Air base model, GLM-4.5V inherits proven techniques from…

117 353 2K 291K 749

Download Image

Rohan Paul @rohanpaul_ai

a month ago

Most web agents still click around blindly because they never store real knowledge about page parts or user goals. This work builds Web‑CogReasoner, an agent that learns in 3 clear rounds, memorize facts, grasp concepts, then practice procedures, and thinks through that stack…

2 28 171 14K 172

Download Image

OpenAI @OpenAI

a month ago

GPT-5 is here. Rolling out to everyone starting today. openai.com/gpt-5/

4K 7K 33K 4.6M 4K

Download Video

Sam Altman @sama

a month ago

gpt-oss is out! we made an open model that performs at the level of o4-mini and runs on a high-end laptop (WTF!!) (and a smaller one that runs on a phone). super proud of the team; big triumph of technology.

2K 4K 46K 4.2M 8K

Anthropic @AnthropicAI

a month ago

Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning.

622 1K 10K 4.1M 1K

Download Image

Shruti @heyshrutimishra

a month ago

This paper didn’t go viral but it should have. A tiny AI model called HRM just beat Claude 3.5 and Gemini. It doesn’t even use tokens. They said it was just a research preview. But it might be the first real shot at AGI. Here’s what really happened and why OpenAI should be…

351 1K 10K 1.3M 12K

Download Image

Poonam Soni @CodeByPoonam

a month ago

Microsoft just dropped a study showing the 40 jobs most at risk by AI and the 40 most secure.

1K 5K 28K 5.0M 30K

Download Image

Qwen @Alibaba_Qwen

a month ago

🚀 Qwen3-30B-A3B Small Update: Smarter, faster, and local deployment-friendly. ✨ Key Enhancements: ✅ Enhanced reasoning, coding, and math skills ✅ Broader multilingual knowledge ✅ Improved long-context understanding (up to 256K tokens) ✅ Better alignment with user intent…

83 275 2K 384K 575

Download Image

Z.ai @Zai_org

a month ago

Introducing GLM-4.5 and GLM-4.5 Air: new flagship models designed to unify frontier reasoning, coding, and agentic capabilities. GLM-4.5: 355B total / 32B active parameters GLM-4.5-Air: 106B total / 12B active parameters API Pricing (per 1M tokens): GLM-4.5: $0.6 Input / $2.2…

265 647 3K 1.2M 1K

Download Image

Deedy @deedydas

a month ago

🚨 AI models just invented better, novel AI models. Chinese researchers fed all LLM research into a model and it discovered 106 novel AI model architectures that converge to lower loss with better benchmarks. ASI-Arch is one of the coolest AI papers this year. En route AGI.

87 462 2K 175K 1K

Download Image

Chujie Zheng @ChujieZheng

a month ago

Proud to introduce Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant RL algorithm that powers the large-scale RL training of the latest Qwen3 models (Instruct, Coder, Thinking) 🚀 📄 huggingface.co/papers/2507.18…

29 248 2K 316K 1K

Download Image

Qwen @Alibaba_Qwen

a month ago

🚀 We’re excited to introduce Qwen3-235B-A22B-Thinking-2507 — our most advanced reasoning model yet! Over the past 3 months, we’ve significantly scaled and enhanced the thinking capability of Qwen3, achieving: ✅ Improved performance in logical reasoning, math, science & coding…

182 612 4K 784K 872

Download Image

alphaXiv @askalphaxiv

2 months ago

check out the IMO 2025 prompt in full detail: alphaxiv.org/abs/2507.15855

1 22 138 177K 172

elvis @omarsar0

2 months ago

Deep Research Agents with Test-Time Diffusion Google keeps pushing on diffusion. This time, they apply diffusion to deep research agents, specifically the report generation process. It achieves a 69.1% win rate vs. OpenAI Deep Research on long-form research. My notes:

8 130 648 79K 651

Download Image

Qwen @Alibaba_Qwen

2 months ago

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…

316 1K 9K 2.0M 4K

Download Image

David Ondrej @DavidOndrej1

2 months ago

models like Kimi, DeepSeek and Qwen will cost the closed AI labs BILLIONS of dollars. that's why nobody is talking about them. despite these LLMs absolutely crushing all of the benchmarks. Claude 4 Opus is literally *100x* more expensive than Kimi K2 yet both models have…

92 116 1K 85K 342

Qwen @Alibaba_Qwen

2 months ago

Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507! After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing…

214 579 4K 985K 835

Download Image

Elon Musk @elonmusk

2 months ago

230k GPUs, including 30k GB200s, are operational for training Grok @xai in a single supercluster called Colossus 1 (inference is done by our cloud providers). At Colossus 2, the first batch of 550k GB200s & GB300s, also for training, start going online in a few weeks. As Jensen…