Harnessing AI 🤖 to push the boundaries of innovation 🚀 | Passionate about science 🧠, engineering ⚙️, and the endless pursuit of knowledge 📚.
yunusgungor.com
Muğla, Türkiye
Joined July 2023
Chain-of-Agents
Interesting idea to train a single model with the capabilities of a multi-agent system.
84.6% reduction in inference cost!
Distillation and Agentic RL are no joke!
Here are my notes:
Problem-solving is at least 50% of every job in tech and science.
Mastering problem-solving will make your technical skill level shoot up like a hockey stick. Yet, we are rarely taught how to do so.
Here are my favorite techniques that'll loosen even the most complex knots:
Absolutely golden resource: A Comprehensive Survey of Self-Evolving AI Agents
Self‑evolving agents are built to adapt themselves safely, not just run fixed scripts, guided by 3 laws: endure, excel, evolve.
The survey maps a 4‑stage shift,
MOP (Model Offline Pretraining) to…
Wow, this is really cool! This research answers the question: what if your computer-use AI was not a black box?
OpenCUA: Open Foundations for Computer-Use Agents
Researchers from HKU, Moonshot AI, and others present OpenCUA—a fully open-source framework for building and…
Introducing qqWen: our fully open-sourced project (code + weights + data + detailed technical report) for full-stack finetuning (pretrain + SFT + RL) of a series of models (1.5B, 3B, 7B, 14B & 32B) for a niche financial programming language called Q
All details below!
The best Chinese open agentic/reasoning models. When to use each?
• Kimi K2 – if you need a well-rounded, strong open base with agentic plus long-context strength.
• GLM-4.5 – the most tool-savvy, agent-native model today.
• Qwen3 – the best one if you need control,…
Context engineering is the new prompt engineering.
And it’s becoming the most critical AI skill.
Together with @MiqJ (Product Lead at @OpenAI) we created a comprehensive guide.
Key insights: 🧵👇
Context Engineering
@dbreunig and I did a meetup on context engineering last night. Wanted to share slides (below) + a recap of some themes / discussion points.
1/ Context grows w/ agents. @ManusAI_HQ mentions typical task requires ~50 tool calls.
manus.im/blog/Context-E…
2/…
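A rough sketch of why point 1/ matters: if every tool call appends its result to the context, the prompt grows linearly with the number of calls, and the total tokens the model re-reads over the task grow roughly quadratically. The ~50 tool calls figure comes from the post above; the token sizes and the `context_growth` helper are assumptions made up for illustration.

```python
def context_growth(num_calls, system_tokens, tokens_per_call):
    """Return (final_context_size, total_tokens_read) for an agent that
    appends every tool result to its context and re-reads it all each step."""
    context = system_tokens
    total_read = 0
    for _ in range(num_calls):
        total_read += context       # the model reads the whole context before acting
        context += tokens_per_call  # the call and its observation are appended afterwards
    return context, total_read


# Assumed sizes: 2,000-token system prompt, 1,000 tokens added per tool call.
final_size, total_read = context_growth(
    num_calls=50, system_tokens=2_000, tokens_per_call=1_000
)
print(final_size, total_read)
```

Under these assumed sizes the context ends at 52,000 tokens, while the model has read 1,325,000 tokens in total across the 50 steps.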
This paper makes a bold claim!
AlphaGo Moment for Model Architecture Discovery
The researchers introduce ASI-Arch, the first Artificial Superintelligence for AI Research (ASI4AI), enabling fully automated neural architecture innovation.
No human-designed search space. No human…
Beautiful @GoogleResearch paper.
LLMs can learn in context from examples in the prompt, can pick up new patterns while answering, yet their stored weights never change.
That behavior looks impossible if learning always means gradient descent.
The mechanisms through which this…
Brilliant paper from Google.
Test-Time Diffusion Deep Researcher (TTD-DR) - Conceptualizes research report generation as a diffusion process.
The agent rewrites its draft at every step, so errors fade instead of piling up.
TTD-DR builds a living draft, keeps feeding it fresh search…
Good answers follow good reasoning
VeriFree is a new method that keeps the benefits of reinforcement learning (RL) but gets rid of a verifier model and rule-based checking.
It trains the model to get closer to a known good answer, called a reference answer.
Benefits:
• It's…
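One way to picture "getting closer to a reference answer" without a verifier is to score each sampled reasoning trace by the probability the model itself assigns to the reference answer after that trace. The sketch below only illustrates that idea and is not the VeriFree implementation; the per-token log-probabilities are invented numbers and `reference_answer_score` is a hypothetical helper.

```python
import math


def reference_answer_score(answer_token_logprobs):
    """Probability assigned to the whole reference answer, given the
    question and one sampled reasoning trace (product of token probabilities)."""
    return math.exp(sum(answer_token_logprobs))


# Hypothetical log-probs of the reference answer's tokens after two traces.
traces = {
    "sound reasoning": [-0.1, -0.2, -0.05],
    "flawed reasoning": [-2.3, -1.7, -3.0],
}

scores = {name: reference_answer_score(lp) for name, lp in traces.items()}
best = max(scores, key=scores.get)
print(best, scores)
```

A trace that makes the reference answer more likely gets a higher score, so it can serve as a training signal without any separate checker.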
🧐How can we teach Multimodal models or Agents “When to Think” like humans?
👉Check Out: Think-or-Not (TON)
🔥Selective Reasoning via Reinforcement Learning for Vision-Language Models
arXiv: arxiv.org/pdf/2505.16854
code: github.com/kokolerk/TON
We introduce “thought dropout”…
Learn to Reason via Mixture-of-Thought
Interesting paper on improving LLM reasoning by using multiple reasoning modalities:
- code
- natural language
- symbolic (truth-table) representations
Cool idea and nice results.
My notes below:
Trust your AI, but can it trust itself? 🤔
Introducing an online reinforcement learning framework, RISE (Reinforcing Reasoning with Self-Verification), enabling LLMs to simultaneously level-up BOTH their problem-solving AND self-checking skills!
🧐 Problems tackled:
✅…
Elegant theoretical derivations are exclusive to physics. Right?? Wrong!
In a new preprint, we:
✅"Derive" a spiking recurrent network from variational principles
✅Show it does amazing things like out-of-distribution generalization
👉[1/n]🧵
w/ co-lead @dekelgalor & Jake Yates
What would truly open-source AI look like? Not just open weights, open code/data, but *open development*, where the entire research and development process is public *and* anyone can contribute. We built Marin, an open lab, to fulfill this vision: