Bảo Nguyễn Long @Debugger_w_

Joined May 2023

Tweets: 44
Followers: 3
Following: 119
Likes: 48

Alex Zhang @a1zhang

a week ago

🙏

7 101 638 53K 692

0 9 126 16K 100

A classic paper, collab between @AIatMeta , @GoogleDeepMind , and @NVIDIAAIDev Language models keep personal facts in a measurable amount of “storage”. This study shows how to count that storage—and when models swap memorization for real learning. 📡 The Question Can we…

16 171 938 100K 939

Google AI Studio @GoogleAIStudio

6 days ago

x.com/i/article/1960…

92 692 5K 1.5M 7K

TuringPost @TheTuringPost

a week ago

The freshest must-read research papers for you: ▪️ Diffusion LMs Know the Answer Before Decoding ▪️ Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference ▪️ StepWiser ▪️ ThinkDial ▪️ Provable Benefits of In-Tool Learning for LLMs ▪️ Understanding…

2 39 188 10K 130

Lior⚡ @LiorOnAI

a week ago

The best fine-tuning guide you'll find on arXiv this year. Covers:
> NLP basics
> PEFT/LoRA/QLoRA techniques
> Mixture of Experts
> Seven-stage fine-tuning pipeline

29 207 1K 88K 2K

elvis @omarsar0

a week ago

Overview of Self-Evolving Agents There is a huge interest in moving from hand-crafted agentic systems to lifelong, adaptive agentic ecosystems. What's the progress, and where are things headed? Let's find out:

26 189 1K 129K 1K

elvis @omarsar0

3 weeks ago

Retrieval-Augmented Reasoning with Lean Language Models Great paper showing how to fuse RAG and reasoning into a single small-footprint language model. Distillation works if done correctly. Very exciting results! Here are my notes:

15 92 559 72K 623

Deedy @deedydas

4 weeks ago

Tencent just dropped China's version of Google Genie 3! Yan is an incredible world model that generates 1080p worlds at 60fps (!) with no game engine, pure AI inference, at 0.11s latency and infinite video length. It's trained on ~150 days of video gameplay. The specs are…

32 178 1K 162K 831

elvis @omarsar0

4 weeks ago

Speed Always Wins Very nice and comprehensive new report on recent efficient architectures for LLMs.

12 103 605 73K 619

Rohan Paul @rohanpaul_ai

4 weeks ago

Absolutely Golden resource: A Comprehensive Survey of Self-Evolving AI Agents Self‑evolving agents are built to adapt themselves safely, not just run fixed scripts, guided by 3 laws, endure, excel, evolve. The survey maps a 4‑stage shift, MOP (Model Offline Pretraining) to…

9 80 354 26K 418

Sumanth @Sumanth_077

a month ago

Turn PDF files into clean, LLM-ready data! Dolphin is an open source document parsing framework that converts PDFs into structured formats like Markdown, HTML, LaTeX, and JSON. 100% Open Source

50 569 4K 412K 7K

AI Engineer @aiDotEngineer

a month ago

🆕 Releasing our entire Agent Reliability track! ft. - @tanmaigo (Hasura/PromptQL) - @dexhorthy (HumanLayer/Context Engineering) - @psomal (Temporal) - Anish Agarwal et al (@traversal_ai) - @mr_cheu (Glean) - @calcsam (Mastra) - @itamar_mar (Qodo) Agents are very hyped, but not…

2 36 238 40K 222

steve hsu @hsu_steve

a month ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? ... Our results reveal that CoT reasoning is a brittle mirage that vanishes when it is pushed beyond training distributions. This work offers a deeper understanding of why and when CoT reasoning fails, emphasizing the ongoing…

200 969 6K 782K 5K

Jason Weston @jaseweston

a month ago

🤖Introducing: CoT-Self-Instruct 🤖 📝: arxiv.org/abs/2507.23751 - Builds high-quality synthetic data via reasoning CoT + quality filtering - Gains on reasoning tasks: MATH500, AMC23, AIME24 & GPQA-💎 - Outperforms existing train data s1k & OpenMathReasoning - Gains on…

1 65 384 23K 287

elvis @omarsar0

a month ago

Hierarchical Reasoning Model This is one of the most interesting ideas on reasoning I've read in the past couple of months. It uses a recurrent architecture for impressive hierarchical reasoning. Here are my notes:

44 286 2K 255K 2K

Anthropic @AnthropicAI

a month ago

New Anthropic research: Persona vectors. Language models sometimes go haywire and slip into weird and unsettling personas. Why? In a new paper, we find “persona vectors”, neural activity patterns controlling traits like evil, sycophancy, or hallucination.

232 942 6K 1.4M 4K

TuringPost @TheTuringPost

3 months ago

Log-linear attention — a new type of attention proposed by @MIT which is: - fast and efficient as linear attention - expressive as softmax It uses a small but growing number of memory slots that increases logarithmically with the sequence length. Here's how it works:

13 226 1K 103K 1K

C Zhang @ChongZitaZhang

4 months ago

pku-epic.github.io/GraspVLA-web/ sim of grasp is ready for visual rendering (also almost ready for physics, based on other recent works on dynamics simulation)

2 27 179 24K 119

AI at Meta @AIatMeta

4 months ago

Introducing Meta Perception Language Model (PLM): an open & reproducible vision-language model tackling challenging visual tasks. Learn more about how PLM can help the open source community build more capable computer vision systems. Read the research paper, and download the…

53 273 1K 91K 573

TuringPost @TheTuringPost

4 months ago

9 notable AI models of the week: ▪️ 2 Olmo 2 Furious, @allen_ai ▪️ Phi-4 models from @Microsoft ▪️ Llama-Nemotron, @AIatMeta and @nvidia ▪️ DeepSeek-Prover-V2 ▪️ FoundationAI-SecurityLLM-Base-8B ▪️ Mellum-4b-base, @jetbrains ▪️ Amazon Nova Premier ▪️ Granite 4.0 Tiny Preview by…