LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…
Are you ready for web-scale pre-training with RL? 🚀
🔥 New paper: RLP: Reinforcement Learning Pre‑training
We flip the usual recipe for reasoning LLMs: instead of saving RL for post‑training, we bring exploration into pretraining.
Core idea: treat chain‑of‑thought as an…
Damn, very interesting paper. After the rapid loss reduction, we see deceleration that follows a "scaling law": this is because at these steps, gradients start to conflict with each other.
Updates are 'fighting for model capacity' in some sense, and the larger the model, the less fighting there…
Looking closer, PyTorch also uses FP32, but here's the real reason why bnb Adam is better: we optimized for float numerics, order does matter! Computing sqrt(v) + eps*c2 then dividing avoids amplifying errors vs PyTorch's sqrt(v)/c2 + eps. Same math, better stability!
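The two orderings in the post above can be compared side by side. What follows is a minimal NumPy sketch, not the bitsandbytes or PyTorch source: the function names are hypothetical, and it assumes `c1` is the first-moment bias correction `1 - beta1**t` and `c2` is the square root of the second-moment bias correction `1 - beta2**t`. It only checks that both forms give the same update in FP32 up to rounding; it does not try to reproduce the stability claim.

```python
import numpy as np

def adam_update_divide_first(m, v, c1, c2, lr, eps):
    # Ordering attributed to PyTorch in the post: sqrt(v)/c2 + eps
    denom = np.sqrt(v) / c2 + eps
    return (lr / c1) * m / denom

def adam_update_scale_eps(m, v, c1, c2, lr, eps):
    # Ordering attributed to bnb in the post: sqrt(v) + eps*c2, then divide
    denom = np.sqrt(v) + eps * c2
    return (lr * c2 / c1) * m / denom

# Assumed hyperparameters and synthetic optimizer state, all in float32.
beta1, beta2, t = 0.9, 0.999, 10
c1 = np.float32(1.0 - beta1 ** t)
c2 = np.float32(np.sqrt(1.0 - beta2 ** t))
lr, eps = np.float32(1e-3), np.float32(1e-8)

rng = np.random.default_rng(0)
m = rng.standard_normal(1000).astype(np.float32) * np.float32(1e-3)
v = rng.standard_normal(1000).astype(np.float32) ** 2 * np.float32(1e-6)

a = adam_update_divide_first(m, v, c1, c2, lr, eps)
b = adam_update_scale_eps(m, v, c1, c2, lr, eps)

# Largest relative disagreement between the two algebraically equal forms.
max_rel_diff = float(np.max(np.abs(a - b) / np.maximum(np.abs(a), 1e-30)))
print(f"max relative difference: {max_rel_diff:.3e}")
```

Multiplying the numerator and denominator of the first form by `c2` gives the second form, so any difference printed here comes from floating-point rounding alone.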
We are releasing 📄 FinePDFs:
the largest PDF dataset spanning over half a billion documents!
- Long context: Documents are 2x longer than web text
- 3T tokens from high-demand domains like legal and science.
- Heavily improves over SoTA when mixed with FW-EDU & DCLM web corpora.
GPT-5 Thinking is incredible! I asked it algo interview questions of the kind asked to SSEs. These are not available on the internet; I made them up by adding more constraints or twisting familiar scenarios. More than solving the questions, the reasoning it shows gives me goosebumps!
What if you could not only watch a generated video, but explore it too? 🌐
Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt.
From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵
New Anthropic research: Persona vectors.
Language models sometimes go haywire and slip into weird and unsettling personas. Why? In a new paper, we find “persona vectors”—neural activity patterns controlling traits like evil, sycophancy, or hallucination.
Now you can use an agent that can solve olympiad-level problems, completely FREE.
This intelligence can also be applied to coding, science, or any other domain you want.
We just open-sourced our agent system, Crux.
We don't require a subscription or any payment.
Just…
New paper & surprising result.
LLMs transmit traits to other models via hidden signals in data.
Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
Modern reasoning models think in plain English.
Monitoring their thoughts could be a powerful, yet fragile, tool for overseeing future AI systems.
I and researchers across many organizations think we should work to evaluate, preserve, and even improve CoT monitorability.
🚀 Hello, Kimi K2! Open-Source Agentic Model!
🔹 1T total / 32B active MoE model
🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models
🔹 Strong in coding and agentic tasks
🐤 Multimodal & thought-mode not supported for now
With Kimi K2, advanced agentic intelligence…
Introducing SmolLM3: a strong, smol reasoner!
> SoTA 3B model
> dual mode reasoning (think/no_think)
> long context, up to 128k
> multilingual: en, fr, es, de, it, pt
> fully open source (data, code, recipes)
huggingface.co/blog/smollm3
1.2M Followers 831K Following: Humanitarian | Environmentalist | 📱 Digital media 👥 Advocacy 🌍 Global development 🎓 @LondonU ⭕️ @UN Global Goals #Goalkeepers2030 🌿 For people and planet.
238 Followers 2K Following: Nanosciences, Artificial Intelligence, and Quantum Computing will take the world to the next level.
Writing code is my hobby; ML and numerical methods are the goal
2K Followers 1K Following: Building new AI hardware at @Positron_AI. 2013 Thiel Fellow, hardware hacker, entrepreneur. Previously founded @REXComputing | https://t.co/vqJ6oJMqWG
3K Followers 1K Following: Research Engineering Lead at @StanfordCRFM. Previously co-founder at Semantic Machines ⟶ MSFT. Lead developer of Levanter and Marin @[email protected]
105K Followers 789 Following: Writing my own AI story. Recent: NPI, AlphaGo tuning, learn to learn, AlphaCode, Gato, ReST, r-Gemma, Imagen3, Veo, Genie, MAI …
8K Followers 357 Following: Head Wadhwani School of Data Science and AI (@WSAI_IITM), Center for Responsible AI (@cerai_iitm), and Professor at IIT Madras (@iitmadras)
188K Followers 234 Following: Launch, land, operate in space – anywhere, anytime. We're the space and defense tech company delivering critical missions from LEO to the Moon and beyond.
16K Followers 364 Following: Runs an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
30K Followers 515 Following: Trends in Cognitive Sciences - monthly review journal featuring developments across cog sci and neurosci. Posts by the editor.
58K Followers 570 Following: Co-founder & CTO @hyperbolic_labs cooking fun AI systems. Prev: OctoAI (acquired by @nvidia) building Apache TVM, PhD @ University of Washington.
359K Followers 1K Following: ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
68K Followers 2K Following: I am an experimental psychologist who studies visual illusions as well as makes illusion artworks. #illusion #opticalillusion #perception #錯視
8K Followers 2K Following: Science Communicator and Video Producer, Genetics PhD. Let me talk to you about genes. Or blimps. (she/her) Email: [email protected]
53K Followers 63 Following: Student of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
87K Followers 0 Following: Make any song you can imagine. Join the Studio Waitlist now: https://t.co/496iRq3Xtg. Download Suno app 🎧 https://t.co/rIZHcFlaxI | https://t.co/lDIddI7Kuk
229K Followers 1 Following: Updates for developers building with the OpenAI Platform and API • Service status: https://t.co/kZwnwdYqOS • Support: https://t.co/qCi6M5ESZU
634K Followers 927 Following: Actor, television presenter, producer, director, husband & dad, who just happens to be short! President of @LPUKOnline, the charity for people with dwarfism.
21K Followers 98 Following: The #1 AI Engineering podcast & newsletter. Technical insights and news today you will use at work tomorrow! Hosted by @swyx and @fanahova
4K Followers 88 Following: Official account for DeepSpeed, a library that enables unprecedented scale and speed for deep learning training + inference.
日本語 : @DeepSpeedAI_JP
124K Followers 498 Following: Princeton CS prof. Director @PrincetonCITP. I use X to share my research and commentary on the societal impact of AI.
BOOK: AI Snake Oil. Views mine.
368K Followers 6K Following: Chief Scientist, Google DeepMind & Google Research. Gemini Lead. Opinions stated here are my own, not those of Google. TensorFlow, MapReduce, Bigtable, ...
1.1M Followers 1K Following: noun | a reference source containing words alphabetically arranged along with information about their forms, pronunciations, functions, and etymologies