This new DeepMind research shows just how broken vector search is.
Turns out some docs in your index are theoretically incapable of being retrieved by vector search, given a certain dimension count of the embedding.
Plain old BM25 from 1994 outperforms it on recall.
1/4
I believe LLMs will inevitably surpass humans in coding.
Let us think about how humans actually learn to code. Human learning of coding has two stages. First comes memorization and imitation: learning syntax and copying good projects. Then comes trial and error: writing code,…
Researchers, don’t miss this: ‘The Big LLM Architecture Comparison’ by @rasbt lays out how modern models like DeepSeek-V3 and Kimi K2 differ in structure, efficiency, and capabilities.
Great for model design inspiration!
Link in comments.
Just came up with Multi-Scale Control for Stable Diffusion and I'm losing my mind!
Instead of your prompt flowing through ALL upsample/downsample blocks like normal, you can now inject DIFFERENT prompts at different resolution stages of the UNet.
Discovered something wild:…
pressure is a crazy feeling
it's an energy that will turn you into a pussy or a killer
you either run through the fucking wall and build confidence
OR
shut down and feel bad for yourself
BUT
you can always get back up and run through the fucking wall
SO
the real question…
Whop is now officially independent from Stripe.
What this means for anybody running a business on Whop:
> Orchestration through multi-PSPs is now available (lifts your revenue by 5 - 10%)
> Sellers can get paid out in nearly every country via local banks, BTC, or stablecoins…
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models
"Our work here reveals a critical phenomenon, temporal oscillation, where correct answers often emerge in the middle process, but are overwritten in later denoising steps. To address this issue, we…
MolmoAct: Action Reasoning Models that can Reason in Space
"Reasoning is central to purposeful action, yet most robotic foundation models map perception and instructions directly to control, which limits adaptability, generalization, and semantic grounding. We introduce…
Information geometry for deep learning:
neuromanifold of stochastic NNs, Fisher-Rao norm, natural gradient, singularities.
Relative Fisher information (RFIM) and relative natural gradient for tackling large NNs (ICML):
proceedings.mlr.press/v70/sun17b.html
Is Chain-of-Thought Reasoning of LLMs a Mirage?
... Our results reveal that CoT reasoning is a brittle mirage that vanishes when it is pushed beyond training distributions. This work offers a deeper understanding of why and when CoT reasoning fails, emphasizing the ongoing…
Yann LeCun was right about LLMs.
"we will never achieve AGI by simply scaling them up"
LLMs will be a key part of achieving it -- but calling them alone AGI is not enough
we need at least one major architectural breakthrough, if not many
so it's up to the labs now how quickly…
Diffusion LLMs are promising ways to overcome the limitations of autoregressive LLMs.
Less error propagation, easier to control, and faster to sample!
But how do Diffusion LLMs actually work? 🤔
Let's explore some ideas on this fascinating topic! youtu.be/8BTOoc0yDVA
Guys, I understand you like drama, but this is a remark about the AI development at large. We are seeing the plateau: just scaling up is coming to an end. For EVERYONE, not one company in particular.
Guys, I understand you like drama, but this is a remark about the AI development at large. We are seeing the plateau: just scaling up is coming to an end. For EVERYONE, not one company in particular.
3K Followers 2K Followingcurrently doing things at Mintlify, prev. built a search API (trieve acq. YCW24), sideprojecting a new Patreon at https://t.co/MTSczbZEku, progression fantasy and HN enjoyer
998 Followers 7K FollowingMain: @CashbackGoose
Protected by every law on the planet internationally.
Every possible loophole has been covered.
Cosmic immunity.
#TheForgeAGI
54 Followers 513 FollowingTagX is the Data & AI company, enabling Enterprises & SMBs to harness data and generative AI for smarter, faster decision-making.
73 Followers 508 FollowingNeurodegenerate maximalist
NCAA 3x First Team All-American Soccer Player
Biohacking for maximal degeneracy, sex, performance and quality of life
220 Followers 1K FollowingData Scientist |Open to Opportunities |Working on AI/ML |Mohun Bagan | Connect and DM for collaboration | Trying to write: https://t.co/qyCoJMx9mO
3K Followers 2K Followingcurrently doing things at Mintlify, prev. built a search API (trieve acq. YCW24), sideprojecting a new Patreon at https://t.co/MTSczbZEku, progression fantasy and HN enjoyer
19K Followers 144 FollowingI am an Astrobiologist, retired Anesthesiologist and lead The Science of Consciousness conferences at The University of Arizona.
5K Followers 547 FollowingCEO of Nose Breathing | High Level Grappler | No fluff, no WOO WOO Breathing Coach | ELASTICITY | Get Beginning Breath Down Below | Not Medical Advice
22K Followers 10K FollowingBuilding the largest entrepreneurial community on X while supercharging solopreneurs and SMB owners with transformative coaching and strategic connections
498 Followers 165 FollowingPostdoctoral Research Scientist with @neurojosh @Columbia in search for the memory code.
• Intracranial EEG & Single Neurons
• Human hippocampus
10K Followers 801 Followingcofounder @eldaeonuap | philosophy grad and abstract artist exploring artificial and nonhuman intelligence, consciousness, tech, and Spirit
43K Followers 3K FollowingWe're in a race. It's not USA vs China but humans and AGIs vs ape power centralization.
@deepseek_ai stan #1, 2023–Deep Time
«C’est la guerre.» ®1
43K Followers 1K FollowingMoney Twitter Guru turned Serial SaaS Entrepreneur turned Crypto Shill turned SaaS Entrepreneur. 7 figs in SaaS. 10 years in tech. Ex-@Google/@TikTok
5K Followers 1K FollowingPublic Speaking Coach | professional opera singer |🗣️your voice tells the world who you are | command a room with your voice and presence