Distilling LLM Agents! 🧪 New work shows how to transfer the reasoning & task-solving power of large language model agents into smaller, more efficient models by cloning their tool-using behavior with retrieval and code!
Fractal, an Indian AI company, dropped Fathom-R1-14B open-source reasoning model that achieves performance comparable to o4-mini on math benchmarks within a 16K context window, trained for just $499.
Built on top of DeepSeek-R1-Distill-Qwen-14B, It beats o3-mini-low.
🚨 Speaker Alert! 🚨
We’re kicking off Le Robot Hackathon Miami (June 14-15) with an amazing panel featuring @ClementDelangue, Co-Founder & CEO of @huggingface. Clem turned open-source AI into a global movement—now he’s jetting to the 305 to talk robotics, community, and why the…
The Worldwide @LeRobotHF hackathon is in 2 weeks, and we have been cooking something for you…
Introducing SmolVLA, a Vision-Language-Action model with light-weight architecture, pretrained on community datasets, with an asynchronous inference stack, to control robots🧵
H Company released Holo-1: 3B and 7B GUI Action Vision Language Models for various web and computer agent tasks 🤗
Holo-1 has Apache 2.0 license and @huggingface transformers support 🔥
more details in their blog post (next ⤵️)
Today we are releasing ether0, our first scientific reasoning model.
We trained Mistral 24B with RL on several molecular design tasks in chemistry. Remarkably, we found that LLMs can learn some scientific tasks more much data-efficiently than specialized models trained from…
Introducing an all-new suite of tools built on swarms - the production-grade framework for autonomous agent swarms
⎆ Documentation Intelligence
⎆ Cross-language Compilation
⎆ Multi-agent Architecture
⎆ Financial Enterprise Solutions
Here's what our lead developer…
Anyone who thinks you need 100k GPUs to make progress should watch Hannaneh Hajishirzi COLM keynote. Molmo appeared to beat Llama 3.2 in quality with same release day, all open-science on a 1k GPU cluster youtube.com/watch?v=qMTzor…
A small number of people are posting text online that’s intended for direct consumption not by humans, but by LLMs (large language models). I find this a fascinating trend, particularly when writers are incentivized to help LLM providers better serve their users!
People who post…
Introducing llms.txt Generator ✨
You can now concatenate any website into a single text file that can be fed into any LLM.
We crawl the whole website with @firecrawl_dev and extract data with gpt-4o-mini.
Create your own llms.txt at llmstxt.firecrawl.dev!
The Algorithm Design Manual
- Practical approach
- Real-world examples
- Problem-solving strategies
- Good book for someone trying to understand algorithms
- It will require some understanding of any language.
- Resources: github.com/mohitmishra786…
📈 The State of Generative AI in the Enterprise
Interesting report from Menlo Ventures that shows the evolution of Gen AI in companies from 2023 to 2024:
• Uses cases: Code generation, chatbots, search, data, and meeting summarization are the top generative AI use cases in…
Literally a beast of a book.
Emphasizes heavily on code and modern deep learning architectures
Important concepts are highlighted so it’s easier to understand and focus.
Less than 48 hours ago, DeepSeek AI from China just dropped their AI reasoning model.
And it's on par with OpenAI o1-preview. Major shift.
10 examples (and how to try):
v0 can now:
• Create and run full-stack Next.js and React applications
• Create multiple files in one generation
• Link and deploy to Vercel projects
• Use Vercel project environment variables
Vision finetuning is finally in🦥@UnslothAI! It took a while, but Llama 3.2 Vision, Pixtral, Qwen2 VL & all Llava variants now work!
1. QLoRA / LoRA is 1.3x to 2x faster for each
2. 30-70% less VRAM usage
3. 3 examples - Radiography, LaTeX, Q&A
Extra stuff:
1. Pixtral chat…
75 Followers 240 FollowingChrist Follower. ProLife. Husband and Father. Lover of people, sincerity and tough mindedness. Fly fishing, hobby farmer and travel with my wife.
174 Followers 144 Following🚀 Master AI in 5 min/day👉 #1 AI newsletter for business - 31,000+ ⚡️ Build AI workflows and Masterclasses 📩 DM “AI” for your free ChatGPT playbook
28 Followers 76 FollowingTravel Lover & #ESCP student from Germany, currently living in #Paris. VC experience at #Next47 / #AxelSpringerDigital; Co Creator #traction
64K Followers 13 FollowingDevelop profitable trading strategies, build a systematic trading process, and trade your ideas with Python—even if you’ve never done it before.
205K Followers 5K FollowingVC at @MenloVentures. Formerly founding team @glean, @Google Search. @Cornell CS. Tweets about tech, immigration, India, fitness and search.
355K Followers 1K FollowingML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
50K Followers 5K FollowingCofounder and Head of Post Training @NousResearch, prev @StabilityAI
Github: https://t.co/LZwHTUFwPq
HuggingFace: https://t.co/sN2FFU8PVE
38K Followers 991 FollowingCreator of bitsandbytes.Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
283 Followers 1 FollowingAdvancing Humanity with Artificial Intelligence
Join Agora, an community transcending the modern world's limits!
https://t.co/ZUxUywFdEs
228 Followers 190 FollowingDeepStation is the largest and longest running weekly AI community in Miami. We bring together AI engineers and enthusiasts to stay ahead of the AI wave
36K Followers 2K FollowingCryptofund since July 1st, 2015 // Deep technical & full stack value-add for blockchain projects // We champion the leaders of the new internet.
43K Followers 96 FollowingDeFi’s most powerful intent-solver.
You define your outcome, Haiku executes everything in 1 optimized tx.
Multi-protocol, multi-primitive, and cross-chain! 🧘
54K Followers 166 FollowingFirst decentralized world computer capable of running AI 🤖 and AI-powered Dapps ⚒. MainNet is out. Go #BUILD!🔥 $CTXC
https://t.co/s7K5ET09PZ
1K Followers 1K FollowingSocial impact founder enabling the next gen of African tech leaders @directeddev, #buildinginpublic Also photography IG@wusallphoto
4.0M Followers 0 FollowingThe universal platform for crypto, blockchain apps, stablecoins & decentralized tech. An account about the Ethereum ecosystem maintained by @ethereumfndn.
No recent Favorites. New Favorites will appear here.