model intelligence through data. building https://t.co/D5lOAwGz7f | phd dropout @mit @mitmedialab, bachelors @iitdelhi
San Francisco, CA | Joined October 2019
1/ We at @essential_ai recently released Essential-Web v1.0 -- a 24T-token dataset labelled with a 12-category taxonomy, produced by EAI-Distill-0.5b, a fine-tuned 0.5b-parameter model. With simple SQL filters, we obtain competitive datasets in math, web code, STEM and medical.
Incredible work by @RitvikKapila @ashVaswani and @essential_ai! They’ve open-sourced a 24T pre-training dataset with a detailed taxonomy spanning math, code, medical, science & more!
Open source at such massive scale is huge!!!!! Can’t wait to see what this unlocks for the…
1/Pretraining is hitting a data wall; scaling raw web data alone leads to diminishing returns. Today @datologyai shares BeyondWeb, our synthetic data approach & all the learnings from scaling it to trillions of tokens🧑🏼🍳
- 3B LLMs beat 8B models🚀
- Pareto frontier for performance
The way humans interact with the web has evolved over ages
- Search engines freed us from searching over books & libraries.
- Browsing links gave us quick answers from a few sources.
Now, with llms and AI, the shift is bigger. AI can read, process & reason across thousands of…
just setting up my twttr, again
I’ve been heads down building Parallel with some of the best people I’ve ever worked with. We’re creating infrastructure for AIs to search and use the web.
To me, the most striking thing about this video is the sheer diversity of robots and tasks.
This showcases our thesis: AGI that truly understands the physical world will be omni-bodied -- a single brain to control any robot for any task.
We will show more soon!
Everyone has been asking about America's "DeepSeek" moment. It's arrived. 🇺🇸.
We are incredibly excited to release Cogito v2. It is amongst the strongest open models in the world. It matches/exceeds the performance of the latest DeepSeek v3 and DeepSeek R1 models both, and…
Today, we are releasing 4 hybrid reasoning models of sizes 70B, 109B MoE, 405B, 671B MoE under open license.
These are some of the strongest LLMs in the world, and serve as a proof of concept for a novel AI paradigm - iterative self-improvement (AI systems improving themselves).…
You SHOULD have a thesis about the world when you’re building a company.
Otherwise you’re trapped in an echo chamber of technical milestones that don’t mean anything.
That’s the kind of mindset @btaylor has. And needs, when he's leading @OpenAI, @facebook, & @salesforce.
If you're a student wondering what you should study in the world of AI, it's still the same: math, physics, chemistry, biology, computer science, engineering.
STEM teaches reasoning, objectivity and how to think. It makes you better at learning everything else. In STEM, ideas…
1K Followers 5K Following | Machine Learning Engineer whose life revolves around Music, Books and Technology. Co-founder of @tinkerhub, a non-profit educational initiative.
12K Followers 3K Following | PhD-ing @MIT_CSAIL. Working on scalable and principled algorithms in #LLM and #MLSys. In open-sourcing I trust 🐳. she/her/hers
13K Followers 433 Following | Building next-gen AI at @thinkymachines. Past: Founding team @MistralAI, RS at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.
690K Followers 600 Following | entrepreneurship zealot, grounded technology possibilist, believer in the power of ideas, passionate about sustainability & impact
3K Followers 542 Following | Early stage partner to founders @southpkcommons, @ScriptCapital. Formerly @dropbox, @GreylockVC, @shopkick, Loopt (YC S05). SF local and lover of life's quirks.
325K Followers 3K Following | NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
163K Followers 166 Following | Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log
23K Followers 820 Following | partner @fpvventures - investing in seed/A. previous: investing @khoslaventures. first pm @meter, led growth @opendoor etc. love @shimoleejhaveri + 👦👧
92K Followers 207 Following | LMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / @lmsysorg. We’re hiring: https://t.co/1OkfLq2n0I