Long in the making, finally released: Apertus-8B and Apertus-70B, trained on 15T tokens of open data from over 1800 languages. A unique opportunity in academia to work on and train LLMs across the full stack. We managed to pull off a pretraining run with some fun innovations, ... https://t.co/c8ECD4rp34
We uploaded V3 of our draft book "The Elements of Differentiable Programming". Lots of typo fixes, clarity improvements, new figures and a new section on Transformers! arxiv.org/abs/2403.14606
4K Followers 2K Following · Hexaly is the world’s fastest optimization solver for Routing, Scheduling, Packing, and more. Join our fast-growing developer community!
11K Followers 3K Following · Senior director of Cisco Foundation AI, Former Chief Scientist at Robust Intelligence. ex Professor at Yale University, ex staff research scientist at Google.
584 Followers 2K Following · Assistant Professor (Senior Lecturer) at Ben-Gurion University of the Negev, Head of Intelligent Systems, Geometric Computing, and Sensing Lab
32 Followers 195 Following · Master's student at the Graduate School of Artificial Intelligence in @postech2020. Previously: student researcher at @google
149 Followers 1K Following · Agentic Systems @InstalilyAI | @Penn alum | Exploring the intersection of computation and creativity, fueled by South Indian filter ☕
80 Followers 613 Following · phd @tamu. prev: swe @stripe, bs @utaustin. i want to mechanistically understand models through the lens of training dynamics. 🇵🇪🏳️‍🌈
3K Followers 101 Following · CSCS, the Swiss National Supercomputing Centre, develops and promotes technical and scientific services in the fields of high-performance computing. #weareAlps
4K Followers 800 Following · Machine Learning Research at the ELLIS Institute & Max-Planck for Intelligent Systems // Excited about fundamental questions in Safety & Efficiency of modern ML
288 Followers 52 Following · EurIPS is a community-organized, NeurIPS-endorsed conference in Copenhagen where you can present papers accepted at @NeurIPSConf
30 Followers 53 Following · PhD student in machine learning with Francis Bach & Michael I. Jordan: uncertainty quantification, conformal prediction, learning theory. https://t.co/dkoZLl0pDg
2K Followers 739 Following · teaching robots to see by day, learning from nature by night. in search of elegant solutions to the metaproblem. infinitely curious.
19K Followers 3K Following · The #FondationTaraOcéan is the first foundation in France recognized as being of public utility and dedicated to the Ocean. #ExplorerEtPartager
18K Followers 4K Following · Associate Professor at UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning.
271 Followers 370 Following · Theoretical physicist (bow tie included), inherently out of equilibrium. Studying data structure and deep learning. Marie Skłodowska-Curie fellow at @SISSA.
112 Followers 55 Following · Researcher at Ecole des Ponts, Paris. Interested in operations research and the applications of machine learning to operations research
11K Followers 1K Following · I like tokens! I lead the OLMo data team at @allen_ai w/ @kylelostat. Open source is fun 🤖☕️🍕🏳️‍🌈 Opinions are sampled from my own stochastic parrot