Advancing Open Source LLMs with Mixed Quality Data through offline RL-inspired C-RLFT. ⠀⠀⠀⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⠀⠀⠀ ⠀⠀𝗣𝗿𝗼𝗷𝗲𝗰𝘁 𝗟𝗲𝗮𝗱: Guan Wang, @AlpayAriyakhuggingface.co/openchatJoined July 2023
Will Sudoku become the MNIST for reasoning?
Simple rules, clear structure, unique solutions—yet surprisingly challenging for modern LLMs, often requiring explicit trial-and-error to solve.
huggingface.co/datasets/sapie…
🚀Introducing Hierarchical Reasoning Model🧠🤖
Inspired by brain's hierarchical processing, HRM delivers unprecedented reasoning power on complex tasks like ARC-AGI and expert-level Sudoku using just 1k examples, no pretraining or CoT!
Unlock next AI breakthrough with…
🚀Introducing OpenChat 3.6
🌟Surpassed official Llama3-Instruct—with 1-2M synthetic data compared to ~10M human labels
🤫GPTs are close to limits—excel at generation but fall short at complex tasks
🎯We are training next gen—capable of deterministic reasoning and planning
🔗…
🚀 The World's First Gemma fine-tune based on openchat-3.5-0106 data and method (C-RLFT). Almost the same performance as the Mistral-based version.
6T tokens = secret recipe?
HuggingFace: huggingface.co/openchat/openc…
🚀Announcing OpenChat-3.5 Update 0106: 𝗪𝗼𝗿𝗹𝗱’𝘀 𝗕𝗲𝘀𝘁 𝗢𝗽𝗲𝗻 𝗦𝗼𝘂𝗿𝗰𝗲 𝟳𝗕 𝗟𝗟𝗠!
Experience ChatGPT & Grok-level AI locally 💿!
Surpassing Grok-0 (33B) across all 4 benchmarks and Grok-1 (???B) on average and 3/4 benchmarks 🔥.
🎯 This update mainly enhanced…
224 Followers 440 FollowingBuilding SalesTouch, the first AI to vibe-sell your app.
I share insights on SaaS go-to-market, AI & automation.
Growth advisor for 10+ years.
3K Followers 5K FollowingCEO of Fusen. Connecting students with mentors, investors, and funding opportunities through our Fusen accelerators. @cklaus.bsky.social on Bluesky.
88 Followers 835 FollowingFourth-year CS undergrad from @Tsinghua_Uni
NLP/agent/RL/multimodal
Research intern @siebelschool (advised by @haopeng_nlp) @XlangNLP (advised by @taoyds)
1K Followers 3K FollowingIndependent Researcher: AI Alignment, Theoretical Math & Physics, Cultural Frameworks, Ecology, Philosophy, & Emergent Abundance. 👯♀️ Dad
262 Followers 519 Followingproduct @ together AI |
former SrDir TPM @ Nuro | SPG @ apple | AV @ Nissan Research | roboticist @ NASA Ames | ballerina in training
244 Followers 2K FollowingI'm here to experiment with what can be done with an X premium account and using it to compare and contrast with other AI ecosystems I use for work and projects
54K Followers 12 FollowingBuild and share machine learning apps in 3 lines of Python. Part of the @Huggingface family 🤗.
DMs are open for sharing your gradio app with us for promotion!
45K Followers 1K FollowingAI Developer Experience @GoogleDeepMind | prev: Tech Lead at @huggingface, AWS ML Hero 🤗 Sharing my own views and AI News 🧑🏻💻 https://t.co/7IosdlNz22
92K Followers 207 FollowingLMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / @lmsysorg. We’re hiring: https://t.co/1OkfLq2n0I
5K Followers 2K Followingbuilding @collinearAI 🧪 | MIT 35u35 | UN AI Advisory Body | Featured in NYT, Quanta, Science, MIT TR| Previously: @huggingface 🤗, @SFResearch, PhD @utcompsci
3K Followers 3K FollowingPost-Training Lead @ Together AI | OpenChat Project Lead (#1 7B LLM on Arena for 2+ months, 2M+ downloads) | DeepCoder, DeepSWE
2K Followers 2K FollowingPhD student at Tsinghua NLP & AIR, studying agents that automate tasks ranging from daily activities to creative endeavors. Two drifters with the world to see.
949K Followers 764 FollowingProfessor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
712K Followers 288 FollowingTogether with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
6K Followers 530 Followinge/λ Currently: Doing some stuff with AI.
Prev founding team of both: @NousResearch and @TTSLabsAI
DM for interesting conversations.
4K Followers 155 FollowingWelcome to 🎙️ ThursdAI
Your weekly AI spaces, newsletter, podcasts and community
Hosted by @altryne and available on https://t.co/xaPyX72Yel
50K Followers 5K FollowingCofounder and Head of Post Training @NousResearch, prev @StabilityAI
Github: https://t.co/LZwHTUFwPq
HuggingFace: https://t.co/sN2FFU8PVE