darthy @geekDarthy
Machine learning researcher in deep learning, computational probability, inference, and causality. Sydney, New South Wales Joined December 2014-
Tweets332
-
Followers153
-
Following1K
-
Likes4K
Scaling laws in deep RL? Turns out that batch size, learning rate, and UTD (update-to-data) for getting the most efficient and scalable deep RL has predictable relationships. Checkout the analysis in new work by @_oleh & collaborators: arxiv.org/abs/2502.04327
After more than a year of working on SFT, it's clear — it’s just overfitting to in-domain tasks and lacks true generalization. RL is the real future of intelligent systems. 🌟🤖 SFT is out, the RL revolution is in 🚀🔥
Imagine creating custom datasets and training AI models WITHOUT writing a single line of code. We did and made it a reality. @huggingface Synthetic Data Generator Blog: huggingface.co/blog Space: huggingface.co/spaces/argilla… GitHub: github.com/argilla-io/syn…
Synthetic data and iterative self-improvement is all you need. No humans needed in the evaluation loop. This paper introduces a self-improving evaluator that learns to assess LLM outputs without human feedback, using synthetic data and iterative self-training to match top…
Brilliant paper from @Meta having the potential to significantly boost LLM's reasoning power. Why force AI to explain in English when it can think directly in neural patterns? Imagine if your brain could skip words and share thoughts directly - that's what this paper achieves…
Microsoft Phi-4 is announced! It's a 14B parameter LM trained heavily on synthetic data, with very strong performance, even exceeding GPT-4o on GPQA and MATH benchmarks! Currently available on Azure AI Foundry, will be on HuggingFace next week
Training Large Language Models to Reason in a Continuous Latent Space Introduces a new paradigm for LLM reasoning called Chain of Continuous Thought (COCONUT) Extremely simple change: instead of mapping between hidden states and language tokens using the LLM head and embedding…
Text-to-SQL has been my passion since Yale Spider 1.0! But as LLMs master it, real-world complexity demands more. 🚀After a year of work, Spider 2.0 shows the gap: o1 achieves just 17%! The path to production deployment is still long but exciting! more👉spider2-sql.github.io
Text-to-SQL has been my passion since Yale Spider 1.0! But as LLMs master it, real-world complexity demands more. 🚀After a year of work, Spider 2.0 shows the gap: o1 achieves just 17%! The path to production deployment is still long but exciting! more👉spider2-sql.github.io https://t.co/xq2E2RDZmV
1/2 Critic-RM: A Self-Critiquing AI Framework for Enhanced Reward Modeling and Human Preference Alignment in LLMs Critic-RM, developed by researchers from GenAI, Meta, and Georgia Institute of Technology, enhances reward models through self-generated critiques, eliminating the…
I am happy to announce that the first draft of my RL tutorial is now available. arxiv.org/abs/2412.05265
It was a huge week of AI and LLM papers. Here are the top ML Papers of the Week (Dec 2-8): - Genie 2 - GenCast - OpenAI o1 - Auto-RAG - Reverse Thinking - Retrieval-Augmented Reasoning for LLMs Read on for more:
5). Auto-RAG - an autonomous iterative retrieval model with superior performance across many datasets; Auto-RAG is a fine-tuned LLM that leverages the decision-making capabilities of an LLM. x.com/omarsar0/statu…
5). Auto-RAG - an autonomous iterative retrieval model with superior performance across many datasets; Auto-RAG is a fine-tuned LLM that leverages the decision-making capabilities of an LLM. x.com/omarsar0/statu…
7). Challenges in Human-Agent Communication - present a comprehensive analysis of key challenges in human-agent communication, focusing on how humans and AI agents can effectively establish common ground and mutual understanding. microsoft.com/en-us/research…
OpenAI announced a new RL finetuning API. You can do this on your own models with Open Instruct -- the repo we used to train Tulu 3. Expanding reinforcement learning with verifiable rewards (RLVR) to more domains and with better answer extraction (what OpenAI calls a grader, a…
I will be at #NeurIPS2024:1️⃣Dec 10 (Tue) 9:30am-12: Our Tutorial "Causality for LLMs" w/ Sergio Garrido + Panel w/ @Yoshua_Bengio @bschoelkopf @_jasonwei @Swarooprm7 @giambattista92 2️⃣Dec 11 (Wed) 11am-2pm: Our GovSim Poster (Tragedy of Commons for LLM Agents) 3️⃣Dec 13 (Fri)…
I will be at #NeurIPS2024:1️⃣Dec 10 (Tue) 9:30am-12: Our Tutorial "Causality for LLMs" w/ Sergio Garrido + Panel w/ @Yoshua_Bengio @bschoelkopf @_jasonwei @Swarooprm7 @giambattista92 2️⃣Dec 11 (Wed) 11am-2pm: Our GovSim Poster (Tragedy of Commons for LLM Agents) 3️⃣Dec 13 (Fri)… https://t.co/CewJTJQ432
Learn Rust 🦀 from scratch with the comprehensive guide created by @Android team. 🔗 in comments From the basics to more advanced topics like concurrency & bare-metal programming, you'll find everything you need to start with Rust. PS: The table of contents is well structured!
Natural Language Reinforcement Learning (NLRL) redefines Reinforcement Learning (RL). The main idea: In NLRL, the core parts of RL like goals, strategies, and evaluation methods are reimagined using natural language instead of rigid math. What are the benefits? - NLRL uses not…
Consolidated insights on LLM fine-tuning - a long read across 114 pages. "Ultimate Guide to Fine-Tuning LLMs" Worth a read during the weekend. Few ares it covers 👇 📊 Fine-tuning Pipeline → Outlines a seven-stage process for fine-tuning LLMs, from data preparation to…
🚨LLM Reasoners 🧠 A library for LLMs to do advanced reasoning, including latest algorithms: - Reasoning-via-Planning (RAP) 🎶 - Tree-of-Thought (ToT) 🌴 - beam search, and more All in unified perspective of world models🌎 and reward🥇 More alg & results coming soon!
🚨LLM Reasoners 🧠 A library for LLMs to do advanced reasoning, including latest algorithms: - Reasoning-via-Planning (RAP) 🎶 - Tree-of-Thought (ToT) 🌴 - beam search, and more All in unified perspective of world models🌎 and reward🥇 More alg & results coming soon! https://t.co/OtxL3oUF9a

The Knowledge Graph C... @KGConference
4K Followers 2K Following KGC brings together leaders across industry and research defining the future of knowledge graphs, LLMs, and AI.
SectorRotation🇺�... @Ertalrer596
48 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
Reesloez @Reesloezb08cA0
32 Followers 1K Following 私とデートしたい場合は、https://t.co/Xd0yAfEgsr にアクセスして直接話してください。
Alice @alicehaider56
157 Followers 3K Following
Narte @Narte85HwO3
63 Followers 916 Following
Unutilized Opportunit... @Unutilizedoppo
23 Followers 493 Following We help people have access to opportunities from all over the world
Sdoynu @SdoynupgXYf
49 Followers 919 Following
Monty Anderson @monty10x
1K Followers 3K Following founder @prodialabs — fastest image generation in the world
Papi Power @PapiP0wer
254 Followers 3K Following Papichulo will save you! I do not provide financial advice.
Ningyu Zhang@ZJU @zxlzr
3K Followers 2K Following Associate Professor @ZJU_China. Research interests include NLP, LLM, KG, Agent, Knowledge Editing.
Siwei Wu(吴思为�... @siweiwu7
267 Followers 348 Following I am a PhD Student of the NLP group at the University of Manchester. I am interested in LLM, AIGC, and Multimodal Model
Ned Letcher @nletcher
1K Followers 8K Following data (science | analytics | visualisation | engineering), @thoughtworks, #Python, #nlproc, ML, & assorted whimsical miscellania
Radiant Creative @Radiantcreativ
485 Followers 1K Following We help midlife women thrive with hormonal rhythm awareness, midlife productivity without burnout, and radical reinvention. We’re launching 9.1.25. Wanna play?
TracyWard @kd97bPxfHFgsE
19 Followers 1K Following
Zhengyang Geng @ZhengyangGeng
1K Followers 651 Following PhD student @SCSatCMU with @zicokolter / curiosity&love / dynamics to super intelligence
Yixin Wang @yixinwang_
692 Followers 5K Following
wasmCloud @wasmcloud
3K Followers 2K Following Incubating CNCF Project. Build, manage, and scale polyglot apps across any cloud, K8s, or edge. Join us on Bluesky: https://t.co/lzXzKZYaao
Kumo @Kumo_ai_team
2K Followers 903 Following Build AI models to get predictions and embeddings from your relational data — without feature engineering.
Siddharth Joshi @sjoshi804
1K Followers 2K Following ML PhD at @UCLA under @baharanm | Data Curation for Efficient & Robust SSL @datologyai | Prev @MSFTResearch, @Cisco Research, @Microsoft
Statistics Dept - U o... @StatsUMan
1K Followers 3K Following Official twitter account for the Department of Statistics at the University of Manitoba.
Stew Ackerman @AckermanStew
14 Followers 380 Following
The Hustl Club @thehustlclub
276 Followers 2K Following Join Annie and John, two self-proclaimed ‘hustlers’ who are extremely passionate about creating profitable side-hustles and passive income streams.
nyw @nywxy
34 Followers 4K Following
CVSM-Group @bupt_cvsm
277 Followers 949 Following Computer vision and smart medicine (CVSM) group. Focus on #ComputerVision, #MachineLearning, #MedicalImageAnalysis. Homepage: https://t.co/bbhCkzrogU
Syed Kamran Pasha @MuhammedSalar9
244 Followers 5K Following Data analyst -Spiritual -Troglodyte 🇪🇭 My views are my own (whom else can it be!) He/Him
Qinyi Zhang @qinyizhang1811
90 Followers 126 Following Kernel methods, nonparametric association testing, statistical machine learning, large-scale approximation methods.
Gautier Marti @GautierMarti1
2K Followers 852 Following #AI #Quant #MachineLearning #DeepLearning #NLP #QuantitativeTrading #StatArb #ADML #HKML #trailrunning
Jun @junzhao333
140 Followers 4K Following NLP@~ Only focus on professional skills and self-improvement
EuroCIM @TheEuroCIM
1K Followers 954 Following The European Causal Inference Meeting - causal inference in health, economic and social science. We retweet posts on causal inference if you tag @TheEuroCIM.
Julianne (Junyan) Son... @sunflowerMath
38 Followers 1K Following PhD in Applied Math and Statistics. Machine learning.
Audrey Boraski she/he... @audrey_boraski
1K Followers 4K Following MS Conservation Bio @AntiochNewEng Regional Planner @FranklinCOG #LandUse #NaturalResources #Transportation #Wildlife #Bioacoustics #WildlifeTechnology
チカ @ch_1_k_a
574 Followers 422 Following ML researcher. Causal Inference, Fairness, and 🇺🇸🇫🇷🇩🇪; On ne voit bien qu'avec le cœur. L’essentiel est invisible pour les yeux.
Luca Ambrogioni @LucaAmb
6K Followers 2K Following Ass. prof. of Machine Learning. PI of Generative Memory Lab (@DondersInst). Statistical physics, generative diffusion, memory, and generalization.
Vagelis Papalexakis @vagelispapalex
2K Followers 2K Following Computer Scientist working on #datascience #machinelearning #tensors Associate Professor @UCR_CSE, PhD @ScSatCMU,summer internships @MSFTResearch and @Google
MachineCurve.com @MachineCurve
935 Followers 1K Following Account no longer active · Username maintained to avoid misuse.
Jan Feyereisl @thefillm
765 Followers 4K Following Senior Research Scientist - GoodAI (@GoodAIdev) & Executive Director - AI Roadmap Institute (@AIroadmap)
Saravanan Kandasamy @Saravanan_CU
154 Followers 233 Following CS Grad student @ Cornell. Passionate researcher. My focus is towards contributing novel&beautiful ideas to problems at the intersection of causality/algorithms
Journal of Applied St... @JAppliedStats
3K Followers 4K Following Journal of Applied Statistics + Journal of Applied Statistics: Environmental Statistics and Data Science (new sister journal).
Global Academy Jobs @AcademyJobs
3K Followers 4K Following Sharing international academic job vacancies, career advice, and great research! Built by universities for universities. #higherED #AcademicChatter #Jobs
Vitalii Bilokon @NewgroundAI
1K Followers 3K Following We believe that the next big thing in AI will be related to the bio-inspired evolutionary algorithms. #Neuroevolution Algorithms.
JobAdvertising.com @JobAdvertisings
13K Followers 10K Following The Easiest Way to Recruit and Hire. https://t.co/RlCGT2kU8W has the technology, data, and ad experts to help employers fill their jobs faster with top talent.
Neeraj Wagh @neeraj_wagh
591 Followers 6K Following Representation learning for EEG signals. Bioengineering PhD student @BIOENGatIL. MS in Statistics @illinois_alma. Computer Engineer.
Dr. Kevin D. Brown @kevinbigdata
75 Followers 519 Following AI/ML/ Cloud Technology Strategist & Architect, Consulting leader & future Data Scientist. Doctorate in AI/ML Comments are mine. #artificialIntelligence #data
Juho Kim @juhokim
277 Followers 1K Following Machine learning & health AI & computational biology; PhD UIUC
Ihtesham Haider @ihteshamit
160K Followers 551 Following co-founder of @theprohumanai, running a brand agency & ai businesses, sharing what i learn here so you can be more productive and successful.
International Semanti... @iswc_conf
3K Followers 90 Following The International Semantic Web Conference since 2001
The Knowledge Graph C... @KGConference
4K Followers 2K Following KGC brings together leaders across industry and research defining the future of knowledge graphs, LLMs, and AI.
oxfordsemantic @oxfordsemantic
2K Followers 2K Following The creators and developers of RDFox, a high performance knowledge graph and semantic reasoning engine. https://t.co/AmHBorZmuz
Abhishek Upperwal @upperwal
778 Followers 597 Following Building Foundation Models • Founder at @soketlabs • @iiscbangalore • HPC ♥️ AI
Yuchen Cheng @yuchenrcheng
305 Followers 491 Following Software Engineer at Heywhale 🐳 / GitHub: https://t.co/TV2E9At1G7 / #Kubernetes #LLMOps / Chinese · English · Japanese / WeChat Official Account: YC Cheng
NVIDIA AI Developer @NVIDIAAIDev
81K Followers 321 Following All things AI for developers from @NVIDIA. Additional developer channels: @NVIDIADeveloper, @NVIDIAHPCDev, and @NVIDIAGameDev.
NVIDIA Omniverse @nvidiaomniverse
22K Followers 321 Following The official handle for #NVIDIAOmniverse. The platform for developing #OpenUSD applications for industrial digitalization and generative physical #AI.
Avi Chawla @_avichawla
50K Followers 134 Following Daily tutorials and insights on DS, ML, LLMs, and RAGs • Co-founder @dailydoseofds_ • IIT Varanasi • ex-AI Engineer @ MastercardAI
Dify @dify_ai
19K Followers 164 Following Build Production-Ready AI Agent GitHub: https://t.co/MfnJ29Agzj Discord: https://t.co/DJmS3kYvYZ Reddit: https://t.co/EneVBsKTzR
n8n.io @n8n_io
54K Followers 1 Following Workflow automation for technical teams to build AI solutions that integrate with any app or API at no-code speed and code flexibility. Open and self-hostable
Hunyuan @TencentHunyuan
25K Followers 6 Following Tencent's large model, encompasses text generation, image generation, video generation, and 3D generation.
Unwind AI @unwind_ai_
18K Followers 2 Following Step-by-step guides to building AI Agents & RAG Apps with LLMs | Subscribe now for daily AI news & tutorials in your inbox 📨
SemiAnalysis @SemiAnalysis_
34K Followers 16 Following
QboticsLabs @QboticsLabs
700 Followers 2K Following A research #startup, doing research in Image Processing, #EmbeddedSystem, Wearable #technology, Green Technology and #Robotics.
❄️Andrew Zhao❄�... @_AndrewZhao
4K Followers 3K Following PhD @Tsinghua_Uni. Absolute Zero,ExpeL,Diver-CT Research Intern @MSFTResearch, Ex. @ BIGAI. Interested in RL, Reasoning/Safety 4 LLMs, Agents. On job market 26'
Altera @AlteraFPGA_
29K Followers 27 Following Accelerating innovators across the globe through flexible, programmable products.
RISC-V International @risc_v
32K Followers 490 Following RISC-V International is the non-profit home of the open standard RISC-V Instruction Set Architecture (ISA), related specifications, and stakeholder community.
Arm @Arm
89K Followers 2K Following Arm’s foundational technology is defining the future of computing. A future built by the greatest technology ecosystem in the world. A future built on Arm.
MCP.so @chatmcp
1K Followers 9 Following 16000+ MCPs, ONE https://t.co/h0E4spor9M — discover the best MCP Servers and Clients.
ManusAI @ManusAI_HQ
204K Followers 25 Following Manus is the general AI agent that bridges minds and actions: it doesn't just think, it delivers results. Download our app: https://t.co/XSfjRhjdgo
MetaGPT @MetaGPT_
9K Followers 219 Following The Multi-Agent Framework The World's First AI Dev Team: https://t.co/5ONAO5tqCq Discord: https://t.co/vlkPJDMSQZ
AgentOps 🖇️ @AgentOpsAI
22K Followers 15 Following Making the next 1 billion agents fast, safe, and reliable. Agents suck. We're fixing that. (DMs open) https://t.co/KzRvFOijzL Agent Consulting: https://t.co/LRCXTHyXe2
Unsloth AI @UnslothAI
31K Followers 457 Following Open source LLM fine-tuning & RL! 🦥 https://t.co/2kXqhhvLsb
AGI Open Network @AGIOpenNetwork
50K Followers 298 Following AI Agent Development Platform. Empower anyone to create, deploy, and monetize AI Agents! Backed by @HashKeyGroup @CSDN_Global TG: https://t.co/RdiM9cfAVP
Agentica Project @Agentica_
3K Followers 8 Following Building generalist agents that scale @BerkeleySky
Yixuan Wang @YXWangBot
1K Followers 1K Following CS Ph.D. student @Columbia & Intern @AIatMeta | Prev. Boston Dynamics AI Institute, Google X #Vision #Robotics #Learning
Enze Xie @xieenze_jr
995 Followers 194 Following Staff Research Scientist at NVIDIA, doing GenAI, CS PhD from HKU MMLab, interned at NVIDIA.
CyLab @CyLab
10K Followers 2K Following CyLab is @CarnegieMellon's Security & Privacy Institute. Our 300+ researchers are passionate about creating a world in which technology can be trusted.
Hao Zhang @haozhangml
6K Followers 474 Following Asst. Prof. @HDSIUCSD and @ucsd_cse running @haoailab. Cofounder and runs @lmsysorg. 20% with @Snowflake
Hao AI Lab @haoailab
4K Followers 345 Following Hao AI Lab at UCSD. Our mission is to democratize large machine learning models, algorithms, and their underlying systems.
Xiang Yue @xiangyue96
5K Followers 828 Following Postdoc @LTIatCMU. PhD from Ohio State @osunlp. Author of MMMU, MAmmoTH. Training & evaluating foundation models. Opinions are my own.
Jiayi Pan @jiayi_pirate
13K Followers 1K Following 🧑🍳 Reasoning Agents @xAI | PhD on Leave @Berkeley_AI | Views Are My Own
Zihan Wang - on RAGEN @wzihanw
23K Followers 609 Following PhD Student @NorthwesternU. Intern @yutori_ai. I study PhysiCS of LLM. Ex @deepseek_ai @uiuc_nlp @RUC. RAGEN | Chain-of-Experts | ESFT.
Tim Cook @tim_cook
14.9M Followers 70 Following Apple CEO Auburn 🏀 🏈 Duke 🏀 National Parks 🏞️ “Life's most persistent and urgent question is, 'What are you doing for others?'” - MLK. he/him
Elon Musk @elonmusk
225.3M Followers 1K Following
DeepSeek @deepseek_ai
973K Followers 0 Following Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
maharshi @mrsiipa
41K Followers 847 Following ml perf @fal - learning deeply about life one gradient step at a time - personal blog: https://t.co/TYdFfUBImf
Red Hat AI @RedHat_AI
8K Followers 2K Following Deliver AI value with the resources you have, the insights you own, and the freedom you need.
Huazhe Harry Xu @HarryXu12
4K Followers 984 Following Hi, I like reinforcement learning, robots, and video games:) I am an amateur pianist. Assistant Prof at Tsinghua; Postdoc at Stanford; Ph.D. at Berkeley
Eric Xing @ericxing
8K Followers 22 Following Researcher, educator, entrepreneur, and administrator in computer science, artificial intelligence, and healthcare.
Hao Su @haosu_twitr
8K Followers 374 Following Associate Professor @UCSanDiego. Computer Vision, Graphics, Embodied AI, Robotics. Co-Founder of https://t.co/hqCarFwEtc @hillbot_ai