What Large Language Models Know About Plant Molecular Biology
1. A new benchmark called MOBIPLANT has been introduced to evaluate the capabilities of large language models (LLMs) in plant molecular biology. This benchmark was developed by a consortium of 112 plant scientists…
Supervised learning in DNA neural networks @Nature
1. A groundbreaking study demonstrates that DNA molecules can autonomously perform supervised learning in vitro, a significant leap towards embedding learning capabilities in non-living systems. The research shows that DNA…
How to generate medical training data and rewards that make small models generalize.
A 7B model beats a 72B model by 19.7% on OmniMedVQA.
The model reads medical images and text together like a vision language system.
It creates its own image question answer tasks, then a…
🧬 Massive. Newly released Biomni-R0, a tiny 8B param biomedical AI model surpasses Claude 4 Sonnet and GPT-5, demonstrating the efficiency of domain-specialized training.
The model uses reinforcement learning to push a biomedical agent to expert level.
Biomni-R0 comes in 8B…
🧬 Massive. Newly released Biomni-R0, a tiny 8B param biomedical AI model surpasses Claude 4 Sonnet and GPT-5, demonstrating the efficiency of domain-specialized training.
The model uses reinforcement learning to push a biomedical agent to expert level.
Biomni-R0 comes in 8B… https://t.co/JQ0wBd43yM
🤖 Better LLM Agents for CRM Tasks: Tips and Tricks
CRM tasks are tough for LLMs - even GPT-4o only solves <30% of tasks in our CRMArenaPro benchmark 😬
📝 Blog: sforce.co/4600cWT
💡 Key finding: Showing agents HOW to solve tasks (not just WHAT to solve) dramatically…
A collection of 300+ MCP servers for AI Agents!
Awesome MCP Servers is a curated list of production-ready and experimental MCP servers to supercharge your AI models.
100% open-source.
New in-depth blog post - "Inside vLLM: Anatomy of a High-Throughput LLM Inference System". Probably the most in depth explanation of how LLM inference engines and vLLM in particular work!
Took me a while to get this level of understanding of the codebase and then to write up…
@emnlpmeeting / #EMNLP2025 Accepted Paper: Text2Vis: A Challenging and Diverse Benchmark for Generating Multimodal Visualizations from Text
📝 Paper: arxiv.org/abs/2507.19969
This work introduces Text2Vis, a comprehensive benchmark for evaluating text-to-visualization models…
This is Qwen Chat Web Dev prompt — a powerful, design-focused AI assistant for frontend development. 🚀
We aim to help developers build websites using React or HTML with TailwindCSS, animations, and modern UI patterns — all in one clean code block.
✨ Key features:
✅…
A Multi-Layered Framework for Modeling Human Biology: From Basic AI Agents to a Full-Body AI Agent
1. This study introduces the Full-Body AI-Agent framework, a multi-agent architecture designed to model human biology across molecular to whole-organism scales. Unlike traditional…
wrote a short blogpost on what I think are some limitations of GRPO:
I’ve been playing around with RL finetuning for reasoning tasks and came across a few limitations that i wanted to document here
feedback/corrections are welcome!
I enjoyed reading this LLM-powered data application paper from UCSF: "When the Domain Expert Has No Time and the LLM Developer Has No Clinical Expertise." It gets at the heart of many of my favorite things: accelerating productivity with data analysis, evals in cross-functional…
📢 Big claim in this paper.
ROUGE makes many hallucination detectors look good, but it does not match human judgment, so scores are inflated.
ROUGE score rewards overlap, not truth, which hides real hallucination rates. ROUGE misaligns with humans.
Re-scoring with a judge…
@emnlpmeeting / #EMNLP2025 Accepted Paper: From Charts to Fair Narratives: Uncovering and Mitigating Geo-Economic Biases in Chart-to-Text
📝 Paper: bit.ly/3HNIgXC
This paper presents the first large-scale investigation of geo-economic biases in Vision-Language Models…
1/13 🧵 Today, Bindcraft was published in @Nature , one of the most famous AIs in biology for designing protein–protein interactions (PPI). In my opinion. Bindcraft represents one of the most important advances in the post–AlphaFold2 era.
BREAKING: Israeli strikes on Nasser Hospital in Gaza killed at least 15 people, including three journalists, one of whom worked for Reuters, Palestinian health officials said reut.rs/45OFPfr
More than a dozen Palestinians, including journalists from multiple outlets, killed in Israeli strikes on a southern Gaza hospital, officials say
cnn.it/45QIjKf
87 Followers 894 FollowingExploration over Exploitation.
RA @Mila_Quebec, Research Fellow @UniofOxford. MSc @UWindsor. Interested in Adversarial attacks, security & reliability of LLMs
427 Followers 523 FollowingPrincipal Applied Scientist @Oracle Health AI
ex - Sr. Applied Scientist @Amazon. 🇧🇩
Co-CTO @ReviewAcl.
Music (metal) and NLP research.
Opinions are my own.
83K Followers 8K FollowingCompiling in real-time, the race towards AGI.
🗞️ Don't miss my daily top 1% AI analysis newsletter directly to your inbox 👉 https://t.co/6LBxO8215l
411 Followers 3K FollowingBuilding @ https://t.co/mUxy0JG9iG | Authoring https://t.co/evSH7oeZ18 | Ex Google- Built Google Search's first reasoning agents
4K Followers 24 FollowingThe European Chapter of the Association for Computational Linguistics
An annual Top-tier *ACL conference. #EACL2026 #NLProc
24-29 March 2026
2K Followers 887 FollowingAssociate professor @EmoryUniversity. Working on large language models, LLM inference, reasoning, natural language generation, and various aspects of GenAI.
87 Followers 894 FollowingExploration over Exploitation.
RA @Mila_Quebec, Research Fellow @UniofOxford. MSc @UWindsor. Interested in Adversarial attacks, security & reliability of LLMs
15K Followers 1K FollowingSenior Research Scientist - @google, Adjunct Faculty - @iitmadras, @iitbombay, Ex: @NICT_Publicity
Use of my tweets without permission ➡️ legal action
96 Followers 5 FollowingWe are a researcher community developing scientifically grounded research outputs and robust deployment infrastructure for broader impact evaluations.
13K Followers 2K FollowingSVP and Head of Biomedical AI @Xaira_Thera; Associate Prof @UofT; Chief AI Officer @UHN; former PHD, CS @Stanford; opinions my own. #AI #healthcare #biology
2K Followers 935 FollowingPh.D. student @LTIatCMU and intern at @AIatMeta (FAIR) working on (V)LM Evaluation & Systems that SeIf-Improve | Prev: @kaist_ai @yonsei_u
10K Followers 4K Followingsth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
4K Followers 461 FollowingFollow for AI in Digital Biology and Drug Discovery @NVIDIA, ex Insilico Medicine, ex Yale, PhD UMaryland, views are mine, DM for collabs
163K Followers 0 FollowingInvented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
360 Followers 7 FollowingComputational Linguistics, established in 1974, is the official flagship journal of the Association for Computational Linguistics (ACL).
27K Followers 4K FollowingFounder CEO @precigenetic. single cell biophotonics in real time -currently melanoma | prev: @penn cs and comp bio | molecular biologist, scuba diver, engineer.
53K Followers 766 FollowingCEO & Founder, Chemify. Regius Professor, Scientist & Inventor. Fascinated & in a state of confusion & optimism. Trying to digitize chemistry & make alien life.
3K Followers 73 FollowingMachine Learning for Health (ML4H) • San Diego, 2025
#ml4h2025 • Contact: [email protected]
Follow us on Bluesky: https://t.co/c99yfcFHF4
56K Followers 853 FollowingFiguring out AI @allen_ai, open models, RLHF, fine-tuning, etc
Contact via email.
Writes @interconnectsai
Wrote The RLHF Book
Mountain runner