If you're attending @aiDotEngineer on wed, june 4th, check out the recsys track. I'll be hosting talks from Pinterest, LinkedIn, Netflix, Instacart, Youtube. I'll also share 3 ideas that'll likely drive the next few years in recsys: semantic IDs, llm-augmentation, unified models
If you're attending @aiDotEngineer on wed, june 4th, check out the recsys track. I'll be hosting talks from Pinterest, LinkedIn, Netflix, Instacart, Youtube. I'll also share 3 ideas that'll likely drive the next few years in recsys: semantic IDs, llm-augmentation, unified models https://t.co/WbL8yhRw0k
📢 Introducing Biomni - the first general-purpose biomedical AI agent.
Biomni is built on the first unified environment for biomedical agent with 150 tools, 59 databases, and 106 software packages and a generalist agent design with retrieval, planning, and code as action.
This…
Announcing Biomni — the first general-purpose biomedical AI agent. Biomni is a free web platform where biomedical scientists can immediately delegate their tasks to Biomni, starting today!
Biomni automates literature reviews, hypothesis generation, protocol design,…
Here we go, 40% momery savings on TRL GRPOTrainer with Liger Kernel+FSDP+multi GPU training+vLLM rollout is ready to rock! lnkd.in/dq8pP7KJ, get your model train faster and cheaper starting today!
🧠 What is the relationship between the input data and the emergent Neural Collapse phenomenon in Neural Networks?
This has been a long-standing question since the paper by Papyan et al., and it has been gaining attention lately.
We try to answer this question in our @TmlrPub…
📷 Thrilled to share my latest blog post (co-authored with @qingquan_song ) on our open source work integrating the Flash Attention Backend end-to-end in SGLang—now enabled by default in the latest version!
😎 Dive into the details here: hebiao064.github.io/fa3-attn-backe…#LLM#SGLang
GRPO/reasoning enthusiasts - are you using the liger kernel? If not, I strongly suggest you give it a try! It is making an INSANE difference in the number of completions I can train on in a given training step.
We are open sourcing bytecheckpoint and veomni!
bytecheckpoint is the Bytedance's production checkpointing system for foundation model training, battle-tested with jobs with 10k+ GPUs. Blazing fast save/load, load-time checkpoint auto-resharding for different parallelism across…
Exciting news! Our paper, co-authored with @TirerTom, "Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)" is accepted to #CVPR2025! Check out our code here: github.com/tirer-lab/CM4IR.
Excellent results **with only 4 NFEs!**
617 Followers 325 FollowingPostdoctoral Fellow at @PrincetonPLI | Past: Computer Science PhD @TelAvivUni & Apple Scholar in AI/ML | Interested in the foundations of deep learning
1K Followers 516 FollowingHi, I am a PI at ELLIS Institute Tübingen and MPI-IS. Was RS NIF @UniofOxford, JRF @SomervilleOx, postdoc @UTAustin, and PhD @Data_AI_TUe.
450 Followers 6K FollowingGiving meaning to mine share of star dust. Visiting fellow @WinshipAtEmory. Prev at @oracle, @maddox_ai, @KITKarlsruhe, @_nference, @val_iisc, @iitdelhi.
9K Followers 2K FollowingInterested in cognition and artificial intelligence. Research Scientist @DeepMind. Previously cognitive science @StanfordPsych. Tweets are mine.
9K Followers 16 Following@Stanford Prof. National Acad of Eng. Chief Sci @ Visual Layer & Virtue AI. Frm Sr Dir AI @Apple. Co-author of XGBoost, LIME, TextGrad, Alpaca, TVM, GraphLab.
5K Followers 1K FollowingNeurIPS workshop and digital community | 🌐 geometry, algebra, topology + 🤖 deep learning + 🧠 neuroscience | Join us on slack! https://t.co/Run9wPnrDB
856 Followers 470 FollowingAssistant Professor at Umich ECE. Research interest: machine learning, optimization, data science. A runner 🏃 in spare time.
739 Followers 2 FollowingDeep-tech startup. Compilers and auto code generators for high-performance AI. PolyBlocks. High-performance computing with polyhedral and MLIR-based compilers.
981 Followers 631 FollowingPh.D. student working of the foundations of AI/ML at @Penn.
Previously M.A. in Statistics @Wharton @Penn & B. Sc. in EE @ Sharif &
QR Intern @ Point72 (Cubist).
307 Followers 186 FollowingFlatiron Research Fellow@CCM, Flatiron Institute. Machine learning on graphs by exploiting symmetries / Understanding foundation models from first principles
617 Followers 325 FollowingPostdoctoral Fellow at @PrincetonPLI | Past: Computer Science PhD @TelAvivUni & Apple Scholar in AI/ML | Interested in the foundations of deep learning
163K Followers 0 FollowingInvented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
1K Followers 1K FollowingMaths & ML researcher at Université Paris-Cité | Founder & Investor at @eigenventures
On leave from Academia until early 2025 🚀https://t.co/XwA78g040c