navigating the sea of entropy
RF @Microsoft Research | Prev @Adobe, @dtu_delhi
Looking for PhD opportunities!java-abhinav07.github.io Bangalore, IndiaJoined September 2022
My thoughts on how AI will automate my SWE job in 2026
(I will be plainly honest on this post, even though some people on both sides of this debate will be upset. So, please, respect that these are my predictions. I don't want to start an argument, I just want to share my…
You guys don’t understand. Bad grammar and spelling is becoming high signal. Perfection looks too close to an LLM. Being retarded is the only way to differentiate yourself for a machine
Deep research has emerged as a popular task with many recently released models. But beyond lengthy reports, what exactly defines the task? And how to quantify progress?
[New Paper!] We provide an objective defn. centered on claim discovery & a 100-problem benchmark spanning…
Good work & impressive gains (though core RL folks critique entropy methods).
Idea:
- Explore until model's entropy nears natural range.
- Adjust adv at token level for shared/unique tokens (which is already in GRPO via token-level importance-sampling)
🔗arxiv.org/abs/2507.19849
A thread 🧵
TL;DR: We’re working on making NumPy’s cross-platform 128-bit float operations go brrr.... 🔥
So why are quad-precision (128-bit) linear algebra ops so slow and how we’re fixing it?
I can't stress enough how useful this trick has been for me in all these years
It reduces GPU memory by N equal the number of losses, at literally no cost (same speed, exactly same results down to the last decimal digit)
For example ... [1/2]
I can't stress enough how useful this trick has been for me in all these years
It reduces GPU memory by N equal the number of losses, at literally no cost (same speed, exactly same results down to the last decimal digit)
For example ... [1/2]
20K Followers 2K FollowingVC @FlywheelVC. Lecturer, entrep mgmt fin & VC @Stanford. Expert witness. Prev: @NVCA @KauffmanFellows @Intel & 3x founder. I am "trevorloy" on all other apps.
3 Followers 143 FollowingMachine Learning, Data Science, Computer Science Curious Learner!!
here for understanding the market, news & upcoming research.
783 Followers 6K FollowingOnly account am using now old one got harked |Trading for 7 years | investor | turned 400+ students Profitable | post are not investment advice.
1K Followers 1K FollowingCS (HCC) PhD student @UMich | ex research @adobe | Human AI Interaction, Social Computing, Interaction Design | ruminating about des, tech & society🍂💫🌊
16K Followers 357 FollowingRuns an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
5K Followers 889 FollowingFaculty at @ELLISInst_Tue & @MPI_IS, leading the AI Safety and Alignment group.
PhD from @EPFL supported by Google & OpenPhil PhD fellowships.
2K Followers 326 FollowingInventor of @msExcel Flash Fill. Distinguished Scientist @Microsoft leading @ProseMSFT (AI4Code). Connecting ideas, people, and research & practice. Dad
15K Followers 38 FollowingThe AllenNLP team works on language-centered AI that equitably serves humanity. We deliver high-impact research and open-source tools to accelerate progress.
357 Followers 37 FollowingEfficient Systems for Foundation Models Workshop, ICML2025.
Join us if you are interested in the challenges associated with large models training & inference!
20K Followers 2K FollowingThis is the site where I talk about the attacks on science and immigration.
Science is on the other site.
Lab website: https://t.co/vrtbcqRyRn
20K Followers 2K FollowingVC @FlywheelVC. Lecturer, entrep mgmt fin & VC @Stanford. Expert witness. Prev: @NVCA @KauffmanFellows @Intel & 3x founder. I am "trevorloy" on all other apps.