>switched to Qwen-2.5-VL (7B) for better results.
>tmux learning curve has been fun.
>600 samples
>next: switch to Qwen-2.5-VL (72B) and run all 2500 questions with larger rollout.
>work on multi turn and improve single turn.
Thanks for the support 🤗 (100+ and counting).
>switched to Qwen-2.5-VL (7B) for better results.
>tmux learning curve has been fun.
>600 samples
>next: switch to Qwen-2.5-VL (72B) and run all 2500 questions with larger rollout.
>work on multi turn and improve single turn.
Thanks for the support 🤗 (100+ and counting). https://t.co/ejUpI5QVEm
>HLE (humanity's last exam) rl env implementation almost nearing the end.
>Thanks for the opportunity @PrimeIntellect and for the compute 👀
>Also my first open source contribution if it goes well 🙏
50 followers 🥳means a lot for smol accounts
Will be diving deep into >optimisations, distributed training,sglang and start documenting it
Never did it for the clout , pure learning dedication🙏
gpt-oss is out!
we made an open model that performs at the level of o4-mini and runs on a high-end laptop (WTF!!)
(and a smaller one that runs on a phone).
super proud of the team; big triumph of technology.
3K Followers 7K FollowingSoftware,Hardware,Astrophysics, Web3, Stocks and AI enthusiast,In perplexity over answer to life universe and Everything . Founder at https://t.co/R6wDNPGZKK
164 Followers 4K FollowingJunior@Nankai University | Major in CS | Research in CV, GenAI | Full Stack Developer | Beginner in Crypto | Runner, Cyclist, Gym-goer | Rap enthusiast
372 Followers 1K Followingthis user posts engaging and inspiring content that makes readers want to purchase everything that was being advertised
ex @anthropicAI @googledeepmind user
1K Followers 719 FollowingPassionately in love with Science, mostly Altruistic, Engineer, Amateur Astronomer & Critical thinker. Current Research focus: ▫️Mechanistic Interpretability▫️
11K Followers 749 Followingslightly less attractive cofounder @AskEureka: we’re replacing all doctors with AI. I tweet abt healthcare and tech, prev @Harvard @Google @BCG, dm to say hi :)
855 Followers 3K FollowingBorn again Christian!
BS | MS @NDSU
https://t.co/wRn7mSM73s
Building @SpeechSage
Passion for biomedical arenas
Love a good debate, but only if there is purpose
18K Followers 1K FollowingAssistProf @CarnegieMellon. Distinguished Eng @NVIDIA. Creator of @XGBoostProject, @ApacheTVM. Member https://t.co/QYyfjQNp4p, @TheASF. Views are on my own
38K Followers 992 FollowingCreator of bitsandbytes.Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
3K Followers 342 FollowingI’m a software engineer building high-performance kernels and compilers at Anthropic! Previously at Facebook/Meta (PyTorch, HHVM, ReDex)
3K Followers 90 Followingcreator of @electronjs, check https://t.co/ZDJujd4Nql for the open source things I built.
currently sponsored to write a CUDA backend for MLX.
18K Followers 1K FollowingPretraining @xAI. Previously: @InflectionAI, @AIatMeta, @DeepMind, @Google, @LMU_Muenchen, PhD math-ph. Opinions my own. (Can be yours for a small fee.)
14K Followers 399 Following@huggingface engineer. I'm the reason your LLM frontend has a jinja2cpp dependency. Sometimes yells about housing and trans rights instead of working
He/him