Congrats OpenAI: Game changer.
More and more benchmarks are being saturated. And it is becoming increasingly difficult to tell how good the models really are. They are now so intelligent (I am aware of the problems with the term) that it is difficult to say unequivocally how…
I finally wrote another blogpost: ysymyth.github.io/The-Second-Hal…
AI just keeps getting better over time, but NOW is a special moment that I call “the halftime”. Before it, training > eval. After it, eval > training. The reason: RL finally works.
Lmk ur feedback so I’ll polish it.
Notes on PaperBench: Evaluating AI’s Ability to Replicate AI Research.
(For LLMs without much post-training dedicated to multi-hop tasks) Apply budget forcing to significantly improve agent performance on complex tasks.