New Anthropic research: Adding Error Bars to Evals.
AI model evaluations don’t usually include statistics or uncertainty. We think they should.
Read the blog post here: anthropic.com/research/stati…
Thrilled to announce that our paper "Artificial intelligence for literature reviews: opportunities and challenges" has been finally published by the 𝘈𝘳𝘵𝘪𝘧𝘪𝘤𝘪𝘢𝘭 𝘐𝘯𝘵𝘦𝘭𝘭𝘪𝘨𝘦𝘯𝘤𝘦 𝘙𝘦𝘷𝘪𝘦𝘸 journal.
#AI#SLR#research#openaccess#LLMs
📰doi.org/10.1007/s10462…
Are LLMs able to understand observational data, derive the underlying rules that govern it, and use these rules to better model new data instances?
In our new paper, @_emliu investigates LMs' ability to perform this *abductive reasoning* from a number of different perspectives.
Are LLMs able to understand observational data, derive the underlying rules that govern it, and use these rules to better model new data instances?
In our new paper, @_emliu investigates LMs' ability to perform this *abductive reasoning* from a number of different perspectives.
5K Followers 291 FollowingLet's make AI doctors!
Views my own;
CEO @ https://t.co/wvoKT50fKX;
AI Researcher @ Berkeley;
If I block you it's like I'm moving to another convo at a party; nbd.
19K Followers 8K FollowingOn the quest to understand the fundamental mathematics of intelligence and of the universe with curiosity. https://t.co/mMchI2d4pg Upskilling @StanfordOnline
1K Followers 17 FollowingRecent activities in #Robotics, #MachineLearning, #HCI, and #ComputerVision at OMRON SINIC X (OSX).
オムロン サイニックエックス株式会社の公式アカウントです。技術・共創に関するご相談など、お気軽にお問合わせください。
5K Followers 211 FollowingML Engineer at Anlatan (@novelaiofficial). co-author of HDiT (Hourglass Diffusion Transformers). works on diffusion models and LLMs. 日本語を勉強してる。
139 Followers 40 FollowingThe BadWolf project targets to build a temporal graph store abstraction layer. It also allows you acces to the graph data via the BadWolf Query Language (BQL).
5K Followers 4K FollowingGlobal & Unified Access to Knowledge Graphs. Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph
https://t.co/uMZLxiWtT6
2K Followers 362 FollowingOfficial journal Int. Society for Scientometrics and Informetrics. Editors: @lariviev, @RodrigoCostas1, @tang006. Published by @mitpress
@[email protected]
384 Followers 369 Following5th International Workshop on Scientific Knowledge: Representation, Discovery, and Assessment (Sci-K) co-located @iswc_conf 2025 in Nara, Japan
7K Followers 13 FollowingHost of scientistic papers for @aclmeeting and other venues in the field of Natural Language Processing
Not closely monitored, so responses may be delayed.
288K Followers 480 FollowingPython's BDFL-emeritus, Distinguished Engineer at Microsoft, Computer History Fellow, fully vaccinated. Opinions are my own. He/him.