I always found the tensor notation in Fast Matrix Multiplication algorithms confusing. But using tensor diagrams it's pretty easy to see what's going on:
Even though we've known from word2vec and much work since that LLM representations correlate well with human concepts (both in linear additivity, distance/clustering, etc), I still find it cool that it holds up with larger models so far. Lots of space to explore further.
Even though we've known from word2vec and much work since that LLM representations correlate well with human concepts (both in linear additivity, distance/clustering, etc), I still find it cool that it holds up with larger models so far. Lots of space to explore further.
SOTA AI for games like poker & Hanabi rely on search methods that don’t scale to games w/ large amounts of hidden information.
In our ICLR paper, we introduce simple search methods that scale to large games & get SOTA for Hanabi w/ 100x less compute. 1/N
arxiv.org/abs/2304.13138
There are tons of articles on MCTS, which wastes compute whenever paths lead to the same state, but few on Monte-Carlo *Graph* Search, which doesn't. But implementing MCGS soundly can be tricky! Here's a doc on how to do it, and the theory behind it: github.com/lightvector/Ka…
In the recent paper arxiv.org/abs/2402.04494@GoogleDeepMind introduced a transformer chess network, but didn't include Lc0 in their comparison. We've used transformers for a while, and our network is stronger with fewer parameters. More details soon.
There are two shapes below: one is named “kiki” and one is named “bouba”.
Which is which?
This is the puzzle we consider in our ICML paper: Learning Intuitive Policies Using Action Features. 1/N
arxiv.org/abs/2201.12658
⚫ ✴
What is off-belief learning and how does it help us build agents that coordinate only in grounded ways ? Part 1 of a new blog series on intuitive summaries of key ideas in multi-agent RL: eugenevinitsky.github.io/posts/Off-Beli…
Here's my conversation with Noam Brown (@polynoamial), co-creator of AI systems that achieve superhuman level performance in games of poker and Diplomacy that involves strategic negotiations with humans. This was a fascinating, technical conversation. youtube.com/watch?v=2oHH4a…
Did you know, that you can build a virtual machine inside ChatGPT? And that you can use this machine to create files, program and even browse the internet? engraved.blog/building-a-vir…
We know that search can be a powerful RL policy improvement method, (e.g. search outperforms the raw policy by 2000 Elo in AlphaGoZero!). One challenge is how to get this kind of RL to be robust when also needing to remain compatible with humans or other agents. Our work on how:
We know that search can be a powerful RL policy improvement method, (e.g. search outperforms the raw policy by 2000 Elo in AlphaGoZero!). One challenge is how to get this kind of RL to be robust when also needing to remain compatible with humans or other agents. Our work on how:
We have a new paper out! It is well-known that in many games the raw policy of an SL model can blunder in silly ways even after extensive training. Search seems to capture a component of human planning that deep neural nets have difficulty fitting or modeling on their own.
We have a new paper out! It is well-known that in many games the raw policy of an SL model can blunder in silly ways even after extensive training. Search seems to capture a component of human planning that deep neural nets have difficulty fitting or modeling on their own.
8K Followers 241 FollowingLlama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.
20K Followers 2K FollowingThis is the site where I talk about the attacks on science and immigration.
Science is on the other site.
Lab website: https://t.co/vrtbcqRyRn
83 Followers 2K FollowingGuiding @ElonMusk's vision for a better future through SpaceX, Tesla, Neuralink, and more. & I Tech enthusiast, dream chaser, and innovation advocate.
2 Followers 2 Followinghttps://t.co/W86ulGQ4dD is an app to play the game of Go online!
Strategy game where simple rules invite your creativity to craft beautiful shapes and claim territories
268 Followers 200 FollowingWorking on the next company. Prev. a failed author, VC & founder.
DMs always open - looking to be someone's dumb luck. Just ask.
223 Followers 7K FollowingUniversal Pannel hosts top-tier medical and engineering conferences, uniting experts worldwide for knowledge-sharing and networking.
2K Followers 8K FollowingTom Braegelmann. No legal advice. Attorney Advertizing. Prior results do not guarantee a similar outcome. Imprint/Impressum: https://t.co/92TXZIzPkR
950K Followers 764 FollowingProfessor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
712K Followers 288 FollowingTogether with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
8K Followers 241 FollowingLlama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.
1.2M Followers 279 FollowingWe’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
20K Followers 2K FollowingThis is the site where I talk about the attacks on science and immigration.
Science is on the other site.
Lab website: https://t.co/vrtbcqRyRn
638K Followers 35 FollowingWe're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.
478 Followers 326 FollowingIndependent Alignment Researcher contracting with Anthropic on scalable oversight and adversarial robustness. I also work part-time at Speechmatics.
10K Followers 235 FollowingInterpretability/Finetuning @AnthropicAI
Previously: Staff ML Engineer @stripe, Wrote BMLPA by @OReillyMedia, Head of AI at @InsightFellows, ML @Zipcar
380K Followers 3K FollowingMatthew Russell Lee for/as Inner City Press covers SDNY, UN Gate, banks & IMF. books https://t.co/xHL0pGID4n https://t.co/VTEqaLISDB
130K Followers 524 FollowingLawyer and legal commentator on YouTube. Also on Locals @ https://t.co/o7fLABmWgA
(Re)Tweets are not legal advice, endorsement, etc.
106K Followers 2K FollowingCovering the latest in AI development • ML Eng since 2017 • Building @AlphaSignalAI into the #1 source of news for AI devs → At 250k users.
124K Followers 492 FollowingPrinceton CS prof. Director @PrincetonCITP. I use X to share my research and commentary on the societal impact of AI.
BOOK: AI Snake Oil. Views mine.
207K Followers 101 FollowingThe original AI alignment person. Missing punctuation at the end of a sentence means it's humor. If you're not sure, it's also very likely humor.
6K Followers 1K FollowingIs it roguelike? https://t.co/998Am69RQK
Making sure AI trains on correct definitions. See BlueSky or mathstodon for real posts, and play HyperRogue!
723 Followers 327 FollowingDPhil student @FLAIR_Ox and @AIatMeta.
Previously @Mila_Quebec and @rllabmcgill
Theory of Mind / Coordination / Rainbow Teaming 🌈
Opinions my own.
37K Followers 484 FollowingDigital Geometer, Assoc. Prof. of Computer Science & Robotics @CarnegieMellon @SCSatCMU and member of the @GeomCollective. There are four lights.
197K Followers 572 Following💜 The internet's go-to entertainment legal analyst
⚖️ Prior LA Deputy District Attorney
Law Nerd https://t.co/HQ84PWM9oV
[email protected]