Tip for debugging with an LLM: instead of asking it to just stare at the code and guess what’s wrong, ask it to create a script that prints out everything you need to diagnose the issue, then give it the output. Suddenly your LLM becomes 10x better at debugging.
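A minimal sketch of what such a diagnostic script might look like. The function name `collect_diagnostics` and the example variables are hypothetical, chosen only for illustration; a real script would print whatever state is relevant to the bug at hand.

```python
import json
import platform
import sys


def collect_diagnostics(variables):
    """Gather environment facts and variable summaries into one dict.

    `variables` is a hypothetical mapping of names to the values
    you suspect are involved in the bug.
    """
    return {
        "python_version": sys.version.split()[0],
        "platform": platform.platform(),
        "variables": {
            name: {
                "type": type(value).__name__,
                # Truncate long reprs so the output stays easy to paste.
                "repr": repr(value)[:200],
            }
            for name, value in variables.items()
        },
    }


if __name__ == "__main__":
    # Example values only: a learning rate stored as a string is the
    # kind of problem that is easy to miss by reading code alone.
    suspects = {"batch_size": 32, "learning_rate": "0.001"}
    print(json.dumps(collect_diagnostics(suspects), indent=2))
```

Pasting the printed JSON back into the chat gives the model concrete facts (types, values, versions) to reason about instead of guesses.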
One consequence of much faster internet may be that LLMs will move more and more to the frontend (browser) — better privacy, UX and no serving cost. WebGPU is already working pretty well and one major bottleneck (other than storage) is speed to download these giant models.
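To put the download bottleneck in numbers, here is a rough back-of-the-envelope sketch. The 4 GB model size and the two bandwidths are assumptions picked for illustration, not figures from any specific model or network.

```python
def download_seconds(model_size_gb, bandwidth_mbit_per_s):
    """Ideal download time, ignoring protocol overhead and latency."""
    size_bits = model_size_gb * 1e9 * 8          # decimal GB to bits
    bandwidth_bits = bandwidth_mbit_per_s * 1e6  # Mbit/s to bit/s
    return size_bits / bandwidth_bits


# Hypothetical 4 GB model on a 100 Mbit/s and a 1 Gbit/s connection.
for mbit in (100, 1000):
    print(f"{mbit} Mbit/s: {download_seconds(4, mbit):.0f} s")
```

Under these assumptions, a tenfold faster connection cuts the wait from several minutes to about half a minute, which is the difference between an unusable and a tolerable first load.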
Been contemplating the fact that in LLMs, the outlier values are often the most important values. If you remove these outliers, your model performance degrades drastically. It's quite poetic and affirms my belief that "the average is overrated".
One cool thing from Gemini’s technical report no one is talking about is that it can take audio signal natively (as opposed to converting audio to text). This means it can potentially capture speech tones?
Is it possible to solve NLP tasks by simply following instructions that define the tasks? How can we measure the progress?
Excited to announce Natural Instructions v2, a collection of 1600+ diverse language tasks and their expert-written instructions!
📜arxiv.org/abs/2204.07705
Everybody wants their models to run faster. However, researchers often cargo cult performance without a solid understanding of the underlying principles.
To address that, I wrote a post called "Making Deep Learning Go Brrrr From First Principles". (1/3)
horace.io/brrr_intro.html
Was wondering why these model checkpoint files are so big (~GB). Aren’t they just a bunch of floats (~4 bytes each)? Then realized roberta-large is 355M parameters 🤯🤯🤯
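The arithmetic behind that realization, as a quick sketch. It assumes every parameter is stored as a 4-byte float and ignores any extra state a checkpoint file may contain (for example optimizer state or metadata).

```python
def checkpoint_size_gb(num_parameters, bytes_per_parameter=4):
    """Approximate size of the raw weights in decimal gigabytes."""
    return num_parameters * bytes_per_parameter / 1e9


# 355M parameters at 4 bytes each.
print(f"{checkpoint_size_gb(355_000_000):.2f} GB")  # prints "1.42 GB"
```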
We need a dedicated collection of Toy Datasets for Machine Learning:
1. They can be more interesting than real datasets, especially if designed to be hard for certain algorithms.
2. They are more useful for teaching / learning.
Maybe @huggingface / @kaggle can help with this?
Got hit-and-run by a white work van on Atlantic and Hellman on 9/24, 7:21pm. Neck is a bit sore but otherwise I'm ok. I'm offering $1k cash or $2k to charity of your choice for first person to send me dash cam footage clearly showing the van's license plate (ends in 5G)
Stanford's ~entire AI Department has just released a 200 page 100 author Neural Scaling Laws Manifesto.
They're pivoting to positioning themselves as #1 at academic ML Scaling (e.g. GPT-4) research.
"On the Opportunities and Risks of Foundation Models"
arxiv.org/abs/2108.07258