Computer Vision Engineer currently working as a Machine Learning Engineer.
https://t.co/xLVKLO30rv
https://t.co/DiKrU5Eya5youtube.com/channel/UC8FB3…Joined July 2016
Introducing Critique Fine-Tuning (CFT): a more effective SFT method for enhancing LLMs' reasoning abilities.
📄 Paper: arxiv.org/pdf/2501.17703
CFT is simple: instead of training models to directly answer questions, we train them to critique noisy answers.
What's fascinating is…
🔥 o3-mini-high beats deepseek r1 and o1-pro! in a p5.js challenge!
03-mini result is so good that deserves a video on its own.
deepseek r1 (bad result) and o1-pro (better) in comments below.
Prompt in last comment.
1/4
Transformers can overcome easy-to-hard and length generalization challenges through recursive self-improvement.
Paper on arxiv coming on Monday.
Link to a talk I gave on this below 👇
Super excited about this work!
o3-mini is out!
smart, fast model.
available in ChatGPT and API.
it can search the web, and it shows its thinking.
available to free-tier users! click the "reason" button.
with ChatGPT plus, you can select "o3-mini-high", which thinks harder and gives better answers.
📚🤖 Advanced RAG + Agents Cookbook
A comprehensive open-source guide delivering production-ready implementations of cutting-edge RAG techniques with AI agents. Built with LangChain and LangGraph, it features advanced implementations like Hybrid, Self, and ReAct RAG.
Learn…
Fuck it, today we're open-sourcing the codebase used to train SmolVLM from scratch on 256 H100s🔥
Inspired by our team's effort to open-source DeepSeek's R1 training, we are releasing the training and evaluation code on top of the weights 🫡
Now you can train any of our…
Letter-dropping physics comparison: o3-mini vs. deepseek-r1 vs. claude-3.5 in one-shot - which is the best? Prompt:
Create a JavaScript animation of falling letters with realistic physics. The letters should:
* Appear randomly at the top of the screen with varying sizes
* Fall…
AI Agents for Computer Use
This report provides a comprehensive overview of the emerging field of instruction-based computer control, examining available agents – their taxonomy, development, and resources.
Gemini 2.0 doesn’t get nearly enough credit. I just dumped all my workers-qb source code into it, hit it with a simple, humble prompt, and boom => it one-shotted the docs.
Not just good docs, way better than what I had before, packed with examples.
Kinda insane.
OpenAI o3-mini just one shotted this
prompt: write a script for 100 bouncing yellow balls within a sphere, make sure to handle collision detection properly. make the sphere slowly rotate. make sure balls stays within the sphere. implement it in p5.js
Finished a run (R1 style) GRPO on Qwen-2.5-0.5B (base model) yield +10 accuracy points on GSM8K. Literally just works. Base model scores 41.6% as reported on qwen paper vs 51%~ GRPO
for people learning gpu programming and especially triton should check out liger kernel by linkedin
it was released last year and built on top of triton to provide pre-optimized, ready-to-use implementations gpu optimization techniques specifically tailored for llm training
Excited to announce text-to-api.ai
A website that turns any website into a get API with @firecrawl_dev /extract endpoint. Data on the web has never been more accessible!
Thanks to @Dev__Digest, for starting this fabulous trend. Check out his GitHub repo below!
OpenAI o3-mini is a good model, but DeepSeek r1 is similar performance, still cheaper, and reveals its reasoning.
Better models will come (can't wait for o3pro), but the "DeepSeek moment" is real. I think it will still be remembered 5 years from now as a pivotal event in tech…
OpenAI’s o3-mini is here - a significant jump forward from o1-mini
Initial results (full benchmarking coming soon):
➤ Artificial Analysis Quality Index of 89, matching DeepSeek R1 and just below o1
➤ Cheaper - $1.1/$4.4 input/output pricing per million tokens, lower than many…
When working with o1/o3 models, I always have this feeling that I'm leaving a lot on the table with my prompting. Creating a long sequence of prompts for regular LLMs is good practice. This is because you don't want to overload what an LLM can process (or it'll lead to…
635 Followers 7K Following🧪 VibeCoding Tester (VC Tester) | 🧠 AI Context Engineer
🎯 My job is simple: I vibe-test all AI platforms & processes—to make sure they work and DELIVER VALUE
684 Followers 2K FollowingTransforming Healthcare through AI Innovation, One Nurse at a Time. Founder, The Nurse Intelligence Network. AI Agents & Vibe Coding for Healing and Community
318 Followers 227 FollowingWorking on LLM/VLM Tool Learning and Reasoning at Tsinghua and Bytedance, reading at least one paper a day — The future will not invent itself.
7K Followers 469 Following350,000+ Subscribers On Youtube.
The Latest In Artificial Intelligence News,Research and Updates.
https://t.co/00h8FGPPlZ
https://t.co/rWybeaStSe
9K Followers 3K FollowingDirector of Developer Tech @ NVIDIA, Co-founder/CEO https://t.co/GCUjRDOu73 acquired by NVIDIA • I laugh til I cry it's not the same on zoom •YC W20 | UCSB • views are my own
45K Followers 1K FollowingAI Developer Experience @GoogleDeepMind | prev: Tech Lead at @huggingface, AWS ML Hero 🤗 Sharing my own views and AI News 🧑🏻💻 https://t.co/7IosdlNz22
508K Followers 15 FollowingWelcome to a community built for passionate developers. Microsoft Developer is your resource for tips, research and more to help you build apps that users love.
101K Followers 28 FollowingBuild AI agents over your documents
Github: https://t.co/HC19j7vMwc
Docs: https://t.co/QInqg2zksh
LlamaCloud: https://t.co/yQGTiRSNvj
344K Followers 173 FollowingJoin over 97M users with our private browser, search, Web3 access & more. It only takes 60 seconds to switch. For help, contact @BraveSupport 🦁
22K Followers 321 FollowingThe official handle for #NVIDIAOmniverse. The platform for developing #OpenUSD applications for industrial digitalization and generative physical #AI.
325K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
18K Followers 4K FollowingAI for Life Sciences & Healthcare @NVIDIA | trained as a scientist from @JGI, @UW, @UCBerkeley | building global community @TechBi0 | views are all mine
1K Followers 426 FollowingCEO/Co-Founder @otter_ai, Share, remember, search, playback all your meetings & improve team collaboration. Ex @Google. @Stanford.
2K Followers 639 FollowingBuilding @rapidsai and @dask_dev at @NVIDIA. Tinker with kr8s in my spare time. Views are my own. he/him.
🦋 https://t.co/oxaQdmUkxo
4K Followers 960 FollowingPushing boundaries of AI & making it available for all @nvidiaAI | ex @Samsung AI, @amazon AI, PhD @imperialcollege | TensorLy creator https://t.co/1epTxX3XFB 👨💻
No recent Favorites. New Favorites will appear here.