❓What is an agent?
I get asked this question a lot, so I wrote a little blog on this topic and other things:
- What is an agent?
- What does it mean to be agentic?
- Why is “agentic” a helpful concept?
- Agentic is new
Check it out here: blog.langchain.dev/what-is-an-age…
Do you know your LLM uses less than 1% of your GPU at inference? Too much time is wasted on KV cache memory access ➡️ We tackle this with the 🎁 Block Transformer: a global-to-local architecture that speeds up decoding up to 20x 🚀
@kaist_ai@LG_AI_Research w/ @GoogleDeepMind 🧵
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Enables MLLMs to express intermediate reasoning as images using code. You probably didn't use typography knowledge to solve this query
proj: whiteboard.cs.columbia.edu
abs: arxiv.org/abs/2406.14562
From RAG to Rich Parameters
Investigates more closely how LLMs utilize external knowledge over parametric information for factual queries.
Finds that in a RAG pipeline, LLMs take a “shortcut” and display a strong bias towards utilizing only the context information to answer the…
Transformer models can learn robust reasoning skills (beyond those of GPT-4-Turbo and Gemini-1.5-Pro) through a stage of training dynamics that continues far beyond the point of overfitting (i.e. with 'Grokking') 🤯
For a challenging reasoning task with a large search space,…
A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges
Explores applications of LLMs in various financial tasks, discussing the challenges, opportunities, and resources for further development in this domain.
📝arxiv.org/abs/2406.11903
I have lots of thoughts on "agents"!
❓What is an agent? Why do the basic agents not work reliably? How are teams bringing "agentic" applications to production
🙏I had a lot of fun talking about these topics (and more!) for nearly a hour with Sonya/Pat
open.spotify.com/episode/786INO…
Learning Iterative Reasoning through Energy Diffusion
abs: arxiv.org/abs/2406.11179
project page: energy-based-model.github.io/ired/
"IRED learns energy functions to represent the constraints between input conditions and desired outputs. After training, IRED adapts the number of…
Transcendence: Generative Models Can Outperform The Experts That Train Them
abs: arxiv.org/abs/2406.11741
Uses chess games as a simple testbed for studying transcedence: generative models trained on human labels that outperform humans.
Transformer models are trained on public…
DeepSeek-Coder-V2: First Open Source Model Beats GPT4-Turbo in Coding and Math
> Excels in coding and math, beating GPT4-Turbo, Claude3-Opus, Gemini-1.5Pro, Codestral.
> Supports 338 programming languages and 128K context length.
> Fully open-sourced with two sizes: 230B (also…
How Do Large Language Models Acquire Factual Knowledge During Pretraining?
Reveals several important insights into the dynamics of factual knowledge acquisition during pretraining
arxiv.org/abs/2406.11813
Google presents Improve Mathematical Reasoning in Language Models by Automated Process Supervision
- MCTS for the efficient collection of high-quality process supervision data
- 51% -> 69.4% on MATH
- No human intervention
arxiv.org/abs/2406.06592
Announcing LiveBench AI - The WORLD'S FIRST LLM Benchmark That Can't Be Gamed!!
We (Abacus AI) partnered with Yann LeCunn and his team to create LiveBench AI!
LiveBench is a living/breathing benchmark with new challenges that you CAN'T simply memorize. Unlike blind human eval,…
Husky
A Unified, Open-Source Language Agent for Multi-Step Reasoning
Language agents perform complex tasks by using tools to execute each step precisely. However, most existing agents are based on proprietary models or designed to target specific tasks, such as
Towards Lifelong Learning of LLMs
Nice survey on techniques to enable LLMs to learn continuously, integrate new knowledge, retain previously learned information, and prevent catastrophic forgetting.
arxiv.org/abs/2406.06391
Simple and Effective Masked Diffusion Language Models
Achieves a new SotA among diffusion models on a range of LM tasks and approaches AR perplexity
repo: github.com/kuleshov-group…
abs: arxiv.org/abs/2406.07524
Synthetic Query Generation using Large Language Models for Virtual Assistants
Apple investigates the use of LLMs to generate synthetic queries for virtual assistants that are similar to real user queries and specific to retrieving relevant entities.
📝arxiv.org/abs/2406.06729
18 Followers 270 FollowingAI & automation for business | No-code/low-code | Dev productivity | Curated AI news digests | Follow for more | Latest posts 👇
📍 https://t.co/gZsBDr7494
589 Followers 2K FollowingLaugh at the confusion, smile through the tears, & keep reminding yourself that everything happens for a reason.
Keep Your Standards High & Expectations Low. 👑
2K Followers 6K FollowingBiz Dev, CyberSecurity, Internet of Things (IoT), Business Intelligence (BI) Salesforce Threads - US Navy Veteran - No Financial advice. #XRP #Applied_Economics
655 Followers 1K FollowingInvestment philosophy: Buy and hold strong companies, dollar cost average. FSD supervisor. My comments are not investment advice.
322 Followers 868 FollowingWallSt FinTech prof pivoting to Defi; Trying to be a wise investor,looking for alpha;Crypto,Blockchain;Not investment advice.Shit posting at times ;)
27 Followers 300 FollowingOur Mission is to help Enterprises Evaluate their AI Agents. We calculate the Enterprise AI Agent's Capability Quotient or AGQ.
Visit https://t.co/lC1zaWXCeu
5K Followers 325 FollowingCEO@Redwood Research (@redwood_ai), working on technical research to reduce catastrophic risk from AI misalignment. [email protected]
56K Followers 853 FollowingFiguring out AI @allen_ai, open models, RLHF, fine-tuning, etc
Contact via email.
Writes @interconnectsai
Wrote The RLHF Book
Mountain runner
110K Followers 3K FollowingCPO @OpenAI, BoD @Cisco @nature_org, LTC @USArmyReserve
Prev: President @Planet, Head of Product @Instagram @Twitter
❤️ @elizabeth ultramarathons kids cats math
50K Followers 3K FollowingAI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
1.3M Followers 1K FollowingCo-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs
91K Followers 8K FollowingCompiling in real-time, the race towards AGI.
🗞️ Don't miss my daily top 1% AI analysis newsletter directly to your inbox 👉 https://t.co/6LBxO8215l
93K Followers 207 FollowingLMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / @lmsysorg. We’re hiring: https://t.co/1OkfLq2n0I
29K Followers 1K FollowingAI, national security, China. Part of the founding team at @CSETGeorgetown (opinions my own). Author of Rising Tide on substack: https://t.co/LKAoyL00iB
106K Followers 372 FollowingSharing practical ways to use AI for you and your business | Insights on Latest AI Tools, Tech Trends & AI Tutorials | DM open for collabs
155K Followers 523 FollowingWhere finance practitioners get started with Python for quant finance, algorithmic trading, and data analysis | Tweets & threads with free Python code & tools.
64K Followers 13 FollowingDevelop profitable trading strategies, build a systematic trading process, and trade your ideas with Python—even if you’ve never done it before.
355K Followers 1K FollowingML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).