We’ve updated our evaluation of the best open source LLMs and—spoiler alert—Llama 3 70B Instruct tops the list.
Our complete recommendations:
baseten.co/blog/the-best-…
We’re excited to launch Meta’s Llama 3 in our model library, in both 8B and 70B 🎉
The newly introduced Llama 3 is a significant improvement over Llama 2, with increased tokens, and reduced false refusal rates. These models deliver unparalleled performance, showcasing…
Deploy Mixtral 8x22B in one click!
Mixtral fast facts:
- #1 pretrained model on the Open LLM leaderboard
- Mixture of Experts architecture
- Apache 2.0 license
- Uses 4 A100s in fp16, optimized implementations coming soon!
baseten.co/library/mixtra…
Another first 🎉
Unlock the power of @nvidia's Multi-Instance GPU (MIG) virtualization technology with H100mig GPUs on Baseten: baseten.co/changelog/frac…
40% lower latency and 70% higher throughput for Stable Diffusion XL?
Using NVIDIA TensorRT to optimize each component of the SDXL image generation pipeline, we achieved these performance gains on an H100 GPU.
Full results:
baseten.co/blog/40-faster…
Nomic Embed v1.5 outperforms OpenAI's text-embedding-3-small and ada-002, plus offers variable dimensionality.
This best in class open source text embedding model from @nomic_ai is now available for 1-click deployment in Baseten's model library:
baseten.co/library/nomic-…
Unlocking the full potential of GPUs requires optimizing every part of the model serving stack.
Using TensorRT from @nvidia, we optimize ML models to take full advantage of the H100 Hopper architecture, providing 20-40% better performance per dollar.
baseten.co/blog/unlocking…
Launching today 🎉
Double your throughput or halve your latency for @MistralAI, @StabilityAI + others?
Do both at ~20% lower cost with @nvidia H100s on Baseten.
Here’s how 👇
Mixtral 8x7B beats Llama 2 70B on quality for many benchmarks, but it also wins on inference speed. Here’s why:
- Mixtral is only 46.7B parameters
- Only 12.9B are used during inference
- You can make it even faster with TensorRT-LLM and int8 quantization
Stable Video Diffusion is so much fun. You can now deploy it yourself on Baseten.
@StabilityAI knocked it out of the park with this one 🎉
Lots of examples in the thread below 🧵
baseten.co/blog/stable-vi…
Ready to try open source LLMs?
Switch from GPT to Mistral 7B in the smallest refactor you'll ever ship: just 3 tiny code changes.
If you're making the jump, DM us for $1,000 in free credits.
baseten.co/blog/gpt-vs-mi…
There's a new text embedding model by @JinaAI_ with some exciting properties 👀
- 8,192-token context window (embed chapters, not pages)
- Matches OpenAI's ada-002 on popular benchmarks
Use jina-embeddings-v2 for search & recommendations and pair w/ LLMs like Mistral for RAG.
Presenting your winner, by unanimous decision…
ANDDD…
NEW seven billion parameter champion of the world...
Mistral 7B blew past Llama 2 on the benchmarks. And it can code, too.
Get Mistral 7B with streaming output behind an API endpoint in minutes: app.baseten.co/explore/mistra…
For serverless models, I'm super impressed by baseten.co's team.
They fixed one example model, and then added another model I requested, all in ten minutes.
And easily the fastest server response times too.
@JulienTech@basetenco I haven't seen a GPU start and load my model faster than @basetenco. It goes from 0 to fully operational in a few seconds. You guys are killing it!
Introducing Opendream, a better interface for diffusion models 👨🎨
Opendream brings much needed features to your workflow, such as:
- writing extensions as single Python functions 👀
- layering and non-destructive editing 🍰
- saving and sharing 💾
And it's all open-source.
We’ve got drinks, dinner, demos, and discussion - all we need is you!
Join builders and founders at our next AI meetup this Thursday, August 10th, in San Francisco.
38 Followers 191 Followinghello my dear friends my name is Pardeep Kumar Podia owner of Shivam madical food supplements store and podia fitness life zym Panipat Haryana ( jivan men ek ac
13 Followers 185 FollowingFounder/CEO at New Rand Business Solutions and SyncMoney | Tech enthusiast specializing in FinTech and Neobanking solutions | Proud father of 3
19K Followers 2K FollowingFellow at Henry, Inc. Tech SaaSのPdM、スタートアップ取締役CTOや外資スタートアップのIC等を経て現任。好きな言語はGoとPerlと中国語で雑なOSSを200以上量産している。3 times ISUCON winner. 著書「みんなのGo言語」共著他。Podcast @oss4fun
1K Followers 1K FollowingBlockchain + Art = Tokenized Art | NFT collectiables
I'm YouTuber, See My Works on @DCNmediaUS
Expressing Feelings in NFT Collection @XQUIETH
@Shangritwo 🌝
607 Followers 49 FollowingAutomate common bug triage bottlenecks and focus on what matters with Launchable's Intelligent Test Failure Diagnostics and Bug Triage powered by #AI.
424 Followers 127 FollowingPlug into the Internet of Talent. MindTrust built the world’s first Teams as a Service™ (TaaS) platform to give you on-demand access to the top tech talent.
246 Followers 262 FollowingFind articles, reviews, tutorials, solutions, and the latest videos related to #CloudComputing, #hosting, #security, #Microsoft, #hostingnews, and #Linux.
2K Followers 4K FollowingHandmade with love🧶
#favecrochet
Crocheting is all about warmth and love. If you want to see crochet crafts in action, Fave Crochet is your place.
2K Followers 5K Following- The immortal mystery, is the eternal fascination
- Darkness knows no boundaries and an enlightened mind knows not either
𓅊 אֵל ך
𓂀
⚖️
Ɛ13
2K Followers 3K FollowingFull-service #marketing agency near #Manchester. Providing expertise and creativity within 🎨 design 📱 digital 📈 marketing 📰 PR 📦 print production
875 Followers 4K FollowingI am #digital #marketer.i do all the work of #digitalmarketing, #facebookmarketing #instagrammarketing #linkedinmarketing #pinterestmarketing and #SEO service.
114K Followers 291 FollowingWe cover Real Madrid from the Bernabéu and are the largest Real Madrid podcast in the world. Tactical analysis, RM Femenino, Castilla, club history, and more.
362 Followers 510 FollowingGrowing @Basetenco: best-in-class model hosting for AI startups. Formerly @Stripe. NYC + SF. Let's talk GPUs, sales, or Brazilian jiu-jitsu.
19K Followers 2K FollowingFellow at Henry, Inc. Tech SaaSのPdM、スタートアップ取締役CTOや外資スタートアップのIC等を経て現任。好きな言語はGoとPerlと中国語で雑なOSSを200以上量産している。3 times ISUCON winner. 著書「みんなのGo言語」共著他。Podcast @oss4fun
3K Followers 2K Following- Adding "why" to AI
- Applying #CausalAI to business challenges
- Leading @GeminosAI Community Strategy
Book: https://t.co/EXGpPI8Blv
6K Followers 2K FollowingI might be wrong, joking, or say asinine things. Not financial advice, legal advice, or good ideas. Content may be satire, & is subject to change without notice
95K Followers 2 FollowingNo longer permitted to post here. (Written and automated by @JoeSondow) Mastodon: @[email protected] Bluesky: @RikerGoogling.bsky.social
9K Followers 2K FollowingLover of all things good in life including Family, Friends, Food, and Functional Programs.
⚒️ Creator of https://t.co/DcmmmjzGuU, JS.git, and https://t.co/jjkLYugcAt
9K Followers 662 FollowingJoin us for DevOps World Virtual 2024 by @CloudBees! 🎉 Joins Us Virtually on Sept. 17 and in a city close to you! Register now: https://t.co/GqrUWgjzqK
2K Followers 5K Following- The immortal mystery, is the eternal fascination
- Darkness knows no boundaries and an enlightened mind knows not either
𓅊 אֵל ך
𓂀
⚖️
Ɛ13