we all know how important LLM evaluation is.. 🤔
i’m excited to FINALLY announce that we are starting a new 📢 recipe-based evals team!!! 📢
for our first study, we compared 5 LLM-generated chili recipes with the prompt: “Give me a chili recipe with an interesting twist” (1/n)
me: can i get a recipe for banana cream pie that is bad
dbrx-instruct: Sure, here's a recipe for a not-so-delicious banana cream pie:
Ingredients:
* 1 pre-made pie crust
* 3 overripe bananas
* 1 cup of mayonnaise
* 1/4 cup of mustard
* 1/4 cup of vinegar
* 1/4 cup of soy sauce…
Evaluating LLMs is really hard! At @MosaicML, we rigorously benchmark models by asking for vegan* banana bread recipes, baking them, and ranking on taste
*we currently do not penalize for responding with non-vegan, but this will change in future
Everything is just better with cactus spikes
“Bagel with real cactus spikes”
“Slippers with real cactus spikes”
“Office chair with real cactus spikes”
“Mobile phone with real cactus spikes”
44K Followers 1K FollowingCTO at @Databricks and CS prof at @UCBerkeley. Working on data+AI, including @ApacheSpark, @DeltaLakeOSS, @MLflow, https://t.co/94gROE5Xa0. https://t.co/nmRYAKG0LZ
7K Followers 714 Followingresearch scientist @databricks! previously @berkeley_ai phd, @googleai and @metaai. interested in human brains and computer brains 💫
3K Followers 359 FollowingAssociate Professor in EECS at @MIT | Founding Advisor at @mosaicml | Programming Systems | Neural Networks | Approximate Computing
3K Followers 949 FollowingChief Science Officer, Co-Founder @datologyai. Former: Head of Data Research @MosaicML; FAIR. 🧠 and 🤖 intelligence // views are from nowhere
7K Followers 714 Followingresearch scientist @databricks! previously @berkeley_ai phd, @googleai and @metaai. interested in human brains and computer brains 💫
31K Followers 877 FollowingVP GenAI @Databricks. Former CEO/cofounder MosaicML & Nervana/IntelAI. Neuro + CS. I like to build stuff that will eventually learn how to build other stuff.
29K Followers 182 FollowingThe platform engineering guy. 👾 I ask the best dev teams about their DevOps practices. Then I tweet about it. Product @Humanitec_com, Baker @ Platform Weekly
94K Followers 890 FollowingDirector of Developer Experience @togethercompute. Building open source AI apps (https://t.co/f8hbvXOFaN, https://t.co/SmHisRTtnp, https://t.co/H3xCBJvVMu, https://t.co/sed83e9OUA).
46K Followers 390 FollowingPowered by the world’s most trusted and largest professional platform, our vision is to create economic opportunity for every member of the global workforce.
68K Followers 2K FollowingCreate Short Links, QR Codes, Landing Pages and Analytics to track the performance of each with ease. Need Support? DM @BitlyCSChannel
93K Followers 3K FollowingJournalist - cyber/national security. Author - COUNTDOWN TO ZERO DAY: Stuxnet and the Launch of the World's First Digital Weapon. https://t.co/334DzfSL1f
3K Followers 1K FollowingCo-Founder at @Phonic_Co. Previously @Stanford CS PhD Dropout, @MosaicML, CS @MIT. I tend to be wrong, but the learning process makes it enjoyable. 🇵🇰🇺🇲
1K Followers 799 Following💡Empowering girls & women to excel in STEAM from the classroom 👩🏫 → to the boardroom👩🔬Show the world 🌎 #WeAreWIT! Join our community today💓
3K Followers 949 FollowingChief Science Officer, Co-Founder @datologyai. Former: Head of Data Research @MosaicML; FAIR. 🧠 and 🤖 intelligence // views are from nowhere
14K Followers 42 FollowingWe are an independent nonprofit organization that believes collaboration opportunities and research training should be openly accessible and free.
43K Followers 313 FollowingBlack in AI empowers a global community of AI professionals of African descent to be full partners in shaping our technological and economic future.
15K Followers 168 FollowingResearch scientist @GoogleDeepMind. Past: @Databricks, first hire @MosaicML, @MIT PhD. I post about AI technical progress + sometimes the business side.
46K Followers 1K Following(On mat leave.) Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS.