Harm de Vries @harmdevries77

Building something new | prev co-lead @BigCodeProject @ServiceNowRsrch | PhD from @Mila_Quebec | harmdevries.com | Amsterdam | Joined September 2022

Tweets: 66 | Followers: 1K | Following: 173 | Likes: 430

Harm de Vries @harmdevries77

9 months ago

We're hiring! Over the past few months, we’ve been building up our agent tech stack. Now we're ready to scale up. If you live and breathe agentic systems and how they are going to impact work—DM me. We just opened a few engineering and product roles, see careers.graidd.com

5 8 64 12K 59

Harm de Vries @harmdevries77

11 months ago

Interesting analogy between the current GenAI revolution and the computer industry from the 80s!

Nikita Arora @Nikita_Arora17

11 months ago

Interesting analogy between the current GenAI revolution and the computer industry from the 80s!

1 4 11 6K 5

0 0 3 2K 2

BigCode @BigCodeProject

a year ago

Introducing 🌸BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks! BigCodeBench goes beyond simple evals like HumanEval and MBPP and tests LLMs on more realistic and challenging coding tasks.

10 62 214 102K 82

Guilherme Penedo @gui_penedo

a year ago

We are (finally) releasing the 🍷 FineWeb technical report! In it, we detail and explain every processing decision we took, and we also introduce our newest dataset: 📚 FineWeb-Edu, a (web only) subset of FW filtered for high educational content. Link: hf.co/spaces/Hugging…

38 309 1K 1.1M 1K

ServiceNow Research @ServiceNowRSRCH

a year ago

It’s been a year since the release of @BigCodeProject’s 💫 StarCoder models and paper: May the source be with you! Join us as we celebrate the anniversary, and share what you’ve done using #StarCoder. Read how StarCoder has helped ServiceNow developers: servicenow.com/blogs/2024/big…

0 7 23 4K 2

Philipp Schmid @_philschmid

a year ago

Self-Instruct for CodeLLMs! 👀 @BigCodeProject released a new StarCoder2-Instruct, the first entirely self-aligned code LLM trained with a transparent and permissive pipeline. 🧑🏻‍💻 It used itself to generate thousands of instruction-response pairs, which were then used to…

2 34 160 29K 117

Andreas Kirsch 🇺🇦 @BlackHC

a year ago

Test-of-time awards should maybe be handed out after a longer period of time but in my opinion this blog post (and the following) were incredibly prescient, and about one year later, everybody in LLMs is doing exactly what it suggested

Harm de Vries @harmdevries77

2 years ago

15 124 660 232K 418

1 4 46 9K 37

Gabriele Sarti @gsarti_

a year ago

LLaMA 3 is testing the limits of @harmdevries77's Law (viz: huggingface.co/spaces/lvwerra… using 8B param & 15T tokens)

Sasha Rush @srush_nlp

a year ago

LLaMA 3 is testing the limits of @harmdevries77's Law (viz: huggingface.co/spaces/lvwerra… using 8B param & 15T tokens) https://t.co/diMxsbv0QG

3 22 221 31K 30

1 4 19 4K 6

Leandro von Werra @lvwerra

2 years ago

Took some time to reflect on the past 1+year of the @BigCodeProject: Here are a few of my learnings from leading it during this time and some ingredients I think are important for a successful open collaboration in ML. What is BigCode? BigCode is an open scientific collaboration…

1 18 69 21K 17

BigCode @BigCodeProject

2 years ago

Introducing: StarCoder2 and The Stack v2 ⭐️ StarCoder2 is trained with a 16k token context and repo-level information for 4T+ tokens. All built on The Stack v2 - the largest code dataset with 900B+ tokens. All code, data and models are fully open! hf.co/bigcode/starco…

13 200 669 221K 247

Terry Yue Zhuo @ SF 🏖️ @terryyuezhuo

2 years ago

Instruction Tuning Code LLMs Using #PEFT methods? Introducing 🌠 ✨Astraios Model Suite: A suite of 28 #StarCoder instruct-tuned using #OctoPack, 7 tuning methods & 4 model sizes, and up to 16B parameters. 📝Extensive Evaluation: 5 tasks & 8 datasets in both Code Comprehension…

1 17 62 13K 33

BigCode @BigCodeProject

2 years ago

Exciting times: we are working on the next generation of StarCoder trained on a new dataset! 🚀 If you would like to have your code excluded from the training run you can check if your data is in the dataset and follow the link to opt-out: huggingface.co/spaces/bigcode…

2 25 92 14K 12

Harm de Vries @harmdevries77

2 years ago

First promising results for pre-training with related documents in the context window, nicely addressing the data issue I explained in my last blog post. Looks de-risked enough to go into llama-3. arxiv.org/abs/2310.10638