Scale AI @scale_AI, Twitter Profile

Scale AI @scale_AI

a month ago

Not all data is created equal. Scaling quality control for data that can challenge PhDs and the most advanced LLMs demands a different approach. To meet this demand, we built something new: autoraters powered by multi-agent model debate. Here’s how it works 🧵

5 15 78 14K 31

Download Image

Scale AI @scale_AI

a month ago

The demand for PhD-level, multimodal reasoning data is exploding. This type of data is critical for training models via reinforcement learning to tackle complex, extended reasoning. The challenges? Manual review is slow, even experts can miss subtle errors, and scaling the process is nearly impossible.