Back to jobs
SA
Human

Staff Machine Learning Research Scientist, LLM Evals

Scale AI

San Francisco, CA; Seattle, WA; New York, NY greenhouse 1w ago

Skills & Keywords

greenhouse

Job Description

As the leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building industry-leading LLM evals, setting new standards for model performance assessment. Our mission is to develop rigorous, scalable, and fair evaluation methodologies to drive the next generation of AI capabilities. Our Research teams work with the industry’s leading AI labs to provide high quality data and accelerate

View full posting

Similar Roles