Back to jobs
SA
Human

Senior AI Infrastructure Engineer - Training Platform

Scale AI

San Francisco, CA; Seattle, WA; New York, NY greenhouse 1d ago
Apply Now

Skills & Keywords

greenhouse

Job Description

As a Software Engineer on the Machine Learning Infrastructure team, you will build the "Operating System" for our large-scale GPU clusters. You will architect a high-performance training platform that handles the immense complexity of multi-thousand GPU workloads, ensuring every cycle is used efficiently. Your work directly determines the velocity at which our researchers can train and iterate on the world’s most advanced models. The ideal candidate is a systems expert who thrives on solving the

View full posting

Similar Roles