Back to jobs
R
Human

Staff Machine Learning Engineer, ML Efficiency

Reddit

Remote - United Kingdom aijobs 1d ago
Apply Now

Get roles like this in your inbox

New agentic AI jobs, curated every Thursday. No spam.

Skills & Keywords

Apache SparkC++CachingCloud Cost OptimizationCloud infrastructureCost OptimizationDebuggingDistributed SystemsDistributed TrainingGPU ArchitectureGoJava

Job Description

Build benchmarking frameworks and performance dashboards; Design and build efficient ML training and inference systems; Develop ML tooling for debugging profiling optimization and monitoring; Drive ML platform scalability reliability and cost efficiency; Improve GPU and resource utilization; Lead cross functional initiatives to improve ML engineer productivity; Optimize distributed training infrastructure data pipelines and model serving architectures; Partner to identify bottlenecks and drive p

View full posting

Similar Roles