Skills & Keywords
Artifact trackingCloud ComputingData LineageData orchestrationData pipelineDistributed SystemsDockerExperiment trackingGPU ComputingHPCJAXKubernetes
Job Description
Architect ML research platform; Build large scale experimentation infrastructure; Contribute to scalable platform architecture decisions; Design distributed training pipelines; Develop feature engineering and dataset generation tools; Enhance platform observability for ML workloads; Improve experiment management and model versioning; Optimize compute efficiency and resource scheduling; Troubleshoot complex system issues;
View full posting