Skills & Keywords
A/B Testing; Bias-Variance Tradeoff; CI/CD; Calibration; Confidence Intervals; Data Drift; Data Drift Detection; Drift Detection; LLM Evaluation
Job Description
Build regression test suites for ML and LLM models; Collaborate on ML model architecture improvements; Communicate model health findings to stakeholders; Design ML model validation frameworks; Develop and execute model evaluation protocols; Document evaluation methodologies and monitoring runbooks; Monitor models in production for drift and reliability; Stay current with LLM evaluation and safety techniques
Similar Roles
Senior Staff Machine Learning Engineer
DoorDash
San Francisco, CA; Sunnyvale, CA
Lead AI and Computer Vision specialist
Baker Hughes
Via Felice Matteucci 2, Florence, Italy
Senior Machine Learning Engineer
Roku
Cambridge, United Kingdom
Resident Solutions Architect - Digital Native Business
Databricks
Remote - California