Skills & Keywords
Agentic data; Apache Hive; Apache Spark; Data Curation; Data Scaling Laws; Data scaling; Language Models; Language Processing; Large Language Models; Machine Learning; Natural Language; Natural Language Processing
Job Description
Advance data research to overcome data walls; Advance data tooling; Architect scalable data curation pipelines; Build agentic data systems; Create synthetic data; Develop foundational language models; Execute pre-training, mid-training, and post-training data curation projects; Improve data velocity across workflows; Lead end-to-end technical projects; Optimize the data mix