LLM Engineer Jobs

26 open positions across the agentic economy

Role Overview

Top Companies Hiring
Langfuse: 9
Scale AI: 8
Datadog: 3
Google DeepMind: 2
Social Discovery Group: 1
Hopper: 1
Common Skills
greenhouse, ashby, builtin, Remote
Remote Availability

8% of positions are remote

All LLM Engineer positions

Senior NLP / LLM Engineer

Social Discovery Group

Remote · Today

Senior Backend Engineer, MCP, RAG and Fine-Tuning

Hopper

Remote · Today

Senior AI Developer

TripleTen

We’re building an AI Tutor — a personalized learning system that adapts educational content and learning paths for each student. We’re looking for an AI Developer with a strong backend background who can turn large language models into reliable, production-ready features that feel native within the learning experience. What you will do: You’ll join a cross-functional team of experienced backend and frontend engineers, ML specialists, and UX/UI designers building a new generation of AI-powered le

Berlin · Today

Senior LLM Engineer

Datadog

We’re a new team building AI-assisted tools to make Datadog developers more effective, by autonomously generating tests, fixing bugs, and improving performance. We’re looking for a product-minded generalist to help us quickly define and ship products that make all Datadog customers 10x developers. At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can c

Paris, France · 1w ago

Staff Software Engineer - ML Observability

Datadog

The ML Observability team builds cutting-edge tools to monitor, explain, and improve AI systems in production, particularly those leveraging Large Language Models (LLMs) and generative AI. We provide robust, scalable observability for AI workloads, including drift detection, model evaluation, and behavior tracing, enabling customers to ship AI with confidence. As a Staff Engineer, you’ll lead the development of new features and foundational capabilities within Datadog’s LLM Observability prod

Boston, Massachusetts, USA; New York, New York, USA · 1w ago

Research Engineer/Research Scientist, Audio

Anthropic

About Anthropic: Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. Anthropic’s Audio team pushes the boundaries of what's possible with audio with large language models. We care about making safe, steerable, reliable systems that

San Francisco, CA · 1w ago

Product Manager II - Model Lab

Datadog

As a Product Manager II for Model Lab, you will define and launch Datadog’s experiment tracking platform built for teams training and fine-tuning foundational models. Model Lab centralizes metrics, hyperparameters, datasets, code versions, artifacts, and lineage to help ML and AI teams achieve reliable, reproducible, and explainable training runs at scale. This is a 0→1 opportunity to build a new product that will be deeply integrated into Datadog’s observability platform. You will shape the pr

New York, New York, USA · 1w ago

Tech Lead Manager- MLRE, ML Systems

Scale AI

Scale's LLM post-training platform team builds our internal distributed framework for large language model training. The platform powers MLEs, researchers, data scientists, and operators for fast and automatic training and evaluation of LLMs. It also serves as the underlying training framework for the data quality evaluation pipeline. Scale is uniquely positioned at the heart of the field of AI as an indispensable provider of training and evaluation data and end-to-end solutions for the ML lifec

San Francisco, CA; New York, NY · 1w ago

Staff Machine Learning Research Scientist, LLM Evals

Scale AI

As the leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building industry-leading LLM evals, setting new standards for model performance assessment. Our mission is to develop rigorous, scalable, and fair evaluation methodologies to drive the next generation of AI capabilities. Our Research teams work with the industry’s leading AI labs to provide high quality data and accelerate

San Francisco, CA; Seattle, WA; New York, NY · 2w ago

GenAI Strategic Projects Lead, Public Sector

Scale AI

Scale is at the frontier of the AI industry, improving the world’s leading generative AI and large language models through model evaluations, human-powered supervised fine-tuning datasets, world-class reinforcement learning with human feedback, and more. Scale AI’s Public Sector team is growing in the Generative AI space, and we’re seeking a Strategic Projects Lead to own high-impact projects that drive revenue and experimentation. In this role, you’ll work across operations, engineering, and c

Washington, DC · 2w ago

Machine Learning Research Engineer - Robotics

Scale AI

Scale’s Robotics business unit is dedicated to solving the data bottleneck in Physical AI. This position will be a key contributor in conducting applied research in Robotics and developing ML pipelines for training and fine-tuning on data collected by Scale. In this role, you will have the opportunity to advance Robotic research, shape Scale’s robotics offerings, and expand the frontier of Robotics data and model evaluation. You will: Collaborate closely with Robotics customers to drive the indu

San Francisco, CA · 2w ago

ML Research Engineer, ML Systems

Scale AI

Scale’s ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and operators for fast and automatic training and evaluation of LLMs, as well as evaluation of data quality. Scale is uniquely positioned at the heart of the field of AI as an indispensable provider of training and evaluation data and end-to-end solutions for the ML lifecycle. You will work closely across Sca

San Francisco, CA; Seattle, WA; New York, NY · 2w ago

Field Engineer, Public Sector

Scale AI

Scale is a vital part of bringing AI-enabled technologies to the world, from autonomous driving to drones, robots, and large language models. For example, Scale works with the world's top self-driving car and robotics ML teams as well as the largest companies in the generative AI space. As our customer base is growing, you will be on the front lines of our field engineering efforts for our federal AI projects, having the opportunity to meaningfully impact millions of dollars in revenue by workin

San Francisco, CA; New York, NY; Honolulu, Hawaii; St. Louis, MO; Washington, DC · 2w ago

ML Systems Engineer, Robotics

Scale AI

Scale's Physical AI business unit is dedicated to solving the data bottleneck across Robotics, Autonomous Vehicles, and Computer Vision. This position will be a key contributor in conducting applied research in Physical AI and developing ML pipelines for processing, training, and fine-tuning on data collected by Scale, with a specific focus on optimizing algorithms and pipelines to run efficiently on GPUs in the cloud. In this role, you will have the opportunity to advance research, shape Scale’

San Francisco, CA · 2w ago

Evals Engineer, Applied AI

Scale AI

Scale AI is seeking a technically rigorous and driven AI Research Engineer to join our Enterprise Evaluations team. This high-impact role is critical to our mission of delivering the industry's leading GenAI Evaluation Suite. You will be a hands-on contributor to the core systems that ensure the safety, reliability, and continuous improvement of LLM-powered workflows and agents for the enterprise. The ideal candidate has a strong foundational knowledge of large language models, a passion for ta

San Francisco, CA; New York, NY · 2w ago

Research Scientist, Recommendation Systems

Google DeepMind

About Us: Our team operates at the frontier of modern recommender systems. With a proven track record of innovating and deploying novel deep learning algorithms and systems at scale, we are currently focused on building the next-gen Large Recommendation Models by bridging the gap between LLMs and complex behavioral signals. Our research explores user & item tokenizations, continued pre-training, and advanced fine-tuning techniques to build recommendations-native foundation models. Our mission

Mountain View, California, US · 3w ago

Chemist (FTC - 12 Month Fixed Term Contract)

Google DeepMind

Snapshot: As a Chemist in the Responsible Development & Innovation (ReDI) team at Google DeepMind, you will be a principal architect of the safety protocols governing the intersection of Large Language Models (LLMs) and the chemical sciences. You will design and execute rigorous safety evaluations and inform mitigation strategies that ensure our frontier models accelerate scientific discovery without compromising global security. This role is pivotal in deciding when and how our most advanced

Mountain View, California, US · 1mo ago

Senior Forward Deployed Engineer

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 1mo ago

Senior Technical Account Manager

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 1mo ago

Senior Frontend Engineer

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 1mo ago

Senior Backend Engineer

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 1mo ago

Product Engineer (Growth)

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 1mo ago

Senior Software Engineer (SDK)

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 1mo ago

Senior Product Engineer

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 1mo ago

Product Designer

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 4mo ago

DevRel Engineer

Langfuse

ABOUT LANGFUSE: Open Source LLM Engineering Platform that helps teams build useful AI applications via tracing, evaluation, and prompt management. We are now part of Clic

Europe · 6mo ago
