Back to jobs
N
Human

Deep Learning Architect, LLM Inference - New College Grad 2026

NVIDIA

US, CA, Santa Clara workday 1mo ago
Apply Now

Skills & Keywords

LLMGenerative AI

Job Description

We are now looking for a Deep Learning Architect, LLM Inference! NVIDIA is at the forefront of the generative AI revolution. The Inference Benchmarking (IB) team specifically focuses on inference server performance optimization for Large Language Models (LLMs). If you're passionate about pushing the boundaries of GPU hardware and software performance and understand terms like disaggregated serving, data parallel attention, MoE, Qwen3.5, DeepSeek, GPT-OSS, then this is a great role for you! What

View full posting

Similar Roles