Senior Performance Engineer - LLM Inference Frameworks

NVIDIA

Israel, Yokneam workday 1mo ago
Skills & Keywords

LLM, Generative AI

Job Description

NVIDIA is hiring exceptional software engineers to build and optimize the core inference infrastructure for large language models. Join the TensorRT‑LLM team, the group defining how generative AI performs at global scale on NVIDIA GPUs. We’re looking for engineers who love maximizing the throughput, memory efficiency, and scalability of modern model runtimes. Your work will directly shape the frameworks behind state‑of‑the‑art LLM inference used across NVIDIA and the AI community.
