NVIDIA

NVIDIA Senior Software Engineer, AI Inference

📍 Location
Toronto, ON
⏰ Job Type
Full-time
📅 Posted
June 13, 2026

Job Description

Drive advancements in AI inference with NVIDIA! As a Senior Software Engineer, leverage your deep systems expertise and hands-on customer engagement to optimize LLM serving and achieve significant performance improvements.

Join NVIDIA to transform AI inference by partnering directly with technical customers and tackling complex performance challenges. In this senior engineering role, you will design comprehensive benchmarking campaigns and develop performance plans to enhance LLM serving deployments across GPU clusters. This position requires collaboration with customer engineering teams and contributions to the open-source community.

Key Responsibilities:
• Collaborate with customers on LLM serving architectures
• Implement end-to-end benchmarking across Kubernetes and Slurm
• Set up and optimize vLLM deployments on GPU clusters
• Build internal tools to enhance team productivity
• Document technical insights and recommend improvements

Requirements: • Bachel...
