NVIDIA Gruppe

Senior Software Engineer, AI Inference

📍 Location
Toronto, ON
⏰ Job Type
Full-time
📅 Posted
May 31, 2026

Job Description

Help us push the boundaries of AI inference at NVIDIA, where your systems expertise shapes both the technology and the teams building on top of it! We're looking for a Senior Software Engineer to work at the frontier of large-scale LLM serving, partnering directly with some of the world's most technically demanding customers to unlock the full performance potential of NVIDIA's inference stack.

In this role, you'll combine deep systems knowledge with hands-on customer engagement: profiling real deployments, benchmarking across GPU clusters, and turning insights into improvements that ripple across the open-source ecosystem. Do you love digging into performance problems that don't have obvious answers, and want your work to have an impact far beyond a single codebase? We'd love to talk.

Unlike traditional customer-facing engineering roles, we expect you to go far deeper, contributing to vLLM, NVIDIA Dynamo, and the tooling that makes every engineer on your team more effective.

What Yo...
