Job Description
Drive advancements in AI inference with NVIDIA! As a Senior Software Engineer, leverage your deep systems expertise and hands-on customer engagement to optimize LLM serving and achieve significant performance improvements.
Join NVIDIA to transform AI inference by partnering directly with technical customers and tackling complex performance challenges. In this senior engineering role, you will design comprehensive benchmarking campaigns and develop performance plans to enhance LLM serving deployments across GPU clusters. This position requires collaboration with customer engineering teams and contributions to the open-source community.
Key Responsibilities:
• Collaborate with customers on LLM serving architectures
• Implement end-to-end benchmarking across Kubernetes and Slurm
• Set up and optimize vLLM deployments on GPU clusters
• Build internal tools to enhance team productivity
• Document technical insights and recommend improvements
Requirements: • Bachel...