P2P

HPC Specialist for Advanced AI Infrastructure Management

📍 Location
montreal, qc
⏰ Job Type
Full-time
📅 Posted
June 06, 2026
Apply Now

Job Description

Join an innovative team focused on building GPU infrastructure for high-performance AI and ML workloads. This role requires expertise in systems engineering and cloud-based solutions for optimizing large-scale operations.

As an HPC Specialist, you will be integral to the AI and Multi-Asset Systematic Strategies team. Your primary responsibility will be to deploy and maintain GPU infrastructure for complex workloads. Candidates should possess strong skills in optimizing deep learning models and managing distributed systems to ensure optimal performance.

Key Responsibilities:
• Deploy and manage GPU server fleets for LLM workloads
• Create distributed serving solutions for multi-GPU deployments
• Optimize Kubernetes clusters for machine learning tasks
• Configure and manage networking for GPU clusters
• Troubleshoot performance issues across hardware and software

Requirements:
• Bachelor's/Master's in Computer Sci...

Start Your Week Right!

Apply now and make every Monday exciting with P2P

Apply for this Position