Red Hat

Senior LLM Inference & Model Optimization Engineer

📍 Location
toronto, on
⏰ Job Type
Full-time
📅 Posted
June 01, 2026
Apply Now

Job Description

A leading open-source software company seeks a Machine Learning Engineer in Toronto, Canada. You will focus on model optimization algorithms, working closely with product and research teams. Responsibilities include designing and implementing model compression pipelines and optimizing LLM performance. Ideal candidates should have a strong background in machine learning, programming skills in Python, and familiarity with LLM Inference Optimizations. This position offers a collaborative environment fostering continuous learning and innovation.
#J-18808-Ljbffr

Start Your Week Right!

Apply now and make every Monday exciting with Red Hat

Apply for this Position