Red Hat, Inc.

Machine Learning Engineer Specializing in Model Optimization Algorithms

📍 Location
Toronto, ON
⏰ Job Type
Full-time
📅 Posted
June 01, 2026

Job Description

Elevate AI innovations as a Machine Learning Engineer focused on enhancing model optimization algorithms in a remote workspace. Collaborate with product teams to streamline LLM training and deployment pipelines.
In this role, you will be a key contributor on a dynamic team, developing cutting-edge deep learning software for a variety of applications. Your main focus will be improving inference performance through model compression techniques, working closely with research teams to turn their ideas into robust, production-ready systems. Expect to profile and improve end-to-end LLM performance.
Key Responsibilities:
• Design and develop optimization algorithms for deep learning
• Implement model compression pipelines using quantization techniques
• Maintain speculative decoding frameworks for improved inference
• Collaborate with research scientists on system development
• Optimize LLM memory usage and latency for efficiency<...
