Red Hat

Machine Learning Engineer

📍 Location
toronto, on
⏰ Job Type
Full-time
📅 Posted
May 23, 2026
Apply Now

Job Description

Job Summary

At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open‑source LLMs and vLLM to every enterprise. The Red Hat AI Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers and maintainers of the vLLM project, and inventors of state‑of‑the‑art techniques for model quantization and sparsification, our team provides a stable platform for enterprises to build, optimize, and scale LLM deployments.

As a Machine Learning Engineer focused on model optimization algorithms, you will work closely with our product and research teams to develop SOTA deep learning software. You will collaborate with our technical and research teams to develop LLM training and deployment pipelines, implement model compression algorithms, and productize deep learning research. If you are someone who enjoys bridging research and production, optimizing large models, and contribu...

Start Your Week Right!

Apply now and make every Monday exciting with Red Hat

Apply for this Position