Job Description
We are Bagel Labs - a distributed machine learning research lab working towards open-source superintelligence. We ignore years of experience and pedigree. If you have high agency - meaning your default assumption is that you can control the outcome of whatever situation you are in - we want to hear from you. Every requirement below is flexible for a candidate with high enough agency and tolerance for ambiguity.
Overview
You will design and optimize a distributed diffusion model training and serving system. Your focus is on building scalable, fault-tolerant infrastructure that can serve open-source diffusion models across multiple nodes and regions, with efficient support for adaptation techniques.
Key Responsibilities
Design and implement distributed diffusion model inference systems for image, video, and multimodal generation across multiple nodes and regions. Architect high-availability clusters for diffusion model serving with automatic failover, load balancing, an...