Job Description
Requirements
- PhD in Computer Science, Electrical Engineering, or a related field and at least 10 years of industry or research experience in the field
- Solid background in Machine Learning systems research and experience with performance evaluation and performance modeling
- Hands‑on experience with machine learning / deep learning methods and extensive knowledge of Large Language Model training and inference frameworks
- Academic publications in relevant journals and industry conferences
- Strong knowledge of AI systems, including compute (GPUs, TPUs, etc.) and networking (Ethernet, Infiniband, NVLink, etc.) technologies, and their various topologies, paradigms, and trade‑offs
- Proven track record leading a team to deliver published results
- (Desirable) Demonstrated ability to creatively problem‑solve and apply first‑principles thinking to generate novel ideas, particularly in ambiguous, evolving environments