Note: This position does not offer visa sponsorship.
One of our clients is looking for a Software Engineer – ML Infrastructure.
Here's the lowdown:
- Support Our Cutting-Edge Research: Provide the infrastructure muscle for our ML research team.
- Build Powerful Tools: Develop tools to diagnose and troubleshoot cluster issues and hardware failures.
- Become an ML Research Hero: Monitor deployments, manage experiments, and be the go-to person for all things ML research infrastructure.
Are you the one?
- You've got 5+ years of experience supporting infrastructure within an ML environment. (Bonus points for experience with Large Language Models!)
- You're a whiz at developing tools to diagnose ML infrastructure problems and failures.
- Large GPU clusters, high-performance computing/networking? No sweat!
- Cloud and orchestration platforms like Google Compute Engine and Kubernetes are your playground.