Design, deploy, and maintain training clusters, implement distributed training algorithms, and develop tools for AI researchers. Requires experience with Python, PyTorch, and managing HPC clusters.
Experience with configuration management tools (Ansible, Terraform, Puppet, Chef, etc.)
The US base salary range for this full-time position is between $150,000 - $400,000 annually.
The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.