[Remote] AI Ops Engineer
Note: This is a remote position open to candidates in the USA. Diverse Lynx is seeking an AI Ops Engineer to build, train, and tune machine learning models. The role involves translating data science prototypes into scalable, production-ready ML solutions and collaborating with Data Engineering on feature pipelines.
Responsibilities
- Translate data science prototypes into production-grade ML services and pipelines
- Build training and inference code with reproducibility, versioning, and automated testing
- Implement scalable model serving (online/offline), batching, and latency/throughput optimization
- Integrate model lifecycle tooling (tracking, registry, deployment automation, monitoring)
- Collaborate with Data Engineering on feature pipelines and data contracts
- Own production health: drift detection, performance regression, rollback strategies, and incident response
Skills
- 5+ years of software engineering experience, with 2+ years shipping ML models to production
- Strong Python skills and experience with ML frameworks (TensorFlow/PyTorch)
- Experience with containers and orchestration (Docker/Kubernetes) and API development
- Understanding of ML system design (data leakage, training-serving skew, drift)
- CI/CD and DevOps practices applied to ML workloads (MLOps)