Job Description
About Nexus Horizon
We are building the infrastructure that powers the next generation of artificial intelligence. As a leading innovator in the 2026 landscape, we are seeking a visionary Lead AI/ML Infrastructure Engineer to architect scalable, high-performance systems that bridge the gap between theoretical research and production deployment. You will be responsible for ensuring our models run efficiently at massive scale while maintaining rigorous security standards.
Why Join Us?
- Work on cutting-edge AI models that redefine industry standards.
- Competitive compensation package with equity options.
- Flexible remote-first policy with a hub in San Francisco.
Responsibilities
- Design and maintain robust MLOps pipelines for the deployment and scaling of large language models (LLMs).
- Optimize GPU clusters and cloud infrastructure to reduce inference latency and cost.
- Implement automated CI/CD workflows for model versioning and A/B testing.
- Collaborate closely with data scientists to translate research prototypes into reliable production services.
- Enforce security protocols and data privacy standards across all infrastructure layers.
- Lead architectural decisions regarding Kubernetes orchestration and serverless architectures.
Qualifications
- 5+ years of experience in backend engineering, DevOps, or MLOps.
- Expert proficiency in Python, Docker, and Kubernetes.
- Deep experience with cloud providers (AWS, GCP, or Azure) and serverless technologies.
- Strong understanding of machine learning frameworks such as PyTorch, TensorFlow, or JAX.
- Experience with data serialization formats (Protobuf, Avro) and message queues (Kafka, RabbitMQ).
- Excellent problem-solving skills and the ability to thrive in a fast-paced, high-impact environment.