Job Description
The Opportunity
Are you ready to define the technological landscape of 2026? Nexus Future Systems is seeking a visionary Senior AI Infrastructure Architect to lead the construction of next-generation neural networks and scalable cloud ecosystems. In this pivotal role, you will bridge the gap between theoretical AI research and production-grade engineering, ensuring our platforms are ready for the massive scale of the upcoming decade.
Why Join Us?
We are not just building software; we are architecting the future. You will work with cutting-edge hardware, optimize for quantum-readiness, and collaborate with a team of elite engineers pushing the boundaries of what is possible.
Key Responsibilities
Responsibilities
- Architect Future-Ready Systems: Design and implement scalable, high-performance infrastructure architectures specifically tuned for the computational demands of 2026 AI models.
- Model Deployment: Oversee the end-to-end deployment of large-scale machine learning models, ensuring reliability, latency optimization, and cost-efficiency.
- Infrastructure Optimization: Continuously refine our GPU and TPU resource allocation strategies to maximize throughput and minimize energy consumption in data centers.
- Cross-Functional Leadership: Partner with Research Scientists and Product Managers to translate 2026 roadmap goals into concrete technical specifications.
- Security & Compliance: Implement robust security protocols and ethical AI governance frameworks to protect sensitive data assets.
- Tech Stack Evangelism: Evaluate and integrate emerging technologies (edge computing, neuromorphic chips) to stay ahead of industry standards.
Qualifications
- Education: Masterβs degree in Computer Science, Electrical Engineering, or a related field (PhD preferred).
- Experience: 8+ years of experience in software engineering or systems architecture, with at least 3 years specifically in AI/ML infrastructure.
- Technical Skills: Proficiency in Python, C++, and distributed systems. Deep experience with Kubernetes, Docker, and cloud providers (AWS, GCP, or Azure).
- AI Expertise: Strong understanding of deep learning frameworks (TensorFlow, PyTorch) and model serving technologies (Triton, vLLM).
- Problem Solving: Demonstrated ability to solve complex, ambiguous problems in high-pressure environments.
- Communication: Exceptional written and verbal communication skills, with the ability to articulate complex technical concepts to diverse stakeholders.