Back to Jobs

MLOps Architect

Remote, USA Full-time Posted 2025-11-03
Role: MLOps Architect Location: New York, NY (Remote) Responsibilities: • Architect for scalable, cost-efficient, reliable and secure MLOps solution. • Design, implement and deploy MLOps solutions in AWS. • Select and justify appropriate ML technology within AWS and Identify appropriate AWS services to implement MLOps solutions. • Design, build, and maintain infrastructure required for efficient development, deployment, and monitoring of machine learning models. • Implement CI/CD pipelines for machine learning applications to ensure smooth development and deployment processes. • Collaborate with data scientists to understand and implement requirements for model serving, versioning, and reproducibility. • Monitor and optimize model performance in production, identifying and resolving issues proactively to ensure optimal results. • Automate repetitive tasks to improve efficiency and reduce the risk of human error in MLOps workflows. • Maintain documentation and provide training to team members on MLOps best practices, ensuring knowledge sharing and collaboration within the team. • Stay updated with the latest developments in MLOps tools, technologies, and methodologies to remain current and effective in your role. Experience: • Minimum 10+ years of experience in MLOps with a proven track record of successful deployments. • In-depth working knowledge of MLOps tools and platforms (Kubernetes, Docker, Jenkins, Git, MLflow, JupyterHub, LLM-specific tooling). • In-depth working knowledge of AWS and infrastructure as code (IaC) principles. • Strong Experience with DevOps methodologies and CI/CD pipelines such as Github Actions. • Strong understanding of machine learning pipelines, model training frameworks, and monitoring techniques. • Strong programming skills in Python • Experience with ML frameworks such as TensorFlow, PyTorch, and/or scikit-learn. • Strong understanding of machine learning lifecycle, including data preprocessing, model training, evaluation, and deployment. • Experience with large language models (LLMs) and their unique operational considerations is a plus. • Excellent communication, collaboration, and problem-solving skills. • The ability to translate technical concepts into clear and concise language. • A passion for innovation and a drive to optimize ML and LLM workflows • 12+ years of experience in MLOps, DevOps, or related fields. • Hands-on experience with AWS. • Familiarity with containerization and orchestration tools like Docker and Kubernetes. • In depth Knowledge of infrastructure-as-code tools such as AWS CDK and Cloudformation. • Excellent problem-solving skills and the ability to work independently as well as part of a team. • Strong communication skills and the ability to explain complex technical concepts to non-technical stakeholders. Preferred Qualifications: • AWS Certified Machine Learning – Specialty • Experience with A/B testing and model performance monitoring Apply tot his job Apply To this Job

Similar Jobs