The Lead DevOps Engineer will design and manage CI/CD pipelines, develop Infrastructure-as-Code, scale container orchestration, and implement MLOps best practices alongside cloud infrastructure optimization.
Job Title: Lead DevOps Engineer (Azure, Terraform)
Employment Type: Full-time Remote (India)
About the Role:
NorthBay, a leading AWS Premier Partner, is seeking a highly skilled Lead DevOps (Azure, Terraform) to join its growing cloud and AI engineering team. This role is ideal for candidates with a strong foundation in cloud DevOps practices and a passion for implementing scalable MLOps solutions.
Key Responsibilities:
● Design, implement, and manage CI/CD pipelines using tools such as Jenkins, GitHub Actions, or Azure DevOps● Develop and maintain Infrastructure-as-Code using Terraform
● Manage and scale container orchestration environments using Kubernetes, including experience with larger production-grade clusters
● Ensure cloud infrastructure is optimized, secure, and monitored effectively
● Collaborate with data science teams to support ML model deployment and operationalization
● Implement MLOps best practices, including model versioning, deployment strategies (e.g., blue-green), monitoring (data drift, concept drift), and experiment tracking (e.g., MLflow)
● Build and maintain automated ML pipelines to streamline model lifecycle management
Required Skills:
● 8 to 12 years of experience in DevOps and/or MLOps roles● Proficient in CI/CD tools: Jenkins, GitHub Actions, Azure DevOps
● Strong expertise in Terraform, including managing and scaling infrastructure across large environments
● Hands-on experience with Kubernetes in larger clusters, including workload distribution, autoscaling, and cluster monitoring
● Strong understanding of containerization technologies (Docker) and microservices architecture
● Solid grasp of cloud networking, security best practices, and observability
● Scripting proficiency in Bash and Python
Preferred Skills:
● Experience with MLflow, TFX, Kubeflow, or SageMaker Pipelines● Knowledge of model performance monitoring and ML system reliability
● Familiarity with AWS MLOps stack or equivalent tools on Azure/GCP
Similar Jobs
Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Design, develop, and maintain Salesforce solutions across Sales, Service, and Experience Clouds including CPQ. Perform Apex and LWC development, unit testing, deployments (Gearset), integrations, and support SOX-compliant change management while collaborating with global teams.
Top Skills:
ApexChange SetsExperience CloudFlowGearsetGitJIRALightning Web Components (Lwc)Process BuilderRest ApiSales CloudSalesforce CpqSalesforce Data LoaderSalesforce DxService CloudSoap ApiSOQLSoslValidation RulesVisualforce
Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Contribute to the development and monitoring of ML and LLM-based security models, including data acquisition, model evaluation, and deployment on AWS infrastructure.
Top Skills:
AWSBedrockCloudwatchGithub ActionsHuggingface TransformersJenkinsLambdaLangchainNumpyPandasPythonPyTorchS3SagemakerScikit-LearnTensorFlow
Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
The Manager of Technical Support Engineering will lead a technical support team, enhance processes, and improve customer experience through collaboration and coaching.
Top Skills:
Salesforce Service Cloud
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

