We are seeking a highly motivated and experienced DevOps Lead with +10 years of experience to drive our cloud infrastructure, cloud platforms, application management, and deployment automation / pipeline strategy across Azure and AWS Cloud. As a DevOps Lead, you will be responsible for architecting, designing, building, automating, and maintaining our multi-cloud infrastructure, with a focus on containerization (AKS, EKS), serverless technologies (PaaS), and Infrastructure as Code (IaC). You will lead the implementation of DevOps best practices, cloud infrastructure design, and automation, enabling our development teams to deliver applications quickly and reliably.
Job Description:
Key Responsibilities:
- Cloud Infrastructure Design and Management: Design, deploy, and manage cloud infrastructure on AWS and Azure to ensure performance, scalability, and security for enterprise applications and strategic products. Manage and optimize cloud services like Azure Virtual Machines, AKS, Container Apps, Azure PaaS - App Services, and Azure Functions, AWS EC2, S3, RDS, Lambda, EKS.
- Infrastructure as Code (IaC): Implement IaC solutions using tools such as Terraform, CloudFormation, ARM Templates, and Bicep scripts, for automating provisioning.
- Support Development Teams: Assist development and operations teams in building cloud applications using serverless, managed or containerized services.
- Monitoring and Performance: Implement monitoring, alerting, and logging solutions using Azure Monitor, ELK stack, Prometheus, Grafana, and other tools to ensure infrastructure health.
- Security Compliance: Ensure the security and compliance of cloud infrastructure with best practices, including Azure RBAC, AWS IAM and more.
- Cost Optimization: Develop and implement strategies to optimize cloud costs and reduce operational expenses.
- Disaster Recovery and High Availability: Design and implement disaster recovery and high availability solutions across AWS and Azure environments.
- Network Management: Manage network configurations such as VPCs, subnets, load balancers, and security groups.
- CI/CD Pipelines: Maintain and improve CI/CD pipelines using GitHub Actions, Azure DevOps tools, Brigade.js, or other CI/CD tools for continuous integration and delivery.
- Technical Guidance: Provide technical guidance and support on cloud architecture, best practices, and cloud-based solutions.
- Stakeholder Collaboration: Collaborate with senior stakeholders from across our Global DevOps organization, Cloud Services organization, and Security organization, to align on infrastructure solutions.
Key Requirements:
- Deep understanding of Azure cloud platform and all its core services.
- Ability to define and design an end-to-end cloud platform infrastructure and deployment architecture, comprising all application, networking, and security components, with considerations for high availability.
- Ability to explain the rationale for their infrastructure design and deployment architecture, present options and choices, and defend discussions with senior engineering leads and tech stakeholders on the choice of architecture.
- Azure Active Directory (Entra ID) Management: Manage Microsoft Entra users and groups (create, manage properties, licenses, external users, SSPR). Understanding of how Entra ID integrates with IDP and SSO solutions such as Okta.
- Azure Resource Access Management: Manage built-in Azure roles, assign roles at different scopes, and interpret access assignments.
- Azure Subscription and Governance: Implement and manage Azure Policy, configure resource locks, apply and manage tags, manage resource groups, subscriptions, costs (alerts, budgets, Azure Advisor), and management groups.
- Azure Storage: Configure access (firewalls, VNETs, SAS tokens, stored access policies, access keys, identity-based access for Files), manage storage accounts (create, configure redundancy, object replication, encryption), manage data (Storage Explorer, AzCopy), configure Azure Files and Blob Storage (file shares, containers, storage tiers, snapshots, soft delete, lifecycle management, versioning).
- Azure Compute: Automate resource deployment (ARM templates, Bicep), create and configure virtual machines (disk encryption, resizing, availability zones/sets, scale sets), provision and manage containers (Azure Container Registry, Container Instances, Container Apps, scaling), create and configure App Service (plans, scaling, certificates, TLS, custom DNS, backup, networking, deployment slots).
- Azure Virtual Networking: Configure and manage virtual networks (subnets, peering, public IPs, routes), secure access (NSGs, application security groups, Azure Bastion, service/private endpoints), configure name resolution and load balancing (Azure DNS, load balancers).
- Azure Resource Monitoring and Maintenance: Monitor resources (metrics, logs, alerts, action groups, alert processing rules), configure monitoring of VMs, storage, and networks (Azure Monitor Insights), use Azure Network Watcher and Connection Monitor.
- Backup and Recovery: Create Recovery Services/Backup vaults, configure backup policies, perform backup and restore operations, configure Azure Site Recovery, perform failovers, configure and interpret reports and alerts.
- Extensive experience with Azure Kubernetes Service (AKS) and Azure Container Registry.
- Extensive experience with Service Routing and Service Mesh frameworks such as Istio.
- Extensive experience with Application Lifecycle Management (ALM), Monitoring, and Observability tools.
Preferred Qualifications:
- Azure Certifications: Azure Administrator (AZ-104) or Azure DevOps Engineer Expert certifications are highly desirable.
- Container Orchestration: Experience with other container orchestration platforms (e.g., Docker Swarm) in addition to AKS.
- Monitoring Tools: Familiarity with monitoring and observability platforms like Prometheus, Grafana, and other cloud-native monitoring solutions.
- Scripting: Proficiency in scripting languages (e.g., Python, Bash, PowerShell) for frequent automation tasks.
- Agile Experience: Experience working in an Agile/Scrum environment.
Educational Background
Bachelor’s degree in computer science, Engineering, or a related field
Location:
DGS India - Pune - Kharadi EON Free Zone
Brand:
Dentsu Creative
Time Type:
Full time
Contract Type:
Permanent