Manage and support data pipelines using Airflow and AWS. Responsibilities include monitoring reliability, performing root cause analysis, optimizing storage costs, and collaborating with data teams on ML workloads.
We are looking for a hands-on Airflow and AWS Platform Support Engineer who keeps data pipelines and ML workloads running reliably in production, manages access and costs, and enables data teams to build safely.
Client JD:
Apache Airflow, AWS (S3, EMR and Bedrock) Administrator:
- Strong experience in Airflow DAG monitoring, including tracking task states, resolving DAG execution delays, and ensuring reliability across distributed environments.
- Expertise in failure recovery, including retry strategies, SLA miss handling, backfilling, re-running failed task instances, and ensuring consistent pipeline execution across environments.
- Hands‑on experience providing SLA‑based job execution support, ensuring time‑critical pipelines meet business deadlines and production SLAs are continuously maintained.
- Skilled in performing root cause analysis (RCA) for pipeline failures, including dependency failures, task‑level exceptions, scheduler issues, and platform‑level bottlenecks.
- Experience in managing S3 storage optimization, including lifecycle policies, intelligent tiering, storage class transitions, versioning, and cost‑effective data retention strategies.
- Expertise in securing S3 environments using IAM policies, bucket policies, encryption (KMS), access logging, and object‑level permissions.
- Skilled in conducting cost usage analysis for S3 storage and recommending optimization strategies to reduce operational spend.
- Strong background in administering Amazon EMR clusters, including cluster provisioning, configuration, autoscaling, and lifecycle management.
- Experience supporting Amazon Bedrock environments, including model endpoint configuration, invocation monitoring, access control, and cost governance.
Core Responsibilities
- Support and operate Apache Airflow in production environments (Amazon Managed Workflows for Apache Airflow, MWAA, preferred)
- Monitor DAGs, troubleshoot failures, and recover missed or delayed pipelines
- Provide SLA‑based job execution support for critical data pipelines
- Perform root cause analysis (RCA) for Airflow, AWS, and data platform issues
- Support Amazon EMR and EC2 workloads running Spark, Python, and data processing jobs
- Manage S3 storage, access control, lifecycle policies, and cost optimization
- Ensure secure access via IAM roles, bucket policies, and KMS encryption
- Support ML workloads triggered via Airflow (SageMaker / Bedrock integrations)
- Work closely with Data Engineers, ML Engineers, and DevOps teams
- Ensure production platforms are stable, scalable, and cost‑efficient
Mandatory Skills
- Strong hands-on experience in SQL
- Strong hands-on experience with Apache Airflow (operations & support)
- Experience debugging long-running / hung Airflow jobs
- Solid AWS knowledge: S3, EMR, EC2, IAM
- Monitoring & alerting using CloudWatch, Grafana, or similar
- Experience with CI/CD for Airflow DAGs (Dev → Test → Prod)
- Infrastructure automation using Terraform and/or CloudFormation
- Strong troubleshooting mindset in production environments
Good to Have
- Exposure to Databricks or Snowflake
- Experience with ML pipelines / MLOps concepts
- Knowledge of Amazon SageMaker or Amazon Bedrock
- Python scripting for automation/support
- Experience in Platform Support or SRE‑like roles
Part of the $4.8 billion RPG Group, we’re a community of 10,000+ innovators across 30+ global locations, including Milpitas, Seattle, Princeton, Cape Town, London, Zurich, Singapore, and Mexico City. Explore Life at Zensar and join us to Grow. Own. Achieve. Learn. to be the best version of yourself.
We believe the best work happens when individuality is celebrated, growth is encouraged, and well-being is prioritized. We are an equal employment opportunity (EEO) and affirmative action employer, committed to creating an inclusive workplace. All qualified applicants will be considered without regard to race, creed, color, ancestry, religion, sex, national origin, citizenship, age, sexual orientation, gender identity, disability, marital status, family medical leave status, or protected veteran status.
Zensar Technologies Pune, Mahārāshtra, IND Office
Zensar Knowledge Park, Kharadi, Plot # 4, MIDC, Pune, Maharashtra, India, 411014

