Capco Logo

Capco

Data Engineer (Databrick + Pyspark)

Posted 15 Hours Ago
Be an Early Applicant
Hybrid
Pune, Maharashtra, IND
Senior level
Hybrid
Pune, Maharashtra, IND
Senior level
The Data Engineer will design, develop, and maintain ETL/ELT data pipelines, leveraging PySpark and Databricks to process large datasets, ensuring data quality, reliability, and performance optimization.
The summary above was generated by AI

Job Title: Data Engineer (PySpark / Databricks)

Experience: 5–9 Years Location: Pune (Hybrid – Capco Office)

Job Summary

We are looking for a skilled Data Engineer with strong expertise in PySpark, Databricks, and modern data engineering practices. The ideal candidate will have hands-on experience in building scalable data pipelines, working with large datasets, and leveraging cloud-based data platforms.

Key Responsibilities Design, develop, and maintain scalable ETL/ELT data pipelines Work extensively with PySpark and Apache Spark for large-scale data processing Build and manage workflows using Apache Airflow Develop and optimize data solutions on Databricks (Jobs, Delta Lake) Work with cloud-based data lakes (S3 or equivalent) Write efficient and complex SQL queries for data transformation and analysis Run and manage Spark workloads on EMR Serverless or other managed Spark platforms Ensure data quality, reliability, and performance optimization of pipelines Must Have Skills Strong hands-on experience with PySpark and Apache Spark internals Experience with Databricks (Jobs, Delta Lake) Proficiency in Apache Airflow for workflow orchestration Solid experience building ETL/ELT pipelines at scale Strong SQL skills and experience with Data Warehouse (DWH) systems Experience running Spark workloads on EMR Serverless or managed Spark platforms Hands-on experience with cloud data lakes (S3 or equivalent) Good to Have Skills Experience with Delta Lake / Apache Iceberg Exposure to streaming frameworks (Spark Structured Streaming, Kafka) Familiarity with CI/CD pipelines for data engineering workflows Knowledge of data governance, cataloging, and lineage tools

Top Skills

Apache Airflow
Databricks
Emr Serverless
Pyspark
S3
SQL

Similar Jobs at Capco

7 Hours Ago
Remote or Hybrid
India
Mid level
Mid level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The role involves business analysis and gathering requirements in wholesale credit risk, focusing on model development, regulatory compliance, and credit risk systems management.
Top Skills: Basel IiiCrrEbaEcbPra
Yesterday
Remote or Hybrid
India
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Manage projects in the transaction banking sector with an emphasis on compliance and retail payments, using Agile methodologies and strong stakeholder management.
Top Skills: ConfluenceIso20022JIRA
Yesterday
Remote or Hybrid
India
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Lead end-to-end delivery of digital transformation projects in banking, managing stakeholders and risks while mentoring teams to achieve goals.
Top Skills: AgileBlackrock PlatformsProject Management Methodologies

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account