Citi Logo

Citi

Lead Data Engineer - ETL, Big Data (Hadoop, Hive, Apache Spark)– Assistant Vice President - C12 - Pune

Posted 7 Days Ago
Be an Early Applicant
Pune, Mahārāshtra
Senior level
Pune, Mahārāshtra
Senior level
The Lead Data Engineer develops high-quality data products, supports regulatory compliance, and leads the design of scalable data solutions focusing on big data technologies.
The summary above was generated by AI

Job Title: Lead Data Engineer – C12 / Assistant Vice President (India)

The Role

The Data Engineer is accountable for developing high quality data products to support the Bank’s regulatory requirements and data driven decision making. A Data Engineer will serve as an example to other team members, work closely with customers, and remove or escalate roadblocks. By applying their knowledge of data architecture standards, data warehousing, data structures, and business intelligence they will contribute to business outcomes on an agile team.

Responsibilities

  • Developing and supporting scalable, extensible, and highly available data solutions
  • Deliver on critical business priorities while ensuring alignment with the wider architectural vision
  • Identify and help address potential risks in the data supply chain
  • Follow and contribute to technical standards
  • Design and develop analytical data models

Required Qualifications & Work Experience

  • First Class Degree in Engineering/Technology (4-year graduate course)
  • 8 to 12 years’ experience implementing data-intensive solutions using agile methodologies
  • Experience of relational databases and using SQL for data querying, transformation and manipulation
  • Experience of modelling data for analytical consumers
  • Ability to automate and streamline the build, test and deployment of data pipelines
  • Experience in cloud native technologies and patterns
  • A passion for learning new technologies, and a desire for personal growth, through self-study, formal classes, or on-the-job training
  • Excellent communication and problem-solving skills
  • An inclination to mentor; an ability to lead and deliver medium sized components independently

Technical Skills (Must Have)

  • ETL: Hands on experience of building data pipelines. Proficiency in two or more data integration platforms such as Ab Initio, Apache Spark, Talend and Informatica
  • Big Data: Experience of ‘big data’ platforms such as Hadoop, Hive or Snowflake for data storage and processing
  • Data Warehousing & Database Management: Expertise around Data Warehousing concepts, Relational (Oracle, MSSQL, MySQL) and NoSQL (MongoDB, DynamoDB) database design
  • Data Modeling & Design: Good exposure to data modeling techniques; design, optimization and maintenance of data models and data structures
  • Languages: Proficient in one or more programming languages commonly used in data engineering such as Python, Java or Scala
  • DevOps: Exposure to concepts and enablers - CI/CD platforms, version control, automated quality control management
  • Data Governance: A strong grasp of principles and practice including data quality, security, privacy and compliance

Technical Skills (Valuable)

  • Ab Initio: Experience developing Co>Op graphs; ability to tune for performance. Demonstrable knowledge across full suite of Ab Initio toolsets e.g., GDE, Express>IT, Data Profiler and Conduct>IT, Control>Center, Continuous>Flows
  • Cloud: Good exposure to public cloud data platforms such as S3, Snowflake, Redshift, Databricks, BigQuery, etc. Demonstratable understanding of underlying architectures and trade-offs
  • Data Quality & Controls: Exposure to data validation, cleansing, enrichment and data controls
  • Containerization: Fair understanding of containerization platforms like Docker, Kubernetes
  • File Formats: Exposure in working on Event/File/Table Formats such as Avro, Parquet, Protobuf, Iceberg, Delta
  • Others: Experience of using a Job scheduler e.g., Autosys. Exposure to Business Intelligence tools e.g., Tableau, Power BI

Certification on any one or more of the above topics would be an advantage.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Digital Software Engineering

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Top Skills

Ab Initio
Spark
Big Data
BigQuery
Ci/Cd
Databricks
Docker
DynamoDB
ETL
Hadoop
Hive
Informatica
Java
Kubernetes
MongoDB
Mssql
MySQL
Oracle
Python
Redshift
S3
Scala
Snowflake
SQL
Talend

Similar Jobs

20 Hours Ago
Hybrid
Pune, Mahārāshtra, IND
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Manage a team focused on observability and tracing. Architect large-scale platforms, conduct retrospectives, and support DevOps efforts.
Top Skills: AWSBashChaos EngineeringCi/CdGCPGoPythonTdd
Yesterday
Hybrid
Pune, Mahārāshtra, IND
Senior level
Senior level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The role involves building, maintaining, and automating database platforms, including CI/CD pipelines, troubleshooting, and collaborating on product delivery teams.
Top Skills: Azure DevopsAzure Sql DatabaseFoglightGitOctopus DeployPowershellSplunkSQL ServerT-Sql
5 Days Ago
Remote
Hybrid
18 Locations
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Join CrowdStrike's NGSIEM Data Onboarding Team to develop third-party ingest pipelines, ensuring fault tolerance and scalability in cloud systems while optimizing quality assurance and automated testing.
Top Skills: AWSAzureDockerGCPGoKubernetes

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account