MSD Animal Health Technology Labs Logo

MSD Animal Health Technology Labs

Specialist, Senior Data Engineer

Posted 2 Days Ago
Be an Early Applicant
In-Office
Pune, Maharashtra, IND
Senior level
In-Office
Pune, Maharashtra, IND
Senior level
Design, build, and maintain scalable ETL/ELT data pipelines using SQL, Python, Spark, Airflow and Databricks on AWS. Ensure data quality and performance, optimize Redshift, collaborate with business and analytics teams, and monitor/ troubleshoot production workflows.
The summary above was generated by AI

Job Description

Senior Data Engineer, Specialist 

The Opportunity

  • Based in Hyderabad, join a global healthcare biopharma company and be part of a 130- year legacy of success backed by ethical integrity, forward momentum, and an inspiring mission to achieve new milestones in global healthcare.
  • Be part of an organisation driven by digital technology and data-backed approaches that support a diversified portfolio of prescription medicines, vaccines, and animal health products.
  • Drive innovation and execution excellence. Be a part of a team with passion for using data, analytics, and insights to drive decision-making, and which creates custom software, allowing us to tackle some of the world's greatest health threats.

Our Technology Centers focus on creating a space where teams can come together to deliver business solutions that save and improve lives. An integral part of our company's’ IT operating model, Tech Centers are globally distributed locations where each IT division has employees to enable our digital transformation journey and drive business outcomes. These locations, in addition to the other sites, are essential to supporting our business and strategy.

A focused group of leaders in each Tech Center helps to ensure we can manage and improve each location, from investing in growth, success, and well-being of our people, to making sure colleagues from each IT division feel a sense of belonging to managing critical emergencies. And together, we must leverage the strength of our team to collaborate globally to optimize connections and share best practices across the Tech Centers.

Role Overview

We are looking for a highly motivated Data Engineer to build and maintain scalable, high-performance data pipelines. The ideal candidate will have strong expertise in AWS, SQL, Python, Apache Spark, and Apache Airflow, along with hands-on experience in Databricks as a core data processing platform. Along with exposure to Agentic AI systems and AI-driven data workflows.

What will you do in this role

  • Design, develop, and maintain ETL/ELT pipelines using SQL, Python, and Spark
  • Build and manage data workflows using Apache Airflow for orchestration and scheduling
  • Develop scalable and optimized solutions using AWS services (S3, Glue, Redshift, EMR, Lambda, etc.)
  • Implement and manage data processing pipelines in Databricks (Delta Lake, notebooks, workflows, Unit Catalog)
  • Ensure data quality, reliability, and performance across pipelines
  • Collaborate with analytics, product, and business teams to deliver data solutions
  • Monitor, troubleshoot, and optimize production pipelines

What should you have

  • Strong proficiency in SQL and Python
  • Hands-on experience with Apache Spark (PySpark preferred)
  • Experience working with Apache Airflow for workflow orchestration
  • Solid experience with AWS cloud platform. Redshift performance optimization skills
  • Hands-on experience in Databricks
  • Understanding of data warehousing, data modeling, and ETL design

🔹 Good to Have

  • Experience with CI/CD pipelines and GitHub Actions
  • Knowledge of Pharmaceutical / Life Sciences domain
  • Familiarity with data governance and quality frameworks
  • Exposure to Docker, Kubernetes, or similar technologies

Primary Skills.

  • SQL, Pyton,
  • PySpark
  • Aws cloud  Platform

Who we are

We are known as well-known org Inc., Rahway, New Jersey, USA in the United States and Canada and MSD everywhere else. For more than a century, bringing forward medicines and vaccines for many of the world's most challenging diseases. Today, our company continues to be at the forefront of research to deliver innovative health solutions and advance the prevention and treatment of diseases that threaten people and animals around the world.

What we look for

Imagine getting up in the morning for a job as important as helping to save and improve lives around the world. Here, you have that opportunity. You can put your empathy, creativity, digital mastery, or scientific genius to work in collaboration with a diverse group of colleagues who pursue and bring hope to countless people who are battling some of the most challenging diseases of our time. Our team is constantly evolving, so if you are among the intellectually curious, join us—and start making your impact today.

#HYDIT2025

Required Skills:

Agile Data Warehousing, Amazon Elastic Compute Cloud (Amazon EC2), Amazon Relational Database Service (RDS), Apache Airflow, Apache Spark, Business Intelligence (BI), Computer Science, Database Administration, Databricks Platform, Data Engineering, Data Governance, Data Management, Data Modeling, Data Quality, Data Visualization, Design Applications, Docker Kubernetes Architecture, Information Management, Job Descriptions, Kubernetes, Software Development, Software Development Life Cycle (SDLC), System Designs

Preferred Skills:

Current Employees apply HERE

Current Contingent Workers apply HERE

Search Firm Representatives Please Read Carefully 
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company.  No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails. 

Employee Status:

Regular

Relocation:

VISA Sponsorship:

Travel Requirements:

Flexible Work Arrangements:

Hybrid

Shift:

Valid Driving License:

Hazardous Material(s):

Job Posting End Date:

06/29/2026

*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.

Similar Jobs

2 Days Ago
In-Office
Pune, Maharashtra, IND
Senior level
Senior level
Artificial Intelligence • Pet • Software
Lead design and deliver scalable end-to-end data platforms and pipelines using Spark, Python, SQL, Databricks, Airflow, and AWS. Architect lakehouse and dimensional models, drive CI/CD, mentor engineering teams, and ensure performance, security, and cost-efficiency across enterprise data solutions.
Top Skills: Agentic AiAmazon RedshiftApache AirflowSparkAws GlueAws LambdaAws S3DatabricksDbtDelta LakeGithub ActionsKafkaPysparkPythonSQLUnity Catalog
2 Days Ago
In-Office
Pune, Maharashtra, IND
Senior level
Senior level
Artificial Intelligence • Pet • Software
Lead design and implement end-to-end, scalable data platforms and Lakehouse architectures using Databricks, Spark, Python, SQL, Delta Lake and AWS. Build and optimize large-scale pipelines, orchestration with Airflow, CI/CD (GitHub Actions), Redshift modeling, and data modeling (star/snowflake). Mentor engineering teams, define data standards, and collaborate with stakeholders to deliver analytics and AI-driven data workflows.
Top Skills: Agentic AiAmazon RedshiftApache AirflowSparkAws GlueAws LambdaAws S3DatabricksDatabricks Unity CatalogDbtDelta LakeGithub ActionsKafkaPysparkPythonRedshift Distribution TechniquesSQL
2 Days Ago
In-Office
Pune, Maharashtra, IND
Senior level
Senior level
Artificial Intelligence • Pet • Software
Design, build, and maintain scalable ETL/ELT data pipelines using SQL, Python, and Spark. Orchestrate workflows with Apache Airflow and implement processing in Databricks (Delta Lake, notebooks). Leverage AWS services (S3, Glue, Redshift, EMR, Lambda) to ensure data quality, reliability, and performance. Collaborate with analytics and business teams, monitor production pipelines, and optimize performance. Exposure to agentic AI and AI-driven data workflows is desirable.
Top Skills: Agentic AiApache AirflowSparkAWSCi/CdDatabricksDelta LakeDockerEc2EmrGithub ActionsGlueKubernetesLambdaNotebooksPysparkPythonRdsRedshiftS3SQL

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account