Roche Logo

Roche

Data Engineer

Job Posted 20 Days Ago Posted 20 Days Ago
Be an Early Applicant
Pune, Maharashtra
Mid level
Pune, Maharashtra
Mid level
The Data Engineer will build scalable data pipelines and products, drive automation, and ensure efficient data access for analytics and reporting, working closely with various stakeholders.
The summary above was generated by AI

Roche fosters diversity, equity and inclusion, representing the communities we serve. When dealing with healthcare on a global scale, diversity is an essential ingredient to success. We believe that inclusion is key to understanding people’s varied healthcare needs. Together, we embrace individuality and share a passion for exceptional care. Join Roche, where every voice matters.

The Position

Diabetes is a pesky monster — and that’s putting it mildly. If you’re serious about helping us face it head-on, join us!

Being a global leader in integrated Personalized Diabetes Management (iPDM), Roche collaborates with pioneers around the globe, including people with diabetes, caregivers, healthcare providers, and payers. We aim to transform and advance care provision and foster sustainable care structures. Under the brands Accu-Chek and mySugr, comprising glucose monitoring, insulin delivery systems, and digital solutions, we unite with our partners to create patient-centered value. By building and collaborating in an open ecosystem, connecting devices and digital solutions, and contextualizing relevant data points, we enable deeper insights and a better understanding of diabetes, leading to personalized and effective therapy adjustments.  For a quick look at what we do, check out Roche code4life

Here’s what we’re looking for:

  • We’re looking for a motivated Data Engineer who will have a significant impact on our platform making data accessible.
  • You will work with different data stakeholders to gain a good understanding of the data needs.
  • You will build scalable data pipelines, data products and data services that will process high data volumes.
  • You will drive automation - wherever possible and feasible - to ensure that automation is in place to increase quality and efficiency.
  • You will implement the tools and processes to access, manage and work with the data for the reporting, advanced analytics and evidence generation teams. 
  • You will be part of a new team working on making data accessible to generate new products based on data and support clinical and commercial decisions. 

Essential skills for your mission:

  • You have a B.S. degree in Computer Science, Information Systems, Math, Statistics, Engineering or equivalent training

  • You have at least 4 years of professional experience working as a Data Engineer with large scale data platforms, using agile methodologies

  • You have experience implementing data pipelines (using i.e PySpark, Spark SQL, Scala),  orchestration tools/services (i.e. Airflow, data factory) and testing frameworks.

  • You have experience with databases (columnar, NoSQL and relational databases: Redshift, Dynamodb, Snowflake, Postgres and/or Aurora), data modeling and data management tools (Hive, Jupiter, Athena, Zeppelin and/or Databricks).

  • You have experience in one of the main cloud services (AWS, Google Cloud or Azure) with Big Data services (EMR, Databricks, Synapse, HDInsight, Kinesis, etc.)

  • You are familiar with orchestration, automation, integration and continuous delivery frameworks such as Jenkins or Streamsets

  • You have experience with Software engineering best practices, such as unit testing and integration testing, and software development tools, such as IDE, Maven, Git, Docker among others

  • Autonomy in solving technical challenges with a problem solving mindset

  • Great written and verbal communication in English

Bonus Skills:

  • Certification in AWS data engineer, AWS data analytics specialty or equivalent

  • Experience with reporting systems and visualization tools, (preferred: Quicksight, Superset; optional: Tableau, Looker).

  • Experienced in DevOps, DataOps and MLOps.

  • Knowledge with security and privacy regulations (GDPR, HIPAA).

  • Knowledge of the Health Industry.

Here's what you can expect from us:

  • Ambitious and passionate people building meaningful products based in data

  • An innovative agile working environment allowing for collaboration with smart people and knowledge sharing in cross-functional teams 

  • We welcome technical evangelists, so if you are interested in any thought leadership contributions (blogs, conferences) within the realm of the organization, we are happy to support you.

As a Data Engineer, you will have the opportunity to explore and work with a diverse range of technologies , including data processing frameworks, cloud-based infrastructure, and advanced analytics tools. You will gain valuable insights into the unique demands of medical software, such as ensuring data security, managing medical risks, implementing robust and reliable programming practices, and complying with necessary certifications and audits.

You will have direct contact with our users, learning about their daily struggles living with diabetes, as well as customers and partners, understanding the unique needs and mechanics of healthcare systems worldwide. 

Interested?

Great. We’d like to hear from you! Just click that “Apply Now” button and send us your CV… and anything else you think might impress us.

Who we are

At Roche, more than 100,000 people across 100 countries are pushing back the frontiers of healthcare. Working together, we’ve become one of the world’s leading research-focused healthcare groups. Our success is built on innovation, curiosity and diversity.

Roche is an Equal Opportunity Employer.

Top Skills

Airflow
Athena
Aurora
AWS
Azure
Data Factory
Databricks
Docker
DynamoDB
Emr
Git
GCP
Hive
Jenkins
Jupiter
Kinesis
Postgres
Pyspark
Redshift
Scala
Snowflake
Spark Sql
Streamsets
Zeppelin

Roche Pune, Mahārāshtra, IND Office

671-75 Ganeshkhind Road, Pune, Maharashtra, India, 411005

Similar Jobs

19 Hours Ago
Remote
Hybrid
Pune, Maharashtra, IND
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The Sr. Data Engineer will work on projects involving PySpark and Scala with a focus on data analysis and debugging. They will utilize their skills in Spark, GIT, and familiar CICD tools to manage the Big Data Application Life Cycle while ensuring efficient incident management using Control-M and Service Now.
Yesterday
Remote
Hybrid
Pune, Maharashtra, IND
Expert/Leader
Expert/Leader
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The role involves designing and maintaining large-scale data pipelines using AWS services, particularly Glue, while collaborating on ETL solutions.
Top Skills: AthenaAWSAws GlueEc2LambdaPysparkPythonRedshiftS3Spark
4 Days Ago
Hybrid
Mumbai, Maharashtra, IND
Junior
Junior
Financial Services
As a Data Engineer II, you will design and implement scalable data pipelines and ETL processes, maintain data infrastructure, and collaborate with stakeholders for optimizing data access and performance.
Top Skills: AWSAws EmrAzureGitlabGCPJenkinsLambdaNoSQLPysparkPythonRedshiftS3SQL

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account