H1 Logo

H1

Big Data Engineer

Sorry, this job was removed at 02:48 p.m. (IST) on Thursday, Apr 03, 2025
Remote
Hiring Remotely in India
Remote
Hiring Remotely in India

At H1, we believe access to the best healthcare information is a basic human right. Our mission is to provide a platform that can optimally inform every doctor interaction globally. This promotes health equity and builds needed trust in healthcare systems. To accomplish this our teams harness the power of data and AI-technology to unlock groundbreaking medical insights and convert those insights into action that result in optimal patient outcomes and accelerates an equitable and inclusive drug development lifecycle.  Visit h1.co to learn more about us.


Data Engineering is responsible for the development and delivery of our most important asset—our data. With thousands of data sources from around the world, the team ensures that data is accurate, normalized, and delivered at a velocity that keeps up with real-world changes. As we expand our markets and the scope of data we provide to our customers, our team must scale to meet that demand.


WHAT YOU'LL DO AT H1

As a Data Engineer at H1, you will contribute to the development, optimization, and scaling of our data pipelines and infrastructure. You will collaborate closely with senior engineers, product managers, and cross-functional teams to help build efficient and scalable data solutions. This role is ideal for an engineer eager to deepen their expertise, take ownership of key projects, and contribute to our data platform’s success.


You will:

- Design, develop, and maintain scalable data extraction frameworks to collect data from a variety of  diverse sources.

- Continuously improve and enhance the efficiency and reliability of data collection, extraction, and normalization

- Work with large datasets, transforming and processing structured and unstructured data for downstream use.

- Build and maintain efficient, reliable data pipelines and ETL processes and big data tools such as spark

- Collaborate with senior engineers to improve data architecture and infrastructure.

- Support data integration efforts from multiple sources, ensuring consistency and accuracy.

- Troubleshoot data issues, optimize queries, and improve data retrieval performance.

- Document data processes and workflows, ensuring transparency and repeatability.

- Participate in code reviews and contribute to best practices for clean, maintainable code.

- Engage with cross-functional teams to understand business needs and help translate them into data solutions.


ABOUT YOU

You have solid technical skills in data engineering and a passion for building efficient, scalable solutions. You thrive in a collaborative environment, enjoy learning from experienced team members, and are eager to take on increasing responsibility as you grow in the role.


- You have an understanding of Large Language Models (LLMs) and their applications.

- It’s a bonus if you’re familiar with model training and fine-tuning, particularly in NLP (Natural Language Processing) contexts.

- You possess a basic knowledge of network, security, and encryption protocols such as HTTP/HTTPS/TLS.

- You’re able to work collaboratively across teams and communicate effectively with both technical and non-technical stakeholders.

- You have strong analytical and problem-solving skills with a focus on data quality and performance optimization.

- You have a passion for writing clean, efficient code and following best practices.


REQUIREMENTS

- 3+ years of experience in data engineering, working with large-scale data systems and pipelines.

- Proficiency in programming languages like Python, Java, or similar languages.

- Strong SQL skills, including the ability to write optimized complex queries for  large datasets using advanced SQL operators  such as GROUP BY, HAVING, window functions, and complex joins.

- Experience with big data tools like Apache Spark, particularly on cloud platforms, with a preference for AWS EMR.

- Experience with Docker or other containerization technologies.



Not meeting all the requirements but still feel like you’d be a great fit? Tell us how you can contribute to our team in a cover letter! 

H1 OFFERS

- Full suite of health insurance options, in addition to generous paid time off

- Pre-planned company-wide wellness holidays

- Retirement options

- Health & charitable donation stipends

- Impactful Business Resource Groups

- Flexible work hours & the opportunity to work from anywhere

- The opportunity to work with leading biotech and life sciences companies in an innovative industry with a mission to improve healthcare around the globe


Similar Jobs

23 Days Ago
Remote
Hybrid
Pune, Maharashtra, IND
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The Senior Data Engineer is responsible for developing data pipelines, improving data models, and integrating new data management technologies while consulting on complex projects.
Top Skills: AirflowAnsibleChefHadoopHiveKafkaProtobuf RpcPythonScalaSparkSQLTerraform
Senior level
Software
Lead the software development lifecycle for Big Data solutions, mentor engineers, and drive innovation in machine learning applications.
Top Skills: AWSAzureDeltaDockerGCPHudiIcebergKubernetesPysparkSpark
Yesterday
Remote
Hybrid
Bengaluru, Karnataka, IND
Junior
Junior
Software
The Site Reliability Engineer II at Clari will be responsible for managing third-party services, ensuring their availability and performance, and collaborating with application development teams to implement optimal solutions. The role includes end-to-end ownership of systems, promoting efficient service utilization, and contributing to community enhancements.
Top Skills: AWSCassandraElasticsearchGCPHelmKafkaKubernetesMongodbOpensearchRedisTerraform

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account