Job Title: Software Engineer - Big Data
Department: IDP
About Us
HG Insights is the global leader in technology intelligence, delivering actionable AI-driven insights through advanced data science and scalable big data solutions. Our Big Data Insights Platform processes billions of unstructured documents and powers a vast data lake, enabling enterprises to make strategic, data-driven decisions. Join our team to solve complex data challenges at scale and shape the future of B2B intelligence.
What You’ll Do:
- Build and optimize large-scale distributed data pipelines for processing billions of unstructured documents using Databricks, Apache Spark, and cloud-native big data tools.
- Scale enterprise-grade big data systems, including data lakes, ETL/ELT workflows, and syndication platforms for customer-facing Insights-as-a-Service (InaaS) products.
- Implement cutting-edge solutions for data ingestion, transformation, and analytics using Hadoop/Spark ecosystems, Elasticsearch, and cloud services (AWS EC2, S3, EMR).
- Drive system reliability through automation, CI/CD pipelines (Docker, Kubernetes, Terraform), and infrastructure-as-code practices.
- Implement data orchestration strategies using Airflow to manage multi-cloud workflows across AWS/Azure/GCP, Kubernetes clusters, and hybrid environments.
What You’ll Be Responsible For
- Building and troubleshooting complex big data pipelines, including performance tuning of Spark jobs, query optimization, and data quality enforcement.
- Collaborating in agile workflows (daily stand-ups, sprint planning) to deliver features rapidly while maintaining system stability.
- Ensuring security and compliance across data workflows, including access controls, encryption, and governance policies.
What You’ll Need
- BS/MS/Ph.D. in Computer Science or a related field, with 5+ years of experience building production-grade big data systems.
- Extensive experience in Scala/Java for Spark development, including optimization of batch/streaming jobs and debugging distributed workflows.
- Airflow orchestration (DAGs, operators, sensors) and integration with Spark/Databricks.
- Distributed workflow scheduling and dependency management.
- Performance tuning of Airflow DAGs and Spark jobs in multi-tenant environments.
- Proven track record with:
  - Databricks, Hadoop/Spark ecosystems, and SQL/NoSQL databases (MySQL, Elasticsearch).
  - Cloud platforms (AWS EC2, S3, EMR) and infrastructure-as-code tools (Terraform, Kubernetes).
  - RESTful APIs, microservices architectures, and CI/CD automation.
- 5+ years of designing, modeling, and building big data pipelines in an enterprise work setting.