HG Insights

Software Engineer - Big Data

Reposted 3 Days Ago
Hybrid
Pune, Maharashtra
Senior level
Develop and optimize large-scale data pipelines, ensuring system reliability using a variety of big data technologies, while collaborating in agile development processes.
The summary above was generated by AI

Job Title: Software Engineer - Big Data

Department: IDP

About Us

HG Insights is the global leader in technology intelligence, delivering actionable, AI-driven insights through advanced data science and scalable big data solutions. Our Big Data Insights Platform processes billions of unstructured documents and powers a vast data lake, enabling enterprises to make strategic, data-driven decisions. Join our team to solve complex data challenges at scale and shape the future of B2B intelligence.

What You’ll Do

  • Build and optimize large-scale distributed data pipelines for processing billions of unstructured documents using Databricks, Apache Spark, and cloud-native big data tools.
  • Scale enterprise-grade big-data systems, including data lakes, ETL/ELT workflows, and syndication platforms for customer-facing Insights-as-a-Service (InaaS) products.
  • Implement cutting-edge solutions for data ingestion, transformation, and analytics using Hadoop/Spark ecosystems, Elasticsearch, and cloud services (AWS EC2, S3, EMR).
  • Drive system reliability through automation, CI/CD pipelines (Docker, Kubernetes, Terraform), and infrastructure-as-code practices.
  • Implement data orchestration strategies using Airflow to manage multi-cloud workflows across AWS/Azure/GCP, Kubernetes clusters, and hybrid environments.

What You’ll Be Responsible For

  • Building and troubleshooting complex big data pipelines, including performance tuning of Spark jobs, query optimization, and data quality enforcement.
  • Collaborating in agile workflows (daily stand-ups, sprint planning) to deliver features rapidly while maintaining system stability.
  • Ensuring security and compliance across data workflows, including access controls, encryption, and governance policies.

What You’ll Need

  • BS/MS/Ph.D. in Computer Science or related field, with 5+ years of experience building production-grade big data systems.
  • Extensive experience in Scala/Java for Spark development, including optimization of batch/streaming jobs and debugging distributed workflows.
  • Airflow orchestration (DAGs, operators, sensors) and integration with Spark/Databricks.
  • Distributed workflow scheduling and dependency management.
  • Performance tuning of Airflow DAGs and Spark jobs in multi-tenant environments.
  • Proven track record with:
      • Databricks, Hadoop/Spark ecosystems, and SQL/NoSQL databases (MySQL, Elasticsearch).
      • Cloud platforms (AWS EC2, S3, EMR) and infrastructure-as-code tools (Terraform, Kubernetes).
      • RESTful APIs, microservices architectures, and CI/CD automation.
  • 5+ years of designing, modeling, and building big data pipelines in an enterprise work setting.

Top Skills

Airflow
Spark
AWS EC2
Databricks
Docker
Elasticsearch
EMR
Hadoop
Kubernetes
NoSQL
RESTful APIs
S3
SQL
Terraform
