HG Insights

Software Engineer - Big Data

Posted 7 Days Ago
Hybrid
Pune, Maharashtra
Senior level
Develop and optimize large-scale data pipelines, ensuring system reliability using a variety of big data technologies, while collaborating in agile development processes.
The summary above was generated by AI

Job Title: Software Engineer - Big Data

Department: IDP

About Us

HG Insights is the global leader in technology intelligence, delivering actionable, AI-driven insights through advanced data science and scalable big data solutions. Our Big Data Insights Platform processes billions of unstructured documents and powers a vast data lake, enabling enterprises to make strategic, data-driven decisions. Join our team to solve complex data challenges at scale and shape the future of B2B intelligence.

What You’ll Do:

  • Build and optimize large-scale distributed data pipelines for processing billions of unstructured documents using Databricks, Apache Spark, and cloud-native big data tools.
  • Scale enterprise-grade big-data systems, including data lakes, ETL/ELT workflows, and syndication platforms for customer-facing Insights-as-a-Service (InaaS) products.
  • Implement cutting-edge solutions for data ingestion, transformation, and analytics using Hadoop/Spark ecosystems, Elasticsearch, and cloud services (AWS EC2, S3, EMR).
  • Drive system reliability through automation, CI/CD pipelines (Docker, Kubernetes, Terraform), and infrastructure-as-code practices.
  • Implement data orchestration strategies using Airflow to manage multi-cloud workflows across AWS/Azure/GCP, Kubernetes clusters, and hybrid environments.

What You’ll Be Responsible For

  • Building and troubleshooting complex big data pipelines, including performance tuning of Spark jobs, query optimization, and data quality enforcement.
  • Collaborating in agile workflows (daily stand-ups, sprint planning) to deliver features rapidly while maintaining system stability.
  • Ensuring security and compliance across data workflows, including access controls, encryption, and governance policies.

What You’ll Need

  • BS/MS/Ph.D. in Computer Science or related field, with 5+ years of experience building production-grade big data systems.
  • Extensive experience in Scala/Java for Spark development, including optimization of batch/streaming jobs and debugging distributed workflows.
  • Airflow orchestration (DAGs, operators, sensors) and integration with Spark/Databricks.
  • Distributed workflow scheduling and dependency management.
  • Performance tuning of Airflow DAGs and Spark jobs in multi-tenant environments.
  • Proven track record with:
      • Databricks, Hadoop/Spark ecosystems, and SQL/NoSQL databases (MySQL, Elasticsearch).
      • Cloud platforms (AWS EC2, S3, EMR) and infrastructure-as-code tools (Terraform, Kubernetes).
      • RESTful APIs, microservices architectures, and CI/CD automation.
  • 5+ years of designing, modeling and building big data pipelines in an enterprise work setting.

Top Skills

Airflow
Spark
AWS EC2
Databricks
Docker
Elasticsearch
EMR
Hadoop
Kubernetes
NoSQL
RESTful APIs
S3
SQL
Terraform
