Lead the development of a Big Data Insights Platform, optimizing data pipelines and systems while mentoring engineers and ensuring compliance.
Job Title: Senior Software Engineer
Department: IDP
About Us
HG Insights is the global leader in technology intelligence, delivering actionable, AI-driven insights through advanced data science and scalable big data solutions. Our Big Data Insights Platform processes billions of unstructured documents and powers a vast data lake, enabling enterprises to make strategic, data-driven decisions. Join our team to solve complex data challenges at scale and shape the future of B2B intelligence.
What You’ll Do
- Design, build, and optimize large-scale distributed data pipelines for processing billions of unstructured documents using Databricks, Apache Spark, and cloud-native big data tools.
- Architect and scale enterprise-grade big-data systems, including data lakes, ETL/ELT workflows, and syndication platforms for customer-facing Insights-as-a-Service (InaaS) products.
- Collaborate with product teams to develop features across databases, backend services, and frontend UIs that expose actionable intelligence from complex datasets.
- Implement cutting-edge solutions for data ingestion, transformation, and analytics using Hadoop/Spark ecosystems, Elasticsearch, and cloud services (AWS EC2, S3, EMR).
- Drive system reliability through automation, CI/CD pipelines (Docker, Kubernetes, Terraform), and infrastructure-as-code practices.
What You’ll Be Responsible For
- Leading the development of our Big Data Insights Platform, ensuring scalability, performance, and cost-efficiency across distributed systems.
- Mentoring engineers, conducting code reviews, and establishing best practices for Spark optimization, data modeling, and cluster resource management.
- Building data pipelines and troubleshooting complex pipeline issues, including performance tuning of Spark jobs, query optimization, and data quality enforcement.
- Collaborating in agile workflows (daily stand-ups, sprint planning) to deliver features rapidly while maintaining system stability.
- Ensuring security and compliance across data workflows, including access controls, encryption, and governance policies.
What You’ll Need
- BS/MS/Ph.D. in Computer Science or related field, with 5+ years of experience building production-grade big data systems.
- Expertise in Scala/Java for Spark development, including optimization of batch/streaming jobs and debugging distributed workflows.
- Proven track record with:
  - Databricks, Hadoop/Spark ecosystems, and SQL/NoSQL databases (MySQL, Elasticsearch).
  - Cloud platforms (AWS EC2, S3, EMR) and infrastructure-as-code tools (Terraform, Kubernetes).
  - RESTful APIs, microservices architectures, and CI/CD automation.
- Leadership experience as a technical lead, including mentoring engineers and driving architectural decisions.
- Strong understanding of agile practices, distributed computing principles, and data lake architectures.
- Experience with Airflow orchestration (DAGs, operators, sensors) and its integration with Spark/Databricks.
- 7+ years of experience designing, modeling, and building big data pipelines in an enterprise setting.
Nice-to-Haves
- Experience with machine learning pipelines (Spark MLlib, Databricks ML) for predictive analytics.
- Knowledge of data governance frameworks and compliance standards (GDPR, CCPA).
- Contributions to open-source big data projects or published technical blogs/papers.
- DevOps proficiency in monitoring tools (Prometheus, Grafana) and serverless architectures.
Top Skills
Spark
AWS EC2
AWS EMR
AWS S3
Databricks
Docker
Elasticsearch
Hadoop
Java
Kubernetes
NoSQL
Scala
SQL
Terraform