Lead development of big data solutions, focusing on text data processing, data pipelines, and cloud services, primarily in AWS and GCP.
Company Description
At DemandMatrix, our vision is to disrupt the $100 billion sales and marketing intelligence industry by using domain knowledge, machine learning, and AI. Fortune 100 companies such as Microsoft, Google, Adobe, Amazon, and IBM trust us to identify their next customer.
Job Description
What will you do?
- To help us reach the next level, we are looking to onboard a hands-on subject-matter expert (SME) who can leverage big data technology to solve our most complex data problems. You will spend almost half of your time on hands-on coding.
- The work involves large-scale text data processing, event-driven data pipelines, in-memory computation, and optimization across CPU cores, network I/O, and disk I/O.
- You will be using cloud-native services in AWS and GCP.
Who Are You?
- You have a solid grounding in computer engineering, Unix, data structures, and algorithms, which will enable you to meet this challenge.
- You have designed and built multiple big data modules and data pipelines that process large volumes of data.
- You are genuinely excited about technology and have worked on projects from scratch.
Must have:
- 7+ years of hands-on experience in Software Development with a focus on big data and large data pipelines.
- Minimum 3 years of experience building services and pipelines using Python.
- Expertise with a variety of data processing systems, including streaming, event, and batch (Spark, Hadoop/MapReduce)
- Understanding of at least one NoSQL store, such as MongoDB, Elasticsearch, or HBase.
- Understanding of data models, sharding, and data location strategies for distributed data stores in large-scale, high-throughput, high-availability environments, and of their effect on unstructured text data processing.
- Experience running scalable, highly available systems on AWS or GCP.
Good to have:
- Experience with Docker / Kubernetes
- Exposure to CI/CD
- Knowledge of Crawling/Scraping
Benefits:
- Entire Work From Home
- Birthday Leave
- Remote Work
DemandMatrix Pune, Mahārāshtra, IND Office
104 - Tower 1, World Trade Center, Kharadi, Pune, India, 411014
