Fusemachines Logo

Fusemachines

Senior Data Engineer

Posted 6 Days Ago
Be an Early Applicant
In-Office
Pune, Maharashtra
Senior level
In-Office
Pune, Maharashtra
Senior level
Seeking a Senior Data Engineer to design and implement high-performance data solutions and pipelines, ensuring scalability and reliability in cloud environments.
The summary above was generated by AI

About Fusemachines

Fusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from underserved communities. With a robust presence in four countries and a dedicated team of over 400 full-time employees, we are committed to fostering AI transformation journeys for businesses worldwide. At Fusemachines, we not only bridge the gap between AI advancement and its global impact but also strive to deliver the most advanced technology solutions to the world.
Type: Remote Full-time

Senior Data Engineer

Are you an experienced Data Engineering professional with a passion for building scalable, reliable, and high-performance data systems? Do you have hands-on experience designing and optimizing end-to-end real-time and batch pipelines, and developing cloud-native data architectures using modern technologies such as AWS, GCP, Azure, Databricks, and Snowflake?

We are looking for a Senior Data Engineer to architect, design, and implement scalable, high-performance data solutions. The ideal candidate will be an expert in at least one major cloud data ecosystem (AWS, Azure, GCP, Snowflake, or Databricks) and possess a deep understanding of the end-to-end data lifecycle, from ingestion to business intelligence.
Qualification & Skill Set Requirements
Core Technical Competencies
Experience: 5+ years of hands-on data engineering experience in a production environment.
Languages: Strong proficiency in Python, SQL (complex queries, performance tuning), and PySpark/Apache Spark.
Data Modeling: Expert knowledge of data modeling (3NF, Star, Snowflake Schema) and Lakehouse/Warehouse architectures.
ETL/ELT & Orchestration: Proven experience building pipelines using tools like dbt, Airflow, Dagster, or native cloud orchestrators (Glue, Data Factory, Composer).
Integrations: Experienced in integrating data from diverse sources: APIs, RDBMS/NoSQL databases, flat files, and streaming platforms (Kafka, Kinesis, Pub/Sub).
Cloud Platform Expertise (Specialization-Specific)
Candidates should demonstrate deep expertise in anyone of the following:
Snowflake: SnowSQL, Streams, Tasks, Snowpark, and cost optimization.
Databricks: Delta Lake, Unity Catalog, Delta Live Tables (DLT), and Spark optimization.
GCP: BigQuery, Dataflow, Dataproc, Pub/Sub, and Cloud Functions.
Azure: Synapse Analytics, Data Factory, Azure Databricks, and Stream Analytics.
AWS: Redshift, S3, Lake Formation, Glue, and Lambda.
Professional Practices
SDLC & DevOps: Proficient in Git workflows, CI/CD pipelines (GitHub Actions, Azure DevOps, AWS CodePipeline), and IaC (Terraform/CloudFormation).
Data Governance: Strong understanding of data quality, lineage, observability, security (RBAC, encryption), and compliance frameworks.
Agile: Active experience in Agile/Scrum environments using Jira or Azure Boards.
Mentorship: Ability to lead projects and provide technical guidance to junior/mid-level engineers.
Responsibilities
Architecture: Architect, design, and implement scalable, reliable data solutions and pipelines aligned with business analytics needs.
Optimization: Manage and fine-tune cloud resources and workloads for maximum performance, reliability, and cost-efficiency.
Data Transformation: Lead the development of ETL/ELT processes for both batch and real-time data processing.
Collaboration: Partner with Product, Engineering, and Data Science teams to deliver effective, data-driven solutions.
Governance & Quality: Promote and enforce best practices in data governance, security, and data quality frameworks.
Mentorship: Provide technical leadership and mentorship to the team, ensuring architecture quality and best practices.
Documentation: Maintain comprehensive documentation of data architectures, configurations, and workflows.
Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.

Top Skills

Airflow
Spark
AWS
Azure
Composer
Dagster
Data Factory
Databricks
Dbt
GCP
Glue
Kafka
Kinesis
Pub/Sub
Pyspark
Python
Snowflake
SQL

Similar Jobs

8 Hours Ago
Remote or Hybrid
MH, IND
Junior
Junior
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
As a Technical Account Manager at CrowdStrike, you will advise customers, resolve technical issues, and ensure their security success. Responsibilities include onboarding, conducting health checks, and collaborating with internal teams.
Top Skills: Enterprise Web TechnologiesWindows Operating Systems
12 Hours Ago
Remote or Hybrid
16 Locations
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Sr. Software Engineer will create file format parsers, collaborate on machine learning features, and maintain software systems. Responsibilities include testing, optimization, and documentation.
Top Skills: AWSAzureBitbucketC++GCPGitJenkinsJIRAPythonRust
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Collaboration Tool Engineer is responsible for administering, securing, and optimizing multiple collaboration platforms, ensuring compliance and integration, while enhancing user experience across the organization.
Top Skills: AsanaBoxDropboxKalturaMiroNextup.AiPowershellPythonRest ApisSmartsheetSso/Saml

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account