Avahi Logo

Avahi

Data Engineer

Posted 10 Days Ago
In-Office or Remote
Hiring Remotely in Pune, Maharashtra, IND
Mid level
In-Office or Remote
Hiring Remotely in Pune, Maharashtra, IND
Mid level
Design, build, and maintain scalable AWS data platforms, implement data governance, and optimize architectures while collaborating with cross-functional teams.
The summary above was generated by AI

This position is open to candidates based in India only. Applications from outside India will not be considered.

Company Overview

At Avahi, we’re redefining what it means to be a premier cloud-first consulting company, recognized for our people, culture, and innovative solutions. With expertise in Managed Services, Reselling, Staffing, and Professional Services, we are dedicated to delivering exceptional value and putting customers first.

As a remote-first, global team spanning North America, Europe, and Southeast Asia, we foster a collaborative and diverse environment where professional growth, creativity, and mutual respect thrive. Guided by our values—Customer-Centricity, Collaboration, Agility, Innovation, Integrity, and Diversity & Inclusion—we empower businesses to embrace the full potential of a cloud-first approach.

Key Responsibilities

  • Design, build, and maintain scalable AWS data platforms supporting batch and streaming pipelines, analytics, and AI/ML workloads, aligned with AWS Well-Architected best practices.
  • Build and operate data ingestion, transformation, and enrichment pipelines from internal systems and external APIs, handling structured, semi-structured, unstructured, and graph data.
    Implement data normalization workflows to ensure consistent schemas, high data quality, and reliable analytics, BI, and ML use cases.
  • Design and enforce data governance including cataloging, lineage, access control, and auditability.
  • Build and maintain knowledge graphs to model relationships across core business entities, enabling advanced analytics and inference.
  • Identify data gaps, inconsistencies, and missing relationships using strong analytical and inference skills.
  • Integrate data from enterprise platforms such as CRM and ERP systems (Salesforce, HubSpot, SAP, NetSuite, Dynamics 365, Workday).
  • Design secure data access layers for analytics, BI, ML, and downstream applications.
    Implement monitoring, observability, and data quality checks for freshness, completeness, and pipeline health.
  • Optimize data architectures for performance and cost efficiency using partitioning, indexing, compression, and storage tiering.
  • Build internal tooling, dashboards, and standardized scaffolding to improve visibility, maintainability, and onboarding.
  • Collaborate with cross-functional teams to deliver high-impact data solutions and share best practices, documentation, and technical guidance.

Required Skills & Qualifications

  • Strong experience designing and operating AWS data platforms, including S3, Glue, Lake Formation, Athena, Redshift, EMR, Kinesis/MSK, DynamoDB, OpenSearch, and Neptune.
  • Strong Python skills for data engineering, focused on modular, testable, and maintainable code.
  • Solid understanding of distributed data systems, including batch and streaming pipelines, fault tolerance, idempotency, and event-driven architectures.
  • Experience with data warehouse and lakehouse architectures, ETL/ELT pipelines, and analytical query engines.
  • Hands-on experience with Spark, Hadoop, Hive, or Flink.
  • Strong data modeling skills, including normalized, denormalized, and graph-based models, with safe schema evolution.
  • Advanced SQL skills for analytics and data engineering, including window functions, CTEs, and query optimization.
  • Experience integrating external APIs and enterprise systems, especially CRM and ERP platforms.
  • Knowledge of data governance, security, and compliance, including encryption, access control, and audit logging.
  • Experience implementing monitoring, observability, and data quality checks using CloudWatch and CloudTrail. 
  • Comfort with Infrastructure as Code using CloudFormation or Terraform.
  • Strong end-to-end ownership mindset, with a focus on scalability, reliability, and long-term maintainability.
  • Professional-level English communication skills, able to explain data architectures and trade-offs to technical and non-technical stakeholders.

Why Work Here

  1. Remote-First Flexibility: 
    Enjoy work-life harmony in a remote-first environment that allows you to work from anywhere. 
  2. Innovative Culture: 
    We embrace a startup mindset, encouraging creativity, agility, and growth. Be part of a team that explores cutting-edge technology and drives impactful solutions. 
  3. Career Development: 
    Avahi is committed to your growth, offering mentorship and opportunities to advance your career.
  4. Purpose-Driven Mission: 
    Join us in making a difference. Avahi is dedicated to championing diversity, supporting women in tech, and fostering sustainable practices. 
  5. Global Collaboration: 
    Work alongside a diverse, talented team, sharing insights and collaborating to create innovative solutions that make a real impact. 

Join Avahi and make an impact in a fast-paced, customer-focused environment with abundant opportunities for growth.

Accessibility and Inclusivity Statement
At Avahi, we are committed to fostering a workplace that celebrates diversity and inclusivity. We welcome applicants from all backgrounds, experiences, and perspectives, including those from underrepresented communities.

We are proud to be an equal opportunity employer, providing a fair and accessible recruitment process for all candidates. If you require accommodations at any stage of the application or interview process, please let us know, and we will work to meet your needs.

#LI-Remote

Similar Jobs

Yesterday
Remote or Hybrid
Senior level
Senior level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
The role involves designing and building cloud-based data solutions, managing data pipelines, ensuring data quality and performance optimization while leading a team in data engineering.
Top Skills: Aecorsoft/DatasphereAirflowBigQueryDatabricksDbtErwinGcp Cloud ServicesPl/SqlPostgres SqlPower BIPythonSQLTableau
3 Days Ago
Remote
India
Senior level
Senior level
Software
The Azure Data Engineer will design and build scalable data pipelines on Azure, manage ETL/ELT pipelines, optimize performance, and lead a small team in delivering high-quality solutions.
Top Skills: AdlsAzure Cloud ServicesAzure Data FactoryAzure DatabricksEltETLPysparkPythonSQL
3 Days Ago
In-Office or Remote
India
Mid level
Mid level
Software
The Data Engineer will design, build, and maintain ETL pipelines and data products, utilizing big data technologies and ensuring efficient data architecture. Responsibilities include troubleshooting complex issues, developing scalable solutions, and managing databases and data security compliance.
Top Skills: Amazon SnowflakeData LakeDockerEmrETLGitGlueJavaKubernetesPythonScala

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account