Suzega Logo

Suzega

Data Engineer

Reposted 5 Days Ago
Remote
Hiring Remotely in India
Entry level
Remote
Hiring Remotely in India
Entry level
Data Engineer responsibilities include designing and building scalable data management systems, optimizing data pipelines, managing data warehouses, and ensuring data quality and compliance.
The summary above was generated by AI
What You'll Do

As a Data Engineer, you will be engaged from the first client conversation all the way through to delivery — gathering requirements, designing the solution, and seeing it through to completion. You will design, construct, install, test, and maintain highly scalable data management systems and robust data pipelines. Your work will ensure data quality, reliability, and accessibility for our AI/ML engineers and LLM applications, leveraging cloud platforms and modern data engineering practices, including workflow orchestration.

  • Design, build, and optimize scalable ETL/ELT data pipelines using Python and cloud-native tools (on AWS, Azure, or GCP).

  • Develop data models and schemas optimized for analytical and AI/ML workloads.

  • Implement data quality checks and monitoring frameworks.

  • Manage and administer data warehouses, data lakes, and databases (SQL/NoSQL).

  • Implement and manage workflow orchestration tools (e.g., Airflow, Prefect, Dagster) for scheduling and monitoring data pipelines.

  • Collaborate closely with AI/ML Engineers and LLM Engineers to understand their data requirements.

  • Ensure data security and compliance standards are met.

  • Optimize data storage and processing costs on Hyperscaler platforms.

  • Write efficient and maintainable Python code for data processing tasks.

  • Work independently to troubleshoot and resolve data-related issues.



Requirements
What you’ll Bring
  • Strong proficiency in Python for data manipulation and pipeline development (e.g., Pandas, PySpark)

  • Expertise in SQL and experience with relational and NoSQL databases

  • Hands-on experience with cloud-based data services on at least one Hyperscaler (e.g., AWS S3/Glue/Redshift, Azure Data Factory/Synapse, GCP Cloud Storage/Dataflow/BigQuery)

  • Experience building and managing data pipelines and ETL/ELT processes

  • Familiarity with data warehousing concepts and data modeling

  • Understanding of data quality principles

  • Ability to work independently and take ownership of data infrastructure components

How to Apply:

  • Share your philosophy on developing data infrastructure, the methodologies you utilize, and provide concrete examples of data systems you've built that have delivered tangible results.

  • Tell us why you are interested to join Suzega 

What Happens Next:
  • We will review applications 
  • If shortlisted, you'll participate in two virtual interviews with our Trusted Interviewers
  • You will have to go through a coding interview
  • We will aim to complete our selection process within two weeks and notify you of our decision
Why This Matters

At Suzega, we're not just building better AI—we're thinking about how AI can work better with people and society. Are you ready to help shape what AI can and should do? Do you want to use your technical skills to make a real difference?

If so, we'd love to hear from you.

This is more than a job. It's a chance to shape the future.

Location: Work from anywhere in India


Benefits
Please review the Benefits & Perks section in our Team Member Handbook for comprehensive information.

Similar Jobs

4 Days Ago
In-Office or Remote
Vijaynagar, Shivpurī, Madhya Pradesh, IND
Senior level
Senior level
Cloud • Enterprise Web • Hardware • Information Technology • Internet of Things • Robotics • Semiconductor
Design and develop real-time and cloud/web/mobile data applications. Analyze requirements, implement ETL and data warehouse solutions, perform unit/functional/system testing, conduct code reviews, troubleshoot, and document technical deliverables. Build solutions using Python and cloud platforms, apply CI/CD and containerization, and collaborate in Agile teams while considering data governance and visualization needs.
Top Skills: AWSAzureCi/CdData WarehousingDockerDynamoDBEc2ETLGCPGenerative AiKubernetesLookerMongoDBPower BIPythonPyTorchRdsS3SQLTableauTensorFlow
4 Days Ago
In-Office or Remote
Senior level
Senior level
Insurance
Build and maintain multi-cloud ETL/ELT data pipelines to provision high-fidelity datasets, feature stores, and vectorized corpora for ML and Generative AI. Lead Snowflake architecture, optimize performance and costs, extract from legacy on-prem systems, orchestrate workflows, ensure data quality and security, and collaborate with MLOps and Data Science teams.
Top Skills: AirflowAws GlueAws LambdaAws RdsAws S3Aws Step FunctionsAzure Blob StorageAzure Data FactoryAzure SqlAzure SynapseDb2DockerFeature StoreIamLlmsMainframePrivatelinkPysparkPythonRagSnowflakeSnowpipeSQLSQL ServerVector Store
4 Days Ago
Remote
India
Mid level
Mid level
AdTech • Marketing Tech
Design, build, and maintain scalable ETL pipelines and data models for analytics and ML. Ensure data quality, optimize performance, enforce governance and security, and collaborate with stakeholders to deliver reliable, production-ready data for reporting and data science.
Top Skills: AirflowAWSAzureBigQueryDbtGCPPythonRedshiftScalaSnowflakeSQL

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account