Darkroom

Senior Data Engineer

Posted 6 Days Ago
Remote
Hiring Remotely in India
Senior level
Build, operate, and scale ingestion pipelines for marketing data (ad platforms, ecommerce, analytics). Normalize and transform data into shared schemas, ensure multi-tenant reliability, observability, and security, and partner with AI/data-science teams to expose clean data for agents.
The summary above was generated by AI
What we're building

We're empowering small teams with technology that makes it easier to market and grow businesses. Our current focus is to help consumer brands shift from "workflow automation" to "agent management" within their marketing operations. Shadow is the AI coordination layer — providing shared AI memory, centralized agent control, and model orchestration for marketing teams.

Why join Shadow?
  • Product Ownership: You'll ship production code daily and help steer key product and technical decisions.

  • Shape the Engineering Culture: You'll influence how we work, including tools, processes, standards, and hiring.

  • Work with Challenger Consumer Brands: Talk directly to customers (CEOs, CMOs, VPs) of fast-growing consumer brands, some doing $80M–$500M in revenue.

The agency behind the product

Shadow is built alongside Darkroom — a performance marketing agency that's been operating for 10 years, employs 100+ people, runs 100+ clients at a time, and has worked with over 1,000 consumer brands. The agency is both our proving ground and our first user, which means the data you build with is real marketing data at real volume from day one — not a synthetic demo.

This is a fully remote role supporting a team in the EST time zone (9 AM–5 PM EST).

The role

You own the pipelines that bring the world's marketing data into Shadow — and keep them fast, accurate, and reliable as we scale to thousands of users. Every brand connects its full stack (ad platforms, ecommerce, analytics, email/SMS), and you make that data land cleanly, normalize into shared schemas, and stay in sync. The agent is only as good as the data underneath it; that layer is yours.

This is a hands-on, build-heavy engineering role for someone who has run large data systems before and wants to do it again in a smaller, faster environment.

What you'll own
  • Build and scale the ingestion layer across third-party marketing APIs (Meta, Google, TikTok, GA4, Shopify, Klaviyo, and more) — auth, extraction, rate-limit handling, backfill, and incremental sync.

  • Design normalization and transformation pipelines that map messy, platform-specific data into shared, queryable schemas (e.g. a unified creative/campaign/order model).

  • Own data reliability at scale — sync accuracy, freshness, coverage, and observability. Build the systems that detect when a connection breaks or a number looks wrong before a user does.

  • Engineer for multi-tenant scale and security: pipelines and storage that stay performant and cost-efficient across 1,000+ users and hundreds of connected brands — with strict data isolation, privacy, and compliance built in, not bolted on.

  • Partner with the AI and data-science teams to expose clean, well-modeled data the agent can retrieve and reason over.

Must haves
  • Experience building and operating large enterprise data pipelines engineered for scale — systems serving 1,000+ users (or equivalent data volume / tenancy), where reliability, isolation, and cost at scale were real constraints you solved.

  • Strong SQL and Python, with production experience in a modern data warehouse (BigQuery, Snowflake, Redshift, or similar).

  • Deep familiarity with ETL/ELT patterns, incremental sync, schema design, and data modeling for analytics.

  • Built and maintained integrations against third-party APIs — OAuth flows, pagination, rate limits, schema drift, and the operational reality of connectors that break.

  • A bias toward observability and data quality: you instrument your pipelines and you don't ship data you can't trust.

  • Experience building or operating within SOC 2-compliant systems with enterprise-grade security and privacy — you've handled sensitive customer data under real compliance constraints (access controls, encryption, data isolation, auditability) and treat it as a first-class engineering requirement.

Nice to haves
  • Experience in martech, adtech, or an adjacent data-heavy marketing domain — you've worked with ad platform or ecommerce data before and know where the bodies are buried (attribution windows, currency/timezone messes, deduping across platforms).

  • Familiarity with our stack: GCP (BigQuery, Cloud Run), PostgreSQL + pgvector, and orchestration/transformation tooling (dbt, Airflow, Dagster, or similar).

  • Experience with pipeline observability and tracing in an AI/LLM context (e.g. Langfuse).

  • Comfort supporting data that feeds AI agents and retrieval systems, not just dashboards.

Culture fit
  • Obsessive about data organization at scale. We're hiring someone who lives in the data layer and wants to own it end to end.

  • You’re a power AI user. You've embedded AI into every workflow you touch and you think in systems — not one-off prompts, but repeatable structures that compound.

  • Entrepreneurial. You don't need much direction to move fast, you pivot when the situation demands it, and what you ship is production-grade, not a prototype you hand off for someone else to finish.

What we offer
  • Unlimited PTO + Local Holidays (relevant to your hub): Rebooting is part of the work. Take the time you need to stay sharp.

  • Remote-First Culture: Many roles are fully remote. Employees based in or near our New York or Lisbon HQs are expected to work hybrid with weekly in-office time. Hub locations include Brazil and Spain.

  • Parental Leave: Flexible parental leave to support new parents during this important transition.

  • Growth: Our interdisciplinary model gives every employee exposure far beyond their core role. Grow your skills, expand your influence, and stay at the forefront of the industry.

Equal Opportunity Statement

We are an equal opportunity workplace. We are dedicated to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status. We also consider qualified applicants regardless of criminal history, consistent with legal requirements.

