The role involves building and maintaining automated pipelines for data collection and cleaning from public sources, integrating scrapers with backend systems, and ensuring data accuracy and compliance with platform rules.
Web Scraper / Data Engineer
Location: Remote
Job Type: Full-time
About STERRY
At STERRY, we’re not your average Growth Marketing Agency—we’re the rocket fuel behind crowdfunding and e-commerce success. Since day one, we’ve helped clients pull in over $100 million in trackable online revenue. We build strategies that go beyond brand and marketing—we deliver measurable results rooted in online performance
Role Overview
We’re looking for an experienced Web Scraper / Data Engineer to help us build and maintain automated pipelines that collect, clean, and enrich creator and campaign data from public sources. You’ll be responsible for designing reliable, scalable scrapers and integrating them with our backend.
Responsibilities
Requirements
What We Offer
Location: Remote
Job Type: Full-time
About STERRY
At STERRY, we’re not your average Growth Marketing Agency—we’re the rocket fuel behind crowdfunding and e-commerce success. Since day one, we’ve helped clients pull in over $100 million in trackable online revenue. We build strategies that go beyond brand and marketing—we deliver measurable results rooted in online performance
Role Overview
We’re looking for an experienced Web Scraper / Data Engineer to help us build and maintain automated pipelines that collect, clean, and enrich creator and campaign data from public sources. You’ll be responsible for designing reliable, scalable scrapers and integrating them with our backend.
Responsibilities
- Build scrapers and crawlers to collect creator profile data (followers, engagement, category, contact info, etc.) from social platforms (TikTok, Instagram, YouTube, etc.) and directories
- Parse and clean unstructured data into structured datasets (JSON, CSV, or direct to database)
- Integrate with APIs (YouTube, TikTok, Instagram, etc.) where possible
- Detect and handle rate limits, CAPTCHA, and anti-bot mechanisms
- Implement and monitor scraping tasks using proxy rotation and headless browsers (Puppeteer, Playwright, Selenium, etc.)
- Collaborate with the backend team to feed data into AI recommendation engine
- Maintain high data accuracy, freshness, and compliance with platform TOS and privacy rules
Requirements
- 2+ years experience building web scrapers, crawlers, or data extraction pipelines
- Strong Python or Node.js skills (BeautifulSoup, Playwright, Puppeteer, Scrapy, or similar)
- Experience with APIs, JSON, REST, and rate-limiting management
- Familiarity with databases (MongoDB, PostgreSQL, Firebase, etc.)
- Knowledge of proxies, headless browsers, and data scaling infrastructure
- Attention to detail and ability to deliver clean, well-documented code
- (Bonus) Experience with influencer data, social analytics, or SaaS platforms
What We Offer
- Flexible working hours (remote-first)
- Competitive pay (hourly or project-based)
- Long-term potential to transition into a data engineering role
- Opportunity to shape the foundation of a fast-growing AI SaaS startup
Top Skills
Beautifulsoup
Firebase
JSON
MongoDB
Node.js
Playwright
Postgres
Puppeteer
Python
Rest
Scrapy
Similar Jobs
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The Associate Manager develops global medical content, collaborates with teams, ensures project execution, and trains on tools like generative AI.
Top Skills:
Generative Ai Technology Platforms
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Responsible for leading and supporting digital lab solutions, managing solution team backlog, maintaining system documentation, and providing technical support for lab systems.
Top Skills:
LimsModaMS Office
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The SRE Manager oversees global Hosting infrastructure reliability, applies SRE principles, automates processes, engages stakeholders, and drives operational excellence.
Top Skills:
Amazon AuroraAmazon RdsAnsibleApacheBashC/C#/C++GitIisJavaJbossMs SqlOraclePostgresPowershellPythonRedhatTerraformTomcatWeblogicWindows
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

