YipitData Logo

YipitData

Data Engineer (Web Scraping)

Posted 4 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Mid level
Remote
Hiring Remotely in India
Mid level
As a Web Scraping Engineer, you'll design, build, and maintain web scrapers, implementing advanced techniques and collaborating with teams to ensure efficient data processing and quality.
The summary above was generated by AI

About Us:

YipitData is the leading market research and analytics firm for the disruptive economy and most recently raised $475M from The Carlyle Group at a valuation of over $1B. Every day, our proprietary technology analyzes billions of alternative data points to uncover actionable insights across sectors like software, AI, cloud, e-commerce, ridesharing, and payments.

Our data and research teams transform raw data into strategic intelligence, delivering accurate, timely, and deeply contextualized analysis that our customers—ranging from the world’s top investment funds to Fortune 500 companies—depend on to drive high-stakes decisions. From sourcing and licensing novel datasets to rigorous analysis and expert narrative framing, our teams ensure clients get not just data, but clarity and confidence.

We operate globally with offices in the US (NYC, Austin, Miami, Mountain View), APAC (Hong Kong, Shanghai, Beijing, Guangzhou, Singapore), and India. Our award-winning, people-centric culture—recognized by Inc. as a Best Workplace for three consecutive years—emphasizes transparency, ownership, and continuous mastery.

What It’s Like to Work at YipitData:

YipitData isn’t a place for coasting—it’s a launchpad for ambitious, impact-driven professionals.

From day one, you’ll take the lead on meaningful work, accelerate your growth, and gain exposure that shapes careers.

Why Top Talent Chooses YipitData:

  • Ownership That Matters: You’ll lead high-impact projects with real business outcomes
  • Rapid Growth: We compress years of learning into months
  • Merit Over Titles: Trust and responsibility are earned through execution, not tenure
  • Velocity with Purpose: We move fast, support each other, and aim high—always with purpose and intention

If your ambition is matched by your work ethic—and you're hungry for a place where growth, impact, and ownership are the norm—YipitData might be the opportunity you’ve been waiting for.

About The Role:

We are seeking a Web Scraping Engineer to join our growing engineering team. In this hands-on role, you’ll take ownership of designing, building, and maintaining robust web scrapers that power critical reports and customer experiences across our organization. You will work on complex, high-impact scraping challenges and collaborate closely with cross-functional teams to ensure our data ingestion processes are resilient, efficient, and scalable, while delivering high-quality data to our products and stakeholders.

As Our Web Scraping Engineer You Will:

Refactor and Maintain Web Scrapers

  • Overhaul existing scraping scripts to improve reliability, maintainability, and efficiency.
  • Implement best coding practices (clean code, modular architecture, code reviews, etc.) to ensure quality and sustainability.

Implement Advanced Scraping Techniques

  • Utilize sophisticated fingerprinting methods (cookies, headers, user-agent rotation, proxies) to avoid detection and blocking.
  • Handle dynamic content, navigate complex DOM structures, and manage session/cookie lifecycles effectively.

Collaborate with Cross-Functional Teams

  • Work closely with analysts and other stakeholders to gather requirements, align on targets, and ensure data quality.
  • Provide support, documentation, and best practices to internal stakeholders to ensure effective use of our web scraped data in critical reporting workflows.

Monitor and Troubleshoot

  • Develop robust monitoring solutions, alerting frameworks  to quickly identify and address failures.
  • Continuously evaluate scraper performance, proactively diagnosing bottlenecks and scaling issues.

Drive Continuous Improvement

  • Propose new tooling, methodologies, and technologies to enhance our scraping capabilities and processes.
  • Stay up to date with industry trends, evolving bot-detection tactics, and novel approaches to web data extraction.

This is a fully-remote opportunity based in India. Standard work hours are from 11am to 8pm IST, but there is flexibility here.

You Are Likely To Succeed If:

  • Effective communication in English with both technical and non-technical stakeholders.
  • You have a track record of mentoring engineers and managing performance in a fast-paced environment.
  • 3+ years of experience with web scraping frameworks (e.g., Selenium, Playwright, or Puppeteer).
  • Strong understanding of HTTP, RESTful APIs, HTML parsing, browser rendering, and TLS/SSL mechanics.
  • Expertise in advanced fingerprinting and evasion strategies (e.g., browser fingerprint spoofing, request signature manipulation).
  • Deep experience managing cookies, headers, session states, and proxy rotations, including the deployment of both residential and data center proxies.
  • Experience with logging, metrics, and alerting to ensure high availability.
  • Troubleshooting skills to optimize scraper performance for efficiency, reliability, and scalability.

What We Offer:

Our compensation package includes comprehensive benefits, perks, and a competitive salary: 

  • We care about your personal life, and we mean it. We offer flexible work hours, flexible vacation, a generous 401K match, parental leave, team events, wellness budget, learning reimbursement, and more!
  • Your growth at YipitData is determined by the impact that you are making, not by tenure, unnecessary facetime, or office politics. Everyone at YipitData is empowered to learn, self-improve, and master their skills in an environment focused on ownership, respect, and trust. See more on our high-impact, high-opportunity work environment above!

We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal-opportunity employer.

Job Applicant Privacy Notice

Top Skills

Cookies
Headers
HTML
HTTP
Playwright
Proxies
Puppeteer
Restful Apis
Selenium
Session States
Ssl
Tls
Web Scraping

Similar Jobs

5 Hours Ago
Remote or Hybrid
Bengaluru, Karnataka, IND
Senior level
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
The Project Manager drives project success by managing relationships, developing plans, coordinating teams, and tracking progress for customer and partner engagements.
Top Skills: Crm SoftwareMS OfficeOracleProject Management SoftwareSAP
5 Hours Ago
Remote or Hybrid
Bengaluru, Karnataka, IND
Senior level
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
As a Staff II Software Engineer, you will develop BlackLine applications, focusing on data sync with various systems and improving software architecture.
Top Skills: Asp.Net CoreAWSAzureC#Ci/CdDevOpsGCPJwtKubernetesMicroservicesNoSQLOauth2ReactRest ApisSQLTerraform
5 Hours Ago
Remote or Hybrid
Bengaluru, Karnataka, IND
Senior level
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
The Project Manager is responsible for managing customer and partner relationships, coordinating teams, and ensuring successful project delivery while tracking progress and managing risks.
Top Skills: Crm SoftwareMS OfficeProject Management Software

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account