Netomi

SDET III – Generative AI QA

Posted 18 Days Ago
In-Office or Remote
Hiring Remotely in Gurugram, Haryana
Senior level
Lead development of automation frameworks for AI/ML applications, ensuring reliability and scalability of LLM-based products while optimizing CI/CD integrations and performance testing.
The summary above was generated by AI
About the Company:
Netomi is the leading agentic AI platform for enterprise customer experience. We work with the largest global brands like Delta Air Lines, MetLife, MGM, United, and others to enable agentic automation at scale across the entire customer journey. Our no-code platform delivers the fastest time to market, lowest total cost of ownership, and simple, scalable management of AI agents for any CX use case. Backed by WndrCo, Y Combinator, and Index Ventures, we help enterprises drive efficiency, lower costs, and deliver higher-quality customer experiences.

Want to be part of the AI revolution and transform how the world’s largest global brands do business? Join us!

We’re seeking a Senior SDET with expertise in Generative AI testing to lead the development of cutting-edge automation frameworks for AI/ML-powered applications. You’ll ensure the reliability, safety, and scalability of LLM-driven products while advancing traditional test automation for cloud-native systems.

Responsibilities

AI-Aware Test Automation

  • Design and maintain Python/Java-based automation frameworks (Selenium, Playwright, TestNG/JUnit) for web, API, and backend services.
  • Extend frameworks to test LLM integrations (OpenAI, HuggingFace, RAG pipelines) with prompt validation, hallucination checks, and response consistency tests.
  • Implement model benchmarking (latency, accuracy, bias/drift detection) for generative AI features.

Quality Infrastructure

  • Integrate tests into CI/CD pipelines (Jenkins, GitHub Actions) with cloud workflows (AWS/GCP).
  • Optimize performance testing (JMeter/Locust) for AI endpoints handling high-throughput inference.
  • Debug flaky tests in non-deterministic AI systems.

Leadership & Innovation

  • Mentor junior engineers on AI testing best practices.
  • Research tools like LangChain, synthetic data generators, or adversarial testing techniques.
  • Advocate for ML-specific quality metrics beyond traditional pass/fail.

Requirements

  • 7–9 years of experience in QA automation with strong Python/Java proficiency.
  • Hands-on experience with Selenium, Playwright, REST Assured, and CI/CD tools (Jenkins, Docker).
  • Solid understanding of SQL/NoSQL databases and cloud platforms (AWS/GCP).
  • Exposure to performance testing (JMeter, K6) and scalable test frameworks.
  • Experience with LLM testing (prompt engineering, output validation, rubric-based grading).
  • Familiarity with OpenAI APIs, HuggingFace, or LangChain.
  • Knowledge of synthetic test data generation for edge-case scenarios.
  • Autonomy – Thrive in fast-paced, AI-driven environments with minimal supervision.
  • Analytical Mindset – Debug complex failures in probabilistic AI systems.
  • Communication – Explain technical trade-offs to non-technical stakeholders.

Netomi is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, and other protected characteristics.

Top Skills

AWS
GCP
GitHub Actions
HuggingFace
Java
Jenkins
JMeter
JUnit
K6
LangChain
NoSQL
OpenAI APIs
Playwright
Python
REST Assured
Selenium
SQL
TestNG
