Orion Innovation

AI QA Test Engineer

Posted Yesterday
In-Office
6 Locations
Mid level
The AI QA Test Engineer will validate AI-driven systems, ensure quality through testing of ML models, and collaborate with cross-functional teams. Responsibilities include automated test development, data validation, and adherence to ethical AI standards.
The summary above was generated by AI

Orion Innovation is a premier, award-winning, global business and technology services firm.  Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity.  We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.

About the Role

We are seeking an experienced AI QA Test Engineer to ensure the quality, reliability, and ethical performance of AI-driven systems and applications. This role requires a deep understanding of AI/ML workflows, data validation, model testing techniques, and automation frameworks. The ideal candidate is highly analytical, detail-oriented, and passionate about building trusted AI products.

The engineer participates in QA activities on project assignments and ensures timely delivery on QA commitments in a fast-paced product development environment.

The role includes reviewing functional requirements and technical design documentation; creating test plans, test scenarios, and test cases; executing test plans; and certifying application releases.

Key Responsibilities:

AI/ML Testing

  • Validate ML models, data pipelines, and inference APIs for accuracy, fairness, and robustness
  • Evaluate model behavior across diverse data sets, edge cases, and unexpected inputs
  • Monitor model drift, performance degradation, and bias over time
  • Conduct prompt testing, hallucination detection, and safety validation for LLM-based systems
  • Experience testing the integration of AI/ML into applications
  • Manual testing with automation exposure
  • API testing with SoapUI/Postman
  • Agile methodology experience
  • Test and defect management with Jira/Azure DevOps (ADO)
  • DB testing experience (good to have)
  • Tools: DeepEval/Langfuse/pytest

Quality Engineering & Automation

  • Build and maintain automated test suites for AI features
  • Develop test scripts for ML workflows using Python, Playwright, and CI/CD tools
  • Design and execute manual and automated tests for functional, performance, and security scenarios
  • Identify defects and establish test metrics, reporting dashboards, and test traceability

Data Quality & Responsible AI

  • Validate training and inference data quality, labeling consistency, and data governance
  • Work with data science and product teams to implement responsible AI testing guidelines
  • Document and escalate risks related to bias, privacy, explainability, and compliance

Cross-Functional Collaboration

  • Partner with Data Scientists, ML Engineers, and Product Managers to define acceptance criteria
  • Participate in design reviews, model evaluations, and release readiness checkpoints
  • Contribute to QA strategy and test plans

Required Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience)
  • 4+ years of experience in software QA engineering
  • 2+ years working with AI/ML or data-centric systems
  • Strong Python programming skills
  • Proficiency in automation frameworks (e.g., Playwright, WebdriverIO, Selenium)
  • Experience with ML platforms (TensorFlow, PyTorch, Hugging Face, MLflow, etc.)
  • Hands-on experience testing APIs, data pipelines, and distributed systems
  • Exposure to RAG pipelines and agent frameworks
  • Familiarity with prompt testing and LLM evaluation techniques
  • Experience with model evaluation metrics (precision, recall, F1, ROC-AUC, BLEU, perplexity, etc.)
  • Knowledge of MLOps, CI/CD, and cloud platforms (AWS, Azure, or GCP)
  • Familiarity with Responsible AI frameworks (NIST AI RMF, ISO/IEC standards, fairness toolkits)
  • Experience testing vector databases and AI search (Pinecone, Elasticsearch, Redis Vector)

Key Skills

  • AI/ML testing & analysis
  • Automation & scripting
  • Data validation & pipelines
  • Risk, ethics, & compliance mindset
  • Strong problem-solving & debugging skills
  • Excellent documentation & communication skills
  • Working with distributed teams

Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Candidate Privacy Policy

Orion Systems Integrators, LLC and its subsidiaries and its affiliates (collectively, “Orion,” “we” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:

  • What information we collect during our application and recruitment process and why we collect it;
  • How we handle that information; and
  • How to access and update that information.

Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.


Top Skills

AWS
Azure
Azure DevOps
DeepEval
GCP
Hugging Face
Jira
Langfuse
MLflow
Playwright
Postman
pytest
Python
PyTorch
Selenium
SoapUI
TensorFlow
WebdriverIO
