Checkmate (itsacheckmate.com) Logo

Checkmate (itsacheckmate.com)

Prompt Engineer - Data Science & Quality Analysis (India)

Reposted 9 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in India
Senior level
Remote
Hiring Remotely in India
Senior level
The role involves developing and evaluating prompts for Voice AI systems, conducting data analysis, leading a team, and collaborating across functions to optimize AI performance.
The summary above was generated by AI

Checkmate is building advanced Voice AI systems for some of the largest restaurant and retail brands in the US, including several in the top 10. Unlike many companies still in the prototype phase, our AI solutions are live in production with real customers, achieving over 80% accuracy. This is a $1 billion market opportunity, and we’re scaling to 3,000+ stores by the end of this year.

Join us at this pivotal moment to shape AI products used daily by thousands of staff and customers, driving measurable impact at scale.

Essential Job Functions:

  • Prompt Design & Evaluation: Develop, test, and refine prompts for tasks such as text generation, question answering, data classification, and structured data extraction to optimize Voice AI performance.
  • Data-Driven Analysis & Quality Measurement: Design evaluation frameworks and analyze prompt outputs using quantitative metrics, human-in-the-loop evaluation, and user feedback to identify improvement opportunities.
  • Experimentation & Iteration: Conduct experiments to test prompt variations, measure their business and operational impact, and iterate to enhance accuracy, consistency, and safety.
  • Regression Testing & Compliance: Build principled regression test suites using tools like LangFuse and Galileo to ensure prompts remain compliant and high-performing as models and use cases evolve.
  • Collaboration Across Teams: Work closely with data science, product, legal, engineering, and operations teams to align prompt designs with business goals, operational workflows, and compliance requirements.
  • Model Adaptation & Strategy: Develop prompts across multiple LLMs (GPT, LLaMA, Gemini, and Checkmate’s fine-tuned models), understanding model differences to optimize outputs effectively.
  • Team Leadership & Mentorship: Lead a team of analysts focused on prompt evaluation and data quality analysis, guiding prioritization, experimentation, and reporting. Collaborate with ops teams for seamless deployment and feedback loops.
  • Research & Continuous Learning: Stay up to date on emerging prompting techniques, LLM behaviors, evaluation frameworks, and AI safety practices to keep Checkmate’s AI solutions best-in-class.

Requirements
  • Bachelor’s degree in Data Science, Computer Science, Statistics, Engineering, or a related field with 3-6 years of experience in relevant field.
  • Strong analytical and data science skills, with hands-on experience in Python (pandas, NumPy, scikit-learn).
  • Experience designing and conducting experiments and evaluations in applied AI or NLP contexts.
  • Proficiency in SQL and working with relational databases (e.g. MySQL, PostgreSQL, Oracle, MS SQL).
  • Good understanding of data processing, quality measurement, and testing fundamentals.
  • Experience leading analyst or operations teams, with strong prioritization, mentorship, and collaboration skills.
  • Strong problem-solving mindset with a drive to explore, optimize, and automate workflows.
  • Excellent communication skills for presenting insights to technical and non-technical stakeholders.
  • Candidates must be flexible and work during US hours at least until 6 p.m. ET in the USA, which is essential for this role & must also have their own system/work setup for remote work.

Preferred Qualifications

  • Experience with LLM evaluation and prompt engineering workflows
  • Familiarity with tools like LangFuse and Galileo for prompt evaluation and analysis
  • Knowledge of cloud platforms (AWS, GCP, Azure) and data pipeline tools
  • Familiarity with machine learning concepts and NLP workflows
  • Master’s or PhD in Data Science, Statistics, Computer Science, or a related field.

Top Skills

AWS
Azure
Galileo
GCP
Langfuse
Ms Sql
MySQL
Numpy
Oracle
Pandas
Postgres
Python
Scikit-Learn
SQL

Similar Jobs

An Hour Ago
Remote or Hybrid
Bangalore, Bengaluru, Karnataka, IND
Senior level
Senior level
Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
The Senior Product Security Engineer manages vulnerabilities, assesses severity, triages issues, and optimizes security processes while collaborating globally.
Top Skills: Atlassian SoftwareBashBlackduckCisco Ios-XeCoverityJIRAPythonSemgrepVigilesWiresharkYocto Cve Scanner
An Hour Ago
Remote or Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
Senior level
Senior level
Cloud • Software
Lead full sales lifecycle for ThousandEyes Services and Enterprise Agreements in APJC. Drive adoption, build new markets, and collaborate cross-functionally to ensure alignment with customer outcomes.
Top Skills: Cloud InfrastructureDigital Experience AssuranceSaaS
An Hour Ago
Remote or Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
Senior level
Senior level
Cloud • Software
As a Lead Site Reliability Engineer, you'll ensure cloud and big data platform reliability, collaborating with teams to design solutions, optimize infrastructure, and mentor others.
Top Skills: AirflowAWSAws BedrockAws SagemakerCloudwatchElk StackEmrGoGobblinGrafanaHdfsHiveHudiIcebergKafkaKubernetesMapreduceOpentelemetryOrcPrometheusPysparkPythonSparkTerraformThanosYarn

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account