AI Automation QA

Taguig, Metro Manila, Philippines

AI Automation QA

  • 202603409
  • Taguig, Metro Manila, Philippines
  • Full time
View favourites

Description

Key Accountabilities

  • Definition and execution of testing and quality assurance strategies for AI‑enabled workflows
  • Continuous evaluation and monitoring of system behavior in production environments
  • Contribution to auditability, risk management, and continuous quality improvement

Principal Responsibilities

  • Define quality criteria and testing strategies for agent workflows, covering accuracy, latency, safety, compliance, and operational risk
  • Build automated evaluation harnesses to assess agent performance, including hallucination rates, tool misuse, policy violations, and task success
  • Implement continuous production monitoring to detect anomalies, quality degradation, and emerging safety concerns
  • Develop and maintain automated test suites using Playwright for UI testing and custom scripts for API and workflow validation
  • Apply LLM evaluation frameworks to assess output quality, regression, and system drift over time
  • Produce and maintain dashboards and reports that communicate quality metrics and trends to engineering and stakeholders
  • Develop and maintain runbooks for common failure modes and contribute to incident response activities
  • Collaborate closely with developers to improve prompts, tool definitions, and workflow designs based on test results
  • Ensure testing, logging, and monitoring practices align with data privacy, audit, and regulatory requirements

Qualifications

Knowledge, Skills & Experience

Essential

  • Minimum 3 years’ experience in QA, test automation, or DevOps roles (or 2 years with direct experience testing AI or ML‑enabled systems)
  • Strong Python skills for test automation, evaluation harnesses, and basic data analysis
  • High attention to detail, with a focus on issues that materially impact reliability and user trust
  • Comfort working with evolving tools, frameworks, and testing practices
  • Collaborative mindset, using evidence‑based insights to influence product and engineering decisions

 

Technical Skills (Required)

  • Programming: Python (test automation, evaluation harnesses, data analysis)
  • UI Automation: Playwright (end‑to‑end workflow testing)
  • AI Evaluation: Deepeval, RAGAS, Evidently.AI (LLM quality, drift, and regression analysis)
  • Workflow Testing: API and agent workflow validation using custom scripts
  • Monitoring: Production quality monitoring and anomaly detection

 

Desirable

  • Pytest or equivalent testing frameworks
  • SQL for querying logs, metrics, or evaluation datasets
  • Prometheus, Grafana, or similar monitoring tools
  • Familiarity with hallucination detection and AI safety patterns
  • CI/CD pipelines and Git‑based workflows

 

WTW is an Equal Opportunity Employer

Unsolicited Contact

Any unsolicited resumes/candidate profiles submitted through our web site or to personal e-mail accounts of employees of Willis Towers Watson are considered property of Willis Towers Watson and are not subject to payment of agency fees. In order to be an authorized Recruitment Agency/Search Firm for Willis Towers Watson, any such agency must have an existing formal written agreement signed by an authorized Willis Towers Watson recruiter and an active working relationship with the organization. Resumes must be submitted according to our candidate submission process, which includes being actively engaged on the particular search. Likewise, for our authorized Recruitment Agencies/Search Firms, if the candidate submission process is not followed, no agency fees will be paid by Willis Towers Watson. Willis Towers Watson is an equal opportunity employer. If you would like to have your contact information saved for future consideration, please email: Agency.inquiries@willistowerswatson.com.

Our Offices

Our colleagues serve more than 140 countries and markets around the world. This gives a global dimension to everything we do and creates lots of exciting opportunities for you to collaborate and grow. Explore the map below to see where you career could take you.