AI Automation QA

Taguig, Metro Manila, Philippines

AI Automation QA

  • 202603409
  • Taguig, Metro Manila, Philippines
Ver favoritos

Description

Key Accountabilities

  • Definition and execution of testing and quality assurance strategies for AI‑enabled workflows
  • Continuous evaluation and monitoring of system behavior in production environments
  • Contribution to auditability, risk management, and continuous quality improvement

Principal Responsibilities

  • Define quality criteria and testing strategies for agent workflows, covering accuracy, latency, safety, compliance, and operational risk
  • Build automated evaluation harnesses to assess agent performance, including hallucination rates, tool misuse, policy violations, and task success
  • Implement continuous production monitoring to detect anomalies, quality degradation, and emerging safety concerns
  • Develop and maintain automated test suites using Playwright for UI testing and custom scripts for API and workflow validation
  • Apply LLM evaluation frameworks to assess output quality, regression, and system drift over time
  • Produce and maintain dashboards and reports that communicate quality metrics and trends to engineering and stakeholders
  • Develop and maintain runbooks for common failure modes and contribute to incident response activities
  • Collaborate closely with developers to improve prompts, tool definitions, and workflow designs based on test results
  • Ensure testing, logging, and monitoring practices align with data privacy, audit, and regulatory requirements

Qualifications

Knowledge, Skills & Experience

Essential

  • Minimum 3 years’ experience in QA, test automation, or DevOps roles (or 2 years with direct experience testing AI or ML‑enabled systems)
  • Strong Python skills for test automation, evaluation harnesses, and basic data analysis
  • High attention to detail, with a focus on issues that materially impact reliability and user trust
  • Comfort working with evolving tools, frameworks, and testing practices
  • Collaborative mindset, using evidence‑based insights to influence product and engineering decisions

 

Technical Skills (Required)

  • Programming: Python (test automation, evaluation harnesses, data analysis)
  • UI Automation: Playwright (end‑to‑end workflow testing)
  • AI Evaluation: Deepeval, RAGAS, Evidently.AI (LLM quality, drift, and regression analysis)
  • Workflow Testing: API and agent workflow validation using custom scripts
  • Monitoring: Production quality monitoring and anomaly detection

 

Desirable

  • Pytest or equivalent testing frameworks
  • SQL for querying logs, metrics, or evaluation datasets
  • Prometheus, Grafana, or similar monitoring tools
  • Familiarity with hallucination detection and AI safety patterns
  • CI/CD pipelines and Git‑based workflows

 

WTW is an Equal Opportunity Employer

Contacto no solicitado

Cualquier currículum o perfil de candidato no solicitado enviado a través de nuestro sitio web o a las cuentas de correo electrónico personales de los empleados de Willis Towers Watson se considera propiedad de Willis Towers Watson y no está sujeto al pago de honorarios de agencia. Para ser una agencia de reclutamiento/empresa de búsqueda autorizada por Willis Towers Watson, dicha agencia debe tener un acuerdo escrito formal existente firmado por un reclutador autorizado de Willis Towers Watson y una relación de trabajo activa con la organización. Los currículums deben enviarse de acuerdo con nuestro proceso de presentación de candidatos, que incluye participar activamente en la búsqueda particular. Asimismo, para nuestras agencias de reclutamiento/empresas de búsqueda autorizadas, si no se sigue el proceso de presentación de candidatos, Willis Towers Watson no pagará honorarios de agencia. Willis Towers Watson es un empleador que ofrece igualdad de oportunidades. Si desea que guardemos su información de contacto para considerarla en el futuro, envíe un correo electrónico a: Agency.inquiries@willistowerswatson.com .

Nuestras oficinas

Nuestros colegas prestan servicios en más de 140 países y mercados en todo el mundo. Esto le da una dimensión global a todo lo que hacemos y crea muchas oportunidades interesantes para colaborar y crecer. Explore el mapa a continuación para ver a dónde podría llevarlo su carrera.