AI QA Trainer - LLM Evaluation - Freelance Project

Invisible Expert Marketplace · Remote, South Africa

Location: Remote
Job Type: Full-time
Posted: June 29, 2026

Job Description

Overview

We’re looking for AI QA trainers who specialize in model evaluation, LLM safety, prompt robustness, data quality assurance, multilingual and domain-specific testing, grounding verification, and compliance readiness checks. You’ll evaluate advanced language models on tasks such as hallucination detection, factual consistency, prompt-injection and jailbreak resistance, bias/fairness audits, chain-of-reasoning reliability, tool-use correctness, retrieval-augmentation fidelity, and end-to-end workflow validation. You will document every failure mode to raise the bar for quality.

On a typical day, you will converse with the model on real-world scenarios and evaluation prompts, verify factual accuracy and logical soundness, design and run test plans and regression suites, build clear rubrics and pass/fail criteria, capture reproducible error traces with root-cause hypotheses, and suggest improvement...
