Job Description
Overview
Get AI-powered advice on this job and more exclusive features. We’re looking for AI QA trainers who specialize in model evaluation, LLM safety, prompt robustness, data quality assurance, multilingual and domain-specific testing, grounding verification, and compliance readiness checks. You’ll evaluate advanced language models on tasks such as hallucination detection, factual consistency, prompt-injection and jailbreak resistance, bias/fairness audits, chain-of-reasoning reliability, tool-use correctness, retrieval-augmentation fidelity, and end-to-end workflow validation. You will document every failure mode to raise the bar for quality.
On a typical day, you will converse with the model on real-world scenarios and evaluation prompts, verify factual accuracy and logical soundness, design and run test plans and regression suites, build clear rubrics and pass/fail criteria, capture reproducible error traces with root-cause hypotheses, and suggest improvement...
Ready to Apply?
Submit your application for AI QA Trainer - LLM Evaluation - Freelance Project at Invisible Expert Marketplace
Apply Now