AI Safety Expert - Red Team
mercor
$20–$22 per hour
mercor
$20–$22 per hour
Role Responsibilities • Red team conversational AI models and agents by conducting jailbreaks, prompt injections, and bias exploitation. • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks. • Apply structure by following taxonomies, benchmarks, and playbooks to maintain consistent testing. • Document reproducibly by producing reports, datasets, and attack cases that customers can act on. • Collaborate on sensitive projects with clear guidelines and wellness resources.
Qualifications Must-Have
• Prior red teaming experience in AI adversarial work, cybersecurity, or socio-technical probing. • Native fluency in English and Tamil. • Strong communication skills to explain risks to technical and non-technical stakeholders.
Preferred • Experience in Adversarial ML, Cybersecurity, or socio-technical risk analysis. • Skills in creative probing such as psychology, acting, or writing.
Resources & Support • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome • For any help or support, reach out to: support@mercor.com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity. Originally