AI Safety Specialist - Fully Remote | Upto $22/hr
mercor
$20–$22 per hour
mercor
$20–$22 per hour
Role Responsibilities • Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases. • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks. • Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing. • Document reproducibly by producing reports, datasets, and attack cases for customer action. • Work independently and asynchronously to meet deadlines while improving AI model performance.
Qualifications Must-Have
• Fluent in English and Gujarati. • Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing). • Ability to explain risks clearly to technical and non-technical stakeholders.
Preferred • Experience with Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction. • Background in Cybersecurity: penetration testing, exploit development, reverse engineering. • Knowledge of Socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing.
Application Process (Takes 20–30 mins to complete) • Upload resume • AI interview based on your resume • Submit form
Resources & Support • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome • For any help or support, reach out to: support@mercor.com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity. Originally