AI Content Red Team Analyst - Trust and Safety

Singapore·Operations·other
Apply on ByteDance (TikTok) →

About the team: The Trust & Safety (T&S) GenAI & Emerging Product team's mission is to empower the development of GenAI models and applications. We do this by building a world-class safety, testing, and risk management system that ensures GenAI innovations are launched responsibly. The AI Content Red Team sits within the T&S GenAI and Emerging Products pillar. The team is responsible for conducting unstructured adversarial testing of ByteDance's generative AI products and models to uncover emerging risks, alongside our structured evaluations. This team combines attacker-minded testing, risk discovery, and clear operational feedback loops to inform product decisions, policy development, mitigations, and longer-term evaluation strategy. We probe models and product experiences across modalities, use cases, and abuse patterns to identify failure modes, stress-test safeguards, and help teams improve safety before and after launch. We work closely with Trust & Safety teams (policy, product, engineering, data science, operations), and business teams across global markets. Success in this team requires strong judgment, creativity, analytical rigor, and the ability to translate ambiguous findings into actionable recommendations. Key Responsibilities - Conduct structured adversarial testing on AI models, features, and policies to identify vulnerabilities and emerging risks. - Explore product behavior across contexts and user journeys, to identify model failure modes that may not be captured in standard evaluations. - Investigate jailbreaks, evasions, prompt-based attacks, and other adversarial techniques relevant to content safety. - Document findings clearly and consistently, including risk descriptions, reproduction steps, severity assessments, and mitigation recommendations. - Partner with cross functional stakeholders (policy, product, business teams) to ensure mitigation validation and root cause closure - Support development of testing playbooks, taxonomies, and internal knowledge bases - Stay updated on emerging adversarial trends (e.g., deepfakes, multimodal manipulation, coordinated abuse), and shifts in the external risk landscape

More open roles at ByteDance (TikTok)