Principal AV Behavior and AI Safety Engineer: Technical Lead
General Motors
Full Time
Senior (5 to 8 years)
Common questions about this position
The salary range is $255K - $405K.
Candidates should have a proven track record of leading White Hat initiatives for frontier models and deep familiarity with LLM behavior, post-training evaluation techniques, and AI safety research. They should also be highly attuned to AI failure modes such as sycophancy, jailbreaking, and harmful outputs.
The Integrity White Hat team is a small, high-impact group that proactively uncovers failure modes in OpenAI’s advanced models, products, and systems, partnering closely with research, safety, and security teams.
Strong candidates thrive at the intersection of research and safety, operate with high integrity and urgency, are motivated by ensuring AGI benefits humanity, prefer working near research teams, and have experience building high-trust, high-performance technical teams.
Develops safe and beneficial AI technologies
OpenAI develops and deploys artificial intelligence technologies aimed at benefiting humanity. The company creates advanced AI models capable of performing various tasks, such as automating processes and enhancing creativity. OpenAI's products, like Sora, allow users to generate videos from text descriptions, showcasing the versatility of its AI applications. Unlike many competitors, OpenAI operates under a capped profit model, which limits the profits it can make and ensures that excess earnings are redistributed to maximize the social benefits of AI. This commitment to safety and ethical considerations is central to its mission of ensuring that artificial general intelligence (AGI) serves all of humanity.