Posted 2 months ago

AI Safety Specialist (AI Engineering)

BostonOn-siteFull-time

AI Summary

AI Safety Specialist who tests, guards, and aligns AI behavior for secure deployments, including adversarial testing and RLHF alignment support.

About this role

We are searching for an AI Safety Specialist who will play a crucial role in enhancing the security and robustness of language models. You will ensure the safe deployment of AI systems by conducting adversarial testing, implementing protective measures, and aligning AI behavior with ethical principles.

Responsibilities:

Conduct adversarial testing on LLMs and multimodal agents.
Implement guardrails and real-time filtering for autonomous tool use.
Develop constitutional AI principles and assist with RLHF alignment pipelines.

Qualifications:

Background in cybersecurity, prompt engineering, or adversarial ML.
Experience with jailbreak taxonomies and automated red-teaming frameworks.
Strong analytical mindset for identifying edge cases.

Skills

Adversarial MLAutomated Red-teaming FrameworksConstitutional AICybersecurityGuardrailsJailbreak TaxonomyLLMsMultimodal AgentsPrompt EngineeringReal-time FilteringRed TeamingRLHF

AI Safety Specialist (AI Engineering)

About this role

Skills

Explore related jobs

More jobs at Hyphen Connect Limited

Similar Adversarial ML jobs

Jobs in Boston

Browse these categories