Posted 2 months ago
AI Safety Specialist (AI Engineering)
BostonOn-siteFull-time
AI Summary
AI Safety Specialist who tests, guards, and aligns AI behavior for secure deployments, including adversarial testing and RLHF alignment support.
About this role
We are searching for an AI Safety Specialist who will play a crucial role in enhancing the security and robustness of language models. You will ensure the safe deployment of AI systems by conducting adversarial testing, implementing protective measures, and aligning AI behavior with ethical principles.
Responsibilities:
- Conduct adversarial testing on LLMs and multimodal agents.
- Implement guardrails and real-time filtering for autonomous tool use.
- Develop constitutional AI principles and assist with RLHF alignment pipelines.
Qualifications:
- Background in cybersecurity, prompt engineering, or adversarial ML.
- Experience with jailbreak taxonomies and automated red-teaming frameworks.
- Strong analytical mindset for identifying edge cases.
Skills
Adversarial MLAutomated Red-teaming FrameworksConstitutional AICybersecurityGuardrailsJailbreak TaxonomyLLMsMultimodal AgentsPrompt EngineeringReal-time FilteringRed TeamingRLHF
Explore related jobs
More jobs at Hyphen Connect Limited
- General Counsel (Crypto & Commercial Litigation) - Hong Kong OnsiteAPAC
- Content Operations Specialist (US stock contract) - Crypto ExchangeGlobal
- VIP BD Manager/ Director - Global RemoteGlobal
- Web3 Frontend Developer (UX)APAC
- Sales Operations ManagerRemote - Hangzhou, China
- Compliance Officer/ Money Laundering Reporting Officer (CO/MLRO)Hong Kong
Similar Adversarial ML jobs
Jobs in Boston
- TControllerTA Digital Group LLC · Boston, Morocco
- DevOps Engineer (Maynard, MA)Transperfect · Boston, Massachusetts
- Property AccountantSynergy · Boston, Massachusetts
- HApplied AI Engineer (Application Deadline: July 16th)HqO, Inc. · Boston, Morocco
Manufacturing Consultant - Boston (Entry Level)Chartwell · Boston, Massachusetts
Research Assistant II LabHarvard University · Boston, MA