
Posted 2 months ago
AI / Eval Engineer
AI Summary
Evals-focused engineer who optimizes neural systems by analyzing behavior across models, prompts, retrieval, and ranking to improve clinician-facing responses.
About this role
About Vera Health
Vera is the fastest-growing clinical app for healthcare professionals in the world. Five people built it.
We’re already operating at a scale most teams never reach, with a team most people would think is too small to pull it off. That’s by design. Everyone here runs at a level that compresses what would normally take entire teams. We keep it that way by only hiring people who raise the standard the moment they join.
We’re building the founding team. The people who join now will define how this company operates for the next decade, and shape the future of healthcare.
We move at a pace that feels unreasonable to most, and completely natural to the right people. If that pace feels natural to you, you belong here.
The Role
You can’t improve what you can’t measure.
Every answer on Vera is a clinical decision being made by a clinician. That’s the bar. Not “mostly correct,” not “good on average.” We need to know exactly how Vera behaves, where it holds up, where it breaks, and why. Think of the system like a chessboard with millions of different combinations: different models, prompts, retrieval strategies, ranking, and parameters, all interacting at once. Your job is to understand that space and push it in the right direction by optimizing every part of it, down to the smallest parameter.
This isn’t about writing eval scripts and calling it done. It’s about going a step further, understanding how the system behaves across specialties, clinical contexts, and edge cases, and improving it so that answers hold up no matter the setting.
You’ll be working on a complex search and reasoning engine, where the goal is simple but hard: make it better for every question, every specialty, every clinical setting.
We’re building something that doesn’t exist yet.
What You’ll Do
Optimize every layer of the system, from model behavior to retrieval to ranking to the smallest parameters
Take a system with thousands of moving parts and figure out which ones actually drive better answers
Push performance forward by testing, breaking, and refining how outputs are generated
Go deep on failures: not just what went wrong, but what pattern each one reveals
Tune how the system behaves across different specialties, question types, and edge cases
Work directly with the founders on the highest-leverage improvements
Work with leading healthcare institutions to validate how the system performs in practice
What We’re Looking For
You don’t trust outputs you can’t explain
You care about being right, not just being close
You like working in messy spaces where there’s no obvious path
You move fast, but you don’t compromise when the stakes are high
You have strong technical depth in LLMs, search, or similar systems
You’re excited to work at the edge of what’s currently being done
How we work
Ship, then improve. We get things in front of users fast and let real feedback drive iteration. A feature in production teaches you more than a month of planning.
Own it completely. There’s no handoff culture here. You take a problem and see it through, from first principles to production.
Always look out for the user. Every decision comes back to the clinician on the other end. If it doesn’t make their day better, it doesn’t ship.
Why Vera
Rocketship traction: 100%+ MoM growth, built by a team of 5
Real impact: Every feature you build helps clinicians make better decisions and save lives
Best time to join: We’ve just raised our Series A and are entering the most important phase of the company, where a small team defines everything that comes next
Details
Location: San Francisco (strongly preferred). For exceptional candidates, remote work or relocation support (including O-1A / H-1B sponsorship) is available.
Compensation: Competitive salary + meaningful early-stage equity. Details shared in process.
Employment: Full-time