Posted 8 months ago

Research Engineer

San Francisco (On-site)

AI Summary

Research Engineer focused on building autonomous post-training systems, scaling experiments, and integrating reinforcement learning and human feedback within the Mantis platform.

About the Role

As a Research Engineer at Metis, you’ll work on building the next generation of autonomous post-training systems that leverage our Mantis platform. You’ll operate at the intersection of cutting-edge ML research and scalable engineering, designing, implementing, and deploying algorithms that improve how AI agents learn from feedback, synthetic data, and real-world interactions.

You’ll move seamlessly between papers and production, leading large-scale experiments, creating optimized training pipelines, and helping shape the future of post-training autonomy. You’ll have significant ownership, high compute budgets, and the mandate to push the state of the art in applied reinforcement and preference optimization.

What You'll Do

Research and help build an autonomous post-training agent leveraging the Mantis platform
Design and execute large-scale experiments on synthetic data generation and algorithmic architecture
Develop and refine methods for reinforcement learning, reward modeling, and human feedback integration
Collaborate cross-functionally with Core and Platform Engineering to deploy and evaluate models in production settings
Publish or contribute to leading-edge research in the post-training domain
Use tooling and compute efficiently to iterate on experimental pipelines and accelerate research velocity

Requirements

Deep experience in machine learning, preferably reinforcement learning, post-training, or alignment research
Demonstrated research contributions; ideally published papers (ICML, NeurIPS) or public implementations
Strong proficiency in Python and ML frameworks (PyTorch, JAX, or TensorFlow)
Comfort with distributed training, high-throughput data pipelines, and large-scale experiment management
Ability to reason independently, formulate hypotheses, and run experiments from idea → insight → product impact

Compensation & Benefits

Base: $200,000–$1,000,000
Significant Equity
Full medical, dental, and vision
Wellness and learning & development (L&D) stipend
Equinox membership
Breakfast, lunch, and dinner provided (unlimited DoorDash)
$25,000 housing stipend

About Metis

Metis helps enterprises and labs build the most reliable AI agents by leveraging post-training. Our platform enables the creation, improvement, and deployment of the most capable frontier agents designed for rigorous, real-world workflows.

Momentum

0 → six-figure monthly revenue in the last six weeks
Working with several Fortune 500 enterprises & frontier AI labs
Growing 150%+ MoM

Backed by

Y Combinator, CRV, and executives from OpenAI, Google, Mercor, NVIDIA, and others.

Skills

Alignment Research, Distributed Training, Experimental Design, High-throughput Data Pipelines, Human Feedback Integration, JAX, Large-scale Experiment Management, Mantis Platform, Post-training, Production Deployment, Python, PyTorch, Reinforcement Learning, Research Publication (ICML, NeurIPS), Reward Modeling, TensorFlow
