
Posted 4 months ago
Member of Technical Staff - Efficient ML
AI Summary
Member of Technical Staff focusing on efficient ML techniques for training and inference, including distributed training, kernels, and optimization for low-latency serving.
About this role
Introducing Moonlake, AI for creating world simulations.
Scope of Work
Training efficiency
Dataloaders, fusion, activation remat, gradient checkpointing.
FSDP/ZeRO/tensor+pipeline parallel; NCCL tuning.
GPU + kernel performance
Nsight profiling, Triton/CUDA kernels, fused ops.
Flash-attention–style speedups, sequence packing, KV-cache tricks.
Inference optimization
Low-latency serving, continuous batching, speculative decoding.
Quantization (GPTQ/AWQ), distillation, pruning.
Infra + reliability
SLURM/K8s multi-node jobs, checkpoint hygiene.
Determinism, env pinning, GPU failure handling.
We are committed to being an on-site, in-person team currently based in San Mateo
Skills
Explore related jobs
More jobs at Embedding VC
Similar Checkpoint Hygiene jobs
Jobs in San Francisco
- 3Program Monitor (TLP)3rd Street Youth Center & Clinic · San Francisco, Canada
ABM and Content MarketerHealthleap · San Francisco Office
Account ExecutiveFrec Markets, Inc. · San Francisco- VBilingual Field Interviewer (English/Cantonese/Mandarin)Vital Research · San Francisco, United States
Director of Operations - Millennium TowerAction Property Management · San Francisco, Canada
Office ManagerClearView Healthcare Partners · San Francisco, California