Posted 26 days ago

Senior Machine Learning Engineer / Technical Lead

MuscatRemoteFull-time

AI Summary

Senior ML Engineer to lead and contribute hands-on to production AI systems, with responsibility for model performance, MLOps, and leading a team of junior engineers.

About this role

The Role

We are looking for a hands-on Senior ML Engineer to lead the development and operation of production-grade AI systems across LLMs, OCR, and voice. This role combines deep technical ownership with leadership of a high-performing team of junior engineers, as well as direct engagement with stakeholders to shape AI solutions.

Responsibilities:

Lead and mentor a team of highly talented junior ML engineers through:
Code reviews, design reviews, and technical direction
Enforcement of strong software engineering and ML best practices
Design, deploy, and operate scalable AI systems with a focus on reliability and performance
Lead production deployment of LLMs and multimodal systems (RAG, OCR, voice)
Own **model performance end-to-end **, combining evaluation, observability, and hardware optimization:
Build evaluation pipelines (benchmarks, regression testing, LLM-as-judge)
Implement deep observability (tracing, latency, error tracking)
Optimize GPU utilization (multi-GPU serving, batching, quantization, memory tuning)
Continuously improve throughput, latency, and cost efficiency
Architect and manage GPU infrastructure:
Model serving, load balancing, and scaling strategies
Hardware-aware deployment and performance tuning
Build and maintain robust MLOps pipelines:
Model/version management, CI/CD, automated testing, and rollback strategies
Monitoring and feedback loops for continuous improvement
Engage directly with clients and stakeholders to:
Gather and clarify business requirements
Translate non-technical needs into well-defined technical problems
Communicate solutions, trade-offs, and progress through clear documentation, reports, and proposals
Contribute hands-on to system design, implementation, debugging, and production incident resolution

Requirements

Proven experience deploying LLMs in production
Strong experience with GPU-based inference and optimization
Solid backend engineering skills (Python, APIs, distributed systems)
Experience with MLOps and production ML systems
Experience with OCR/document AI and/or voice systems (STT/TTS)
Experience with Docker and Kubernetes
Strong understanding of modern AI architectures (RAG, vector DBs, agent workflows)
Experience mentoring or leading engineers
Strong communication skills with the ability to bridge business and technical domains

Nice to Have:

Experience with open-weight models (Qwen, Llama, DeepSeek, Gemma)
Experience with on-prem / sovereign AI deployments
Experience with LoRA / fine-tuning
Multilingual or Arabic NLP experience

Skills

APIsAutomatic TestingBenchmarksCI/CDDistributed SystemsDockerDocument AIError-trackingGPU OptimizationHardware-aware DeploymentKubernetesLatencyLLM-as-judgeLLMsMemory TuningMLOpsMLOps PipelinesModel/version ManagementMonitoringMulti-GPU ServingObservabilityOCRProduction DeploymentPythonQuantizationRAGRollbacksRouting And Load BalancingScaling StrategiesSTT/TTSSystem DesignTracingVector DatabasesVoice Systems

Senior Machine Learning Engineer / Technical Lead

About this role

Requirements

Skills

Explore related jobs

Similar APIs jobs

Jobs in Muscat

Browse these categories