Posted 2 months ago
Compute Operations Lead
AI Summary
Lead daily operations across compute infrastructure, manage hardware and Linux platforms, coordinate vendor support, and mentor engineers in a production HPC environment.
About this role
Qube Research & Technologies (QRT) is a global quantitative and systematic investment manager, operating in all liquid asset classes across the world. We are a technology- and data-driven group implementing a scientific approach to investing. Combining data, research, technology, and trading expertise has shaped our collaborative mindset, which enables us to solve the most complex challenges. QRT’s culture of innovation continuously drives our ambition to deliver high-quality returns for our investors.
You will join the Compute Operations team, responsible for managing on-premise compute infrastructure and ensuring reliable, high-performance services for research and trading activities. The team works closely with Quantitative Researchers, Traders, and platform engineering teams supporting scheduling, orchestration, and control plane systems.
Your Future Role within QRT
You will:
- Lead daily operations across compute infrastructure, covering hardware support and Linux-based platforms
- Oversee lifecycle management, health monitoring, and repair of HPC server hardware, including liquid-cooled systems
- Coordinate with hardware vendors for support, maintenance, and replacement activities
- Monitor infrastructure using platforms such as HP OneView, Dell OpenManage Enterprise (or equivalent)
- Triage and prioritise issues raised by users of compute platforms, including scheduling and containerised environments
- Collaborate with Linux and platform engineering teams on incident resolution and capacity planning
- Plan and oversee maintenance, inspections, and change management activities
- Maintain and support disk encryption services and related configurations
- Mentor and support engineers across hardware and Linux domains, ensuring effective workload distribution
- Maintain operational documentation, reporting, and continuous service improvement initiatives
Your Present Skillset
- 5+ years of experience in compute infrastructure, covering hardware and/or Linux systems
- Experience leading a team or operating as a senior engineer in a production environment
- Strong understanding of HPC server hardware and vendor support processes
- Experience with infrastructure monitoring and management tools (e.g. HP OneView, Dell OpenManage Enterprise, or similar)
- Familiarity with compute platforms and orchestration tools (e.g. Slurm, Kubernetes)
- Experience with operational tooling such as NetBox, Infoblox, Temporal, HashiCorp Vault, or similar systems
- Strong incident management, prioritisation, and communication skills
QRT is an equal opportunity employer. We welcome diversity as essential to our success. QRT empowers employees to work openly and respectfully to achieve collective success. In addition to professional achievement, we are offering initiatives and programs to enable employees achieve a healthy work-life balance.
Skills
Explore related jobs
More jobs at Qube Research & Technologies
Similar Capacity Planning jobs
Jobs in London
- Senior Service DesignerUtility Warehouse · London, England
- Sales Advisor 35hH&M Group · London, United Kingdom
- Senior Legal CounselNBCUniversal · London, United Kingdom
- Wellbeing PractitionerCatch22 · London, England
Information Security AdministratorQuadient · London, United Kingdom- Service Protection AnalystEvelyn Partners · London, United Kingdom