Jobless Developer
kaigentic logo
kaigentic

Posted 16 days ago

Open

Software Development Engineer III, DevOps

BengaluruOn-siteFull-time

AI Summary

Senior Software Engineer on the DevOps team responsible for architecting cloud/Kubernetes infrastructure, establishing observability, and leading incident response for a regulated, enterprise AI platform.

About this role

kAIgentic is building the intelligence layer for the world's most ambitious enterprises. Headquartered in Singapore with teams in India and Japan, our software platform helps large organizations evolve as fast as technology itself by turning the tacit know-how locked inside their people into safe, governed, AI-powered operations.

The hardest part of enterprise transformation is not strategy. It is execution. Institutional knowledge lives in people's heads, systems are fragmented, and risk tolerance is low. kAIgentic captures how work actually happens, designs better workflows, and runs them inside an intelligence layer that is observable, auditable, and engineered for the most regulated environments on earth. The outcome is an enterprise that continuously improves.

We are backed by SMBC Group as our founding partner and customer zero, and our platform is already being proven inside one of the most complex, regulated operating environments in the world. That means real problems, real data, and real production impact from Day 1.

The Role

As a Senior Software Engineer on the DevOps team, you will drive the infrastructure strategy for kAIgentic’s enterprise AI platform. You will own reliability, scalability, and operational excellence, leading a team that keeps mission-critical systems running safely in regulated financial environments.

What You’ll Do

  • Architect cloud and Kubernetes infrastructure for multi-region, multi-tenant enterprise deployments, covering GPU scheduling and resource optimization

  • Define the observability strategy integrating application metrics, agent tracing (Langfuse, Arize Phoenix), business KPIs, and SLO framework across all platform services

  • Design CI/CD platform architecture and architect Temporal cluster operations for highly available, durable workflow execution

  • Design security operations including secrets rotation, vulnerability scanning, and compliance automation

  • Lead incident response, establish post-mortem culture, and make build-vs-buy decisions for infrastructure tooling

  • Lead and mentor the DevOps team while driving AI-native practices to accelerate infrastructure automation

What You’ll Bring

  • AI-native velocity as a default mode of working (mandatory)

  • Strong expertise in Python and Go

  • Deep expertise in 3+ of the following:

  • Cloud architecture at scale (AWS, GCP, Azure)

  • Kubernetes platform engineering (multi-cluster, GPU, service mesh)

  • CI/CD platform architecture

  • Observability platform architecture

  • Reliability engineering and SLO-driven operations

  • Infrastructure security and compliance automation

  • Track record of leading infrastructure teams in production-critical environments

  • Strong architectural thinking and communication skills

Why join kAIgentic?

We are a global team of builders who thrive in ambiguity, care deeply about the customers we serve, and believe the intelligence layer is how enterprise work will be reshaped over the next decade. We are building the connective tissue that lets large companies operate with the speed of a startup and the trust of an institution.

We look for people who:

  • Combine technical excellence with genuine customer empathy.

  • Are entrepreneurial and energized by zero-to-one problems with no playbook.

  • Lead with ownership, integrity, and collaboration, not titles.

  • Want to help define a new category of enterprise AI, not just ship inside an existing one.

  • Working here means being surrounded by peers who challenge assumptions, celebrate progress, and build with both courage and care.

Life at kAIgentic

  • Intelligence layer at the core. You will be shaping the substrate that turns institutional knowledge into governed, production-grade operations. This is enterprise infrastructure with real consequences.

  • Innovation at enterprise scale. Startup velocity meets the depth, scale, and stakes of mission-critical, regulated environments. Both are non-negotiable.

  • Ownership from Day One. Your work directly shapes the product, the culture, and the outcomes our customers see.

  • Learning and growth. You will work alongside seasoned leaders from leading enterprises who have built and scaled global businesses.

  • A culture of trust. Psychological safety, transparent disagreement, and disciplined experimentation are how we operate, not slogans on a wall.

  • Global collaboration. Teams across Singapore, India, Japan, Europe, and the US, working as one.

  • A mission worth the effort. Building something the world has not seen before: anintelligence layer that helps enterprises continuously improve how they run.

Skills

Arize PhoenixAWSAzureCI/CDCompliance AutomationGCPGOGPU SchedulingIncident ResponseKubernetesLangFuseObservabilityPythonSecrets ManagementSecurity AutomationService MeshSLOsTemporalVulnerability Scanning

Explore related jobs

Browse these categories