Jobless Developer
Lendbuzz logo
Lendbuzz

Posted 18 days ago

Open

Senior DevOps / Site Reliability Engineer (SRE)

Tel AvivOn-siteFull-time

AI Summary

Senior SRE leading reliability, capacity planning, and automation for a production environment in Tel Aviv. Partners with development teams to improve observability, scalability, and cost/performance optimization.

About this role

At Lendbuzz, we believe financial opportunity should be more personalized and fair. We develop innovative technologies that provide underserved and overlooked borrowers with better access to credit. From our employees to our dealers, partners, and borrowers, we’ve built a company and a culture around a resolute belief in the promise and power of diversity. We value independent and critical thinking.

We’re looking for a Senior SRE to join our Infrastructure team in Tel Aviv. If you love digging deep into distributed systems, optimizing production environments, and building the automation that keeps everything running smoothly at scale, we’d love to talk.

Responsibilities:

  • Own the reliability, availability, and performance of our production environment.

  • Lead high quality effort and long term solutions that will improve stability and resiliency of our production environments.

  • Work closely with dev teams to review architecture, influence system design, and ensure services are built to be observable and scalable.

  • Handle capacity planning, system tuning, and infrastructure cost/performance optimization.

  • Maintain and improve our GitOps-driven deployment pipelines.

Requirements:

  • 7-8+ years of hands-on experience in an SRE or DevOps role, managing production environments at scale.

  • Deep experience with AWS (architecture, security, and best practices).

  • Strong programming skills in Python, Go, or TypeScript.

  • 3+ years working with Infrastructure as Code - AWS CDK or Terraform.

  • Production experience with Kubernetes (EKS) and microservices architecture.

  • Hands-on experience with CI/CD and GitOps tools (GitHub Actions, ArgoCD).

  • Solid understanding of Linux internals and networking fundamentals.
  • Good communication skills in English and Hebrew.

Nice-to-Have:

  • Experience setting up or managing observability stacks (Datadog, Coralogix, Prometheus, Grafana).

  • Experience with Crossplane or advanced Helm usage.

  • Background working in a SaaS company.

Skills

ArgoCDAWSCDKCoralogixCrossplaneDataDogEKSGitHub ActionsGOGrafanaHelmKubernetesLinuxNetworkingPrometheusPythonSaaSTerraformTypeScript

Explore related jobs

Browse these categories