Job Description
As a Senior DevOps Engineer, you’ll collaborate closely with developers and product teams to build, evolve, and maintain our high-performance, multi-region cloud infrastructure. Your work will directly support real-time streaming applications that demand maximum reliability and transparency.
Key Responsibilities:
-
Design, maintain, and optimize multi-region Kubernetes clusters (with a focus on Rancher RKE).
-
Automate deployments and infrastructure using Terraform, Helm, and GitLab CI/CD.
-
Ensure uptime, scalability, and security for global streaming services.
-
Develop and manage observability solutions using the Grafana stack (Loki, Tempo, Mimir, Grafana).
-
Build robust pipelines for metrics, tracing, and logging.
-
Implement best practices for monitoring, alerting, and incident response.
-
Support containerized applications from development to production.
-
Lead initiatives for platform resilience and performance.
-
Participate in an on-call rotation to support critical systems after hours.
Ideal Profile
-
5+ years of experience in a DevOps, SRE, or infrastructure-focused role.
-
Strong expertise in Kubernetes (Rancher RKE experience is a plus).
-
In-depth knowledge of Linux, networking, and cloud-native architecture.
-
Production experience with observability tools like Grafana, Loki, Tempo, and Mimir.
-
Skilled in infrastructure-as-code tools like Terraform, Helm, and GitLab CI.
-
Understanding of metrics, distributed tracing, log aggregation, and OpenTelemetry.
-
Proficiency with Unix systems; experience with Go is a plus.
-
Fluent in English (German is a bonus).
-
Analytical, detail-oriented, and a proactive team player.