We are hiring a skilled Site Reliability Engineer (SRE) to strengthen our 24x7 operational support team. In this role, you’ll ensure platform stability, reliability, and security through observability, automation, and proactive monitoring. You will manage Kubernetes, CI/CD pipelines, and infrastructure-as-code (IaC) solutions, and script in Python, Go, or Bash to drive efficiency. You’ll configure and optimize Elasticsearch and Prometheus platforms to provide secure logging, scalable monitoring, and insightful dashboards. As part of a global support team, you’ll participate in incident response, troubleshooting, and major incident management, aiming for rapid recovery and minimal downtime. You’ll also uphold security standards and compliance requirements, and contribute to the continuous improvement of standard operating procedures (SOPs). Collaboration with engineers and stakeholders will be key to delivering resilient, reliable, and high-performing systems.
Type: Full-time
Work model: On-site
Experience: 2+ years
Employment status: Employee
Publication date: 04 Nov 2025
