Hello there, people at Telus Health are happy to greet you
TELUS Health is empowering every person to live their healthiest life. Guided by our vision to create a healthier future, we are leveraging the power of our cutting-edge technology and focusing on the uniqueness of each individual to create the future of health. As a leading global health and well-being provider – encompassing physical, mental and financial health – TELUS Health is improving health outcomes for consumers, patients, healthcare professionals, employers and employees.
TELUS Health supports the total health and well-being of over 35 million lives worldwide with our clinical expertise, global presence and digital well-being platform offered through our Integrated Health Solutions. We empower healthier, happier, and more productive employees by combining our award-winning Employee Assistance Program with proactive wellness solutions in a digital ecosystem that helps them prevent and manage issues in family, health, life, money, and work.
We're seeking a Platform (Site Reliability) Engineering Manager (w/m/d) to join our Engineering team in Berlin.
This is a hybrid position, requiring 2 days in the office (Wednesdays & Fridays).
Your mission in a nutshell: - You will lead and evolve our Platform and Site Reliability Engineering function, ensuring the reliability, scalability, and security of our global services while building and developing a high-performing team.
How you will spend your time:
- Lead, develop and grow a team of Site Reliability and Platform Engineers, fostering a culture of ownership and continuous improvement
- Define and drive the reliability strategy across services, including SLIs, SLOs and error budgets
- Ensure high availability, scalability and performance across multi-region AWS environments
- Own and improve incident management processes, on-call practices and operational excellence
- Drive automation and reduce operational toil through tooling and standardisation
- Partner with Security and Compliance teams to ensure adherence to standards such as GDPR, ISO 27001 and SOC 2
- Provide architectural guidance across infrastructure, networking and platform services
- Collaborate with engineering, product, data and AI teams to support reliable and scalable systems
- Communicate risks, performance metrics and priorities to both technical and non-technical stakeholders
What you should bring to the table:
- Strong experience in Site Reliability Engineering, DevOps or Platform Engineering within AWS environments
- Proven experience leading and developing engineering teams
- Deep expertise in AWS services (e.g. EC2, S3, RDS, Lambda, VPC, IAM)
- Strong knowledge of Infrastructure as Code (Terraform or CloudFormation)
- Experience with container orchestration (ECS or EKS)
- Solid understanding of distributed systems and reliability engineering principles
- Experience designing and maintaining CI/CD pipelines
- Strong understanding of networking, security and observability practices
- Experience managing incident response and operational processes
- Excellent stakeholder management and communication skills
- Fluent English
Nice to have:
- Experience with globally distributed systems and large-scale production environments
- Exposure to security incident response and compliance audits
- Experience supporting AI/ML infrastructure on AWS
- Experience mentoring senior engineers or managers
- Relevant certifications (e.g. AWS, Kubernetes, Terraform)
You will be successful in your role after 6 months, if: - You have established clear reliability standards and SLO frameworks
- Your team operates effectively with strong ownership and mature on-call practices
- Platform reliability, scalability and operational efficiency have measurably improved
#J-18808-Ljbffr