N26

Senior Site Reliability Engineer - AI Platform Barcelona

N26 Berlin

Job description:

About the opportunity

We are seeking a Senior Site Reliability Engineer to join the Platform Engineering Domain in the AI Platform Team.

The mission of Platform Engineering is to provide trusted, performant, self-service platforms that empower product teams to build "the bank the world loves to use." The AI Platform team contributes to this mission by creating scalable, secure, and compliant infrastructure solutions that support MLOps and GenAI capabilities.

The ideal candidate is not only a seasoned SRE expert ready to apply their skills to the challenges of AI infrastructure but also an enthusiastic learner excited to grow alongside a team pioneering cutting-edge platform solutions. If you thrive in an environment where expertise meets curiosity, and where mentorship and innovation go hand in hand, we’d love to hear from you.

In this role, you will:

  • Design, develop, and implement platform solutions that enhance the reliability, security, and scalability of the AI Platform infrastructure.
  • Provide technical leadership in cloud infrastructure, networking, CI/CD, and security for AI and MLOps workloads.
  • Collaborate closely with Data Scientists, ML Engineers, and Product Teams to ensure seamless model deployment and operational efficiency.
  • Mentor and coach team members, fostering a culture of knowledge sharing, technical excellence, and continuous improvement.
  • Take an active role in shaping the team's strategy, roadmap, and architecture.
  • Drive incident management and troubleshooting efforts, ensuring a stable and predictable AI development and deployment environment.
  • Improve observability and monitoring, ensuring the AI Platform meets performance and compliance requirements.

What you need to be successful

Background and skills:

  • Strong hands-on experience in designing, implementing, and maintaining cloud-based infrastructure, particularly in AWS.
  • Strong experience in infrastructure as code (Terraform, CloudFormation, or similar).
  • Proficiency in at least one programming language (Python preferred).
  • Experience with networking and security best practices in cloud environments.
  • Hands-on experience with CI/CD pipelines (GitHub Actions, ArgoCD, Jenkins, or similar).
  • Familiarity with observability tools (Datadog, Prometheus, Grafana, OpenTelemetry).

Nice to have:

  • Experience in AI/ML production systems and the unique challenges of scaling AI workloads.
  • Experience in orchestration for AI/ML workloads.
  • Familiarity with MLOps tools (e.g., AWS SageMaker, Bedrock, Kubeflow, MLflow).
  • Strong understanding of compliance and governance in AI/ML platforms.

Traits:

  • Excellent collaboration and communication skills, with the ability to work across teams and mentor engineers.
  • Strong sense of ownership, with a proactive approach to problem-solving and process improvements.
  • Passion for building high-quality, scalable, and secure AI infrastructure.
  • Eagerness to learn and contribute to the evolution of AI platforms.

What’s in it for you

  • Accelerate your career growth by joining one of Europe’s most talked-about disruptors.

NOTE:
Please mention Fuchsjobs as the source of your application.

Job information

  • Type: Full-time
  • Work model: On-site
  • Category:
  • Experience: 2+ years
  • Employment relationship: Employee
  • Publication date: 06 Nov 2025
  • Location: Berlin
