InterEx Group

Site Reliability Engineer

Job description:

Our client is one of the world’s leading manufacturers of semiconductor chip-making equipment. A majority of the world’s microchips receive their critical lithographic patterning in machines made by this organisation. In addition, they produce metrology tools and advanced applications to analyze and optimize the performance of the customer production process.

Job Mission:

Troubleshoot short-term problems and translate them into structural improvements on our distributed data and compute platform infrastructure. Be accurate, be precise and help drive up the aggregate availability of the installs of these distributed computing systems in Korea, Taiwan, Israel, China, the US and elsewhere. Be part of the computing platform that is one of the main pillars under the production of the next-generation microchips of Apple, Samsung and many others.

Responsibilities:

  • Create awareness in other teams about the methods and procedures we use, to help them prevent repetitive help requests
  • Help application developers understand the infrastructure / cluster / system
  • “We are the team that is in charge of understanding & explaining how the system fits into the customer’s ecosystem”
  • Share knowledge / mindset with other teams (dev/infra engineers)
  • Work cross-functionally and share knowledge between infra engineers
  • Contribute towards building VCP as a product which meets our standards of quality
  • Increase stability and reliability of VCP by automated testing and automation
  • Customer satisfaction and product reliability
  • Improve the functionality and reliability of VCP
  • Translate customer ecosystem needs to engineering deliverables
  • Find the broken pieces of the puzzle at system/cluster level
  • Combine individual ‘stories’ into a complete book
  • Make the VCP reliable by improving system resilience (bug-fixing and beyond)
  • Resolve bugs in a sustainable way (implement regression tests, design structural fixes)
  • Act as an ambassador of predictable component lifecycle management
  • Technical roadmap maintenance (app life cycle management)
  • Support feature and service requests from the field
  • Suggest improvements to our technical solutions and way of working, and implement them in alignment with your team and their stakeholders

Highly valued qualifications & experiences:

  • Experience with DC/OS
  • Experience with new technology introduction at zero downtime, including data migration
  • Fan of automatic testing and qualification, where it can be part of a CI/CD pipeline
  • Affinity for digging deep into the details of networking issues
  • Available to work (remotely) outside regular office hours when an attempt to build a fail-safe system turns out not to have been successful yet. We really want this to be an exception, not a rule.

Required qualifications & experiences:

  • Knowledge of distributed computing systems, with practical experience (a must!)
  • Experienced in build and release infrastructure: Maven, Nexus, Bamboo, GitHub
  • Familiar with at least one scripting language (Python)
  • Experience with Ansible
  • Linux expert

NOTE:
Please mention Fuchsjobs as the source of your application.

Job information

  • Publication date:

    22 Nov 2025
  • Location:

    Germany
  • Type:

    Full-time
  • Work model:

    Remote
  • Category:

    Development & IT
  • Experience:

    Experienced
  • Employment status:

    Employee
