Senior Platform Engineer - AI Infrastructure & Observability (m/f/d)
We’re looking for a Senior Platform Engineer (m/f/d) excited about building infrastructure for AI-first applications . You’ll own our cloud platform — from Kubernetes clusters running real‑time voice agents to ClickHouse analytics pipelines processing millions of events daily.
You’ll tackle novel observability challenges such as monitoring ClickHouse cluster health , ensuring sub‑200ms latency for voice AI, and tracking data pipeline quality .
We’re AI‑first not just in what we build, but in how we operate — leveraging AI‑native tools for incident response and building automation that uses LLMs to accelerate debugging and root cause analysis .
Your Mission
- Automate Incident Management: Implement AI‑native incident management tools to accelerate response and automate root cause analysis.
- Manage Cloud Infrastructure: Operate and optimize AWS EKS infrastructure with Terraform , tailored for AI workloads and analytics pipelines.
- Ensure Data Reliability: Maintain ETL workflows , ClickHouse cluster health , and batch jobs , ensuring data freshness and quality.
- Optimize System Performance: Design API failover strategies , implement caching layers, and continuously optimize infrastructure.
- Improve Developer Experience: Maintain Skaffold‑based local development environments, enhance CI/CD pipelines , and build internal productivity tooling.
- Enhance Observability: Implement and monitor SLOs , use AI tools for log analysis , and improve visibility through structured logging .
Your Profile: What you need to succeed
- Infrastructure Expertise: 5+ years of software engineering experience and 3+ years running Kubernetes in production (AWS EKS preferred).
- IaC Mastery: Strong Terraform and GitOps workflows experience, with deep AWS knowledge (VPCs, RDS, ElastiCache, Lambda).
- Data & AI Focus: Experience monitoring ETL pipelines and analytics workloads (ClickHouse, Redshift, BigQuery), and excitement for AI‑native operations tools such as log analysis or automated remediation.
- Backend & Leadership: Proficient in Python or Kotlin/Java , familiar with FastAPI , Spring Boot , Django , or gRPC . Able to work independently, mentor others, and drive technical decisions.
- Mindset: Strong written communication skills, comfort with ambiguity, and motivation to build at the intersection of AI and infrastructure .
- Development Opportunities : Steep development opportunities without entrenched hierarchies.
- Culture: We are an ambitious team that wants to achieve a lot with Acto, but we don't take ourselves too seriously and like to have fun together, be it working together or at regular team events.
- Flexibility: Enjoy a flexible work schedule and a remote set up with the option to work from any Adesso offices around Germany or our Munich Office.
- ️ Health & work‑life balance: Enjoy 30 days of vacation and an attractive Wellhub membership.
- Ownership: Whether you are an intern or a full‑time acto‑naut, Acto instills a deep sense of ownership, where every team member is entrusted with meaningful responsibilities and the autonomy to make impactful decisions.
- Equipment: You will receive a state‑of‑the‑art hardware setup with a choice between Mac and Windows.
About us
At Acto , we’re building agentic AI systems that go beyond chat, systems that reason, act, and deliver results across complex real‑world workflows for field sales teams.
Our mission is to turn enterprise data and tools into intelligent, decision‑capable agents that support people where it matters: in meetings, in daily operations and on the road. We're a small, product‑focused team combining deep experience in AI, ML engineering, and systems design.
We started in 2021 and since then have not only been able to win renown medium‑sized companies as customers, but also bring experienced investors such as 468 Capital and Cusp Capital on board.
#J-18808-Ljbffr