Senior AI Developer with Python - Customer Care AI Platform team (f/m/d)

Stellenbeschreibung:

About The Team

Our mission is to build a modern ecosystem used for all IONOS customer support needs. The tools developed by us are used in over 20 locations, by more than 2,000 users, supporting 8 million customer contracts across 10 markets.

The development team has full responsibility for the development lifecycle. This means we plan, develop, test and deploy our software without any other internal or external dependencies.

Our portfolio revolves around an internally built CRM now being enhanced with AI capabilities.

About The Product You Will Be Building

We are building a next‑generation AI platform designed to redefine how our company interacts with customers. This isn’t just a chatbot; it’s a high‑performance, multimodal AI ecosystem powered by state‑of‑the‑art Speech‑to‑Speech (S2S) models, advanced Large Language Models (LLMs), and intelligent orchestration frameworks. The platform will understand, reason, and respond across text and voice while seamlessly executing real‑time actions to resolve customer needs.

We are aiming for a hybrid architecture of Open Source LLMs, industry‑leading proprietary models, and Model Context Protocol (MCP) to enable contextual reasoning, tool invocation, and seamless orchestration across systems. The goal is not just to talk to the customer, but to act on their needs.

What Makes This Project Unique

The Voice Frontier: building low‑latency, emotive speech‑to‑speech pipelines for a truly natural voice channel experience.
Deep System Integration: the platform connects directly to the company’s core systems via MCPs, allowing the AI to access real‑time customer context and execute complex workflows.
Self‑Evolving Logic: developing an automated QA and evaluation module that continuously analyzes interactions across channels and adapts system behavior in hours, not weeks.
Hybrid Innovation: working at the intersection of “build vs. buy,” integrating the best of the open‑source community with custom‑built internal infrastructure.

What's In It For You

You won’t just be shipping code; you’ll help evolve a concept that shifts the industry. You’ll join a friendly, experienced team where your voice matters and your contribution shapes real‑world outcomes. You’ll work in a modern environment with technologies and practices that help us ship reliable software efficiently.

Role Description

As an AI Engineer on this team, you will build the core intelligence systems behind our multimodal AI platform. You will be responsible for moving beyond simple chat interfaces to build high‑performance, real‑time systems that handle complex reasoning, deep context retrieval, LLM orchestration, retrieval‑augmented generation (RAG) and seamless voice interactions.

Main Responsibilities

Design Agentic Workflows: design and implement LLM‑based systems that go beyond response generation, enabling structured tool usage, workflow orchestration, and secure interaction with internal services via MCP.
Build and Optimize RAG & CAG: develop high‑performance Retrieval‑Augmented Generation and Context‑Augmented Generation pipelines to ensure accurate, relevant, and low‑latency responses; continuously improve context management, ranking strategies, and grounding mechanisms for complex, multi‑step interactions.
Voice Channel Mastery: develop and optimize real‑time Speech‑to‑Speech pipelines, focusing on streaming architectures, latency reduction, and maintaining a natural conversational flow.
Evaluation, Quality & Alignment: build and maintain an automated QA module, including LLM‑as‑a‑judge patterns, to measure accuracy, safety, latency, and resolution quality at scale; translate evaluation insights into systematic models and prompt improvements.
Model Strategy & Hybrid Integration: integrate and operate both commercial foundation models (e.g., OpenAI, Anthropic, Google) and open‑source alternatives (e.g., Qwen, Kimi, DeepSeek, Moonshot, GLM), selecting and optimizing models based on performance, latency, cost, and use‑case requirements.

We Are Looking For Some Of

Strong Python and/or Java engineering skills: advanced‑level Python development experience, including asynchronous programming (e.g., FastAPI, asyncio) and building high‑performance, production‑grade services. Experience with streaming architectures is an advantage.
LLM Application & Multi‑Agent Orchestration Experience: hands‑on experience building LLM‑powered systems, including multi‑step workflows, stateful agents, and tool invocation. Familiarity with orchestration frameworks such as LangChain, LlamaIndex, or LangGraph for stateful, multi‑turn agents.
Advanced Retrieval & Context Management: deep understanding of vector databases (e.g., Weaviate, Qdrant, pgvector, Elasticsearch), semantic search, embedding strategies, and re‑ranking techniques; experience designing and optimizing RAG pipelines.
Real‑Time & Low‑Latency Systems: experience in designing systems that operate under latency constraints, including streaming APIs, event‑driven architectures, and performance optimization; understanding of trade‑offs between quality, cost, and response time.
Evaluation‑Driven Development: experience implementing evaluation frameworks for LLM‑based systems, including automated QA pipelines and LLM‑as‑a‑judge patterns.
Familiar with API Design: knowledge of RESTful API design and OAuth2.

What We Offer

Access to local and international trainings, development and growth opportunities, including e‑learning platforms covering both technical and soft‑skill areas.
Modern technologies and product responsibility.
Flexible work schedule.
Hybrid work option.
Medical services package from one of two private providers.
25 vacation days per year.
Substitute days off for public holidays that fall on the weekend.
Meal tickets.
Internal referral program.
Team events and networking events organized to promote a passionate, creative and diverse culture.
Summerfest and Winterfest parties.
Office coffee, soft drinks and fresh fruits.