Rakuten Symphony is a Rakuten Group company, providing global B2B services for the mobile telco industry and enabling next-generation, cloud-based, international mobile services.
Building on the technology Rakuten used to launch Japan’s newest mobile network, we are taking our mobile offering global. Let’s build the future of mobile telecommunications together!
Job Purpose
This role supports day‑to‑day operations of the engineering lab, including access management, cluster operations, networking, automation, monitoring.
Key Responsibilities
The Ops & Infrastructure Engineer is responsible for managing day‑to‑day operations of the engineering lab, including access management, cluster operations, networking, automation, monitoring, and test setup preparation. This role requires broad technical knowledge across Linux, networking, scripting, virtualization, Kubernetes, Android tools, and telecom/OSS systems. The engineer ensures smooth operation of lab infrastructure, supports internal teams, and maintains accurate documentation and workflows.
Required Knowledge, Skills & Experience
- Basic to intermediate networking (L2, IP, routing).
- Linux and windows administration basic.
- Scripting (Python, Bash) basic.
- Basic workflow automation.
- Telecom/OSS fundamentals basic.
- Strong documentation and organizational skills.
- Kubernetes and docker basics (pods, deployments, logs, namespaces).
- Experience with CI/CD or DevOps tools.
- Knowledge of monitoring tools (Prometheus, Grafana).
- Experience with automation frameworks.
- Understanding of cloud platforms (AWS, Azure, GCP).
Key Interfaces for Role
IAM & Access Management
- Manage user access to clusters, servers, Kubernetes namespaces, and lab systems
- Maintain access logs and follow security policies
- Create and manage role‑based access for lab environments
Networking & Connectivity
- Troubleshoot L2 issues (Create workflow for automating Monitoring)
- Servers (IP, hostname, RAID, BIOS, NIC bonding)
- Manage VM deployments, snapshots, and resource allocation
- Support cluster access, kubeconfig management, and RBAC
- Assist with cluster monitoring and troubleshooting
Automation & Scripting
- Write scripts (Python/Bash) to automate repetitive tasks
- Build workflow automation for lab operations
- Integrate automation with Jira, monitoring tools, or OSS systems
Telecom & OSS
- Work with OSS/NMS tools for alarms and monitoring
- Assist in troubleshooting network elements and lab nodes
Monitoring & L2 Support
- Monitor lab systems, clusters, and network health
- Respond to alerts and perform first‑level troubleshooting
- Escalate issues to L3/engineering when needed
- Maintain dashboards and monitoring tools
Key Performance Indicators
- Broad technical knowledge across multiple domains.
- Hands‑on, practical, and comfortable in lab environments.
- Able to work independently and support multiple teams.
- Organized, detail‑oriented, and proactive.
#J-18808-Ljbffr