Cloud Site Reliability Engineer | Japan Jobs | Fidel Consulting KK

Cloud Site Reliability Engineer

Job Id : 9608
Posted : 2025-09-16
Industry : Telecommunications
Employment Type : Full Time, Permanent
Required Skills : Japanese JLPT N5, Kubernetes, designing, telecom clouds, Linux, networks, Python, Ansible, cloud platform software
City : Tokyo
State : Tokyo
Country : Japan
Annual Salary : ¥10,000,000 ~ ¥12,000,000

Job Description

Appealing Points:

  • Engage with cutting-edge cloud and Kubernetes technologies: This role offers hands-on involvement in designing and operating telecom-grade hybrid cloud environments with a focus on Kubernetes and cloud-native tooling.
  • Opportunities to demonstrate leadership beyond technical skills: Beyond technical proficiency, the position allows you to lead incident management, create architectural documentation, and manage operational risks as a technical leader.
  • Utilize deep telecom and networking expertise in a specialized field: Ideal for professionals with knowledge of telecom standards (e.g., 3GPP) and virtualization technologies, offering a platform to apply and deepen industry-specific technical expertise.

Annual salary: ¥10 million and above

Job Qualification:

  • 5+ years of combined experience in designing, building, and/or operating telecom clouds (on-premises private clouds).
  • Extensive experience with Kubernetes, including deploying, managing, and troubleshooting clusters in production.
  • Certified Kubernetes Administrator (CKA) or equivalent certification.
  • Strong Linux expertise, including experience with performance tuning, troubleshooting, and scripting.
  • Solid networking fundamentals for cloud and telecom environments.
  • Proven ability to create detailed technical documentation, including requirements, HLD (High-Level Design), and LLD (Low-Level Design) documents.
  • Strong troubleshooting skills across multiple stacks: hardware (servers), OS (Linux), networks, Kubernetes, and cloud platform software.
  • Proven experience in incident management and root cause analysis in production environments.
  • Development expertise in designing and implementing operational efficiency and automation tools (e.g., Ansible, Python).
  • Proven management and communication skills to lead operations as a technical leader, including progress and risk management.

Preferred Qualifications:

  • Experience with cloud-native technologies, including Istio, Helm, and Prometheus.
  • Knowledge of standardization for the Telecom industry (e.g., 3GPP).
  • Expertise in hybrid cloud environments, integrating on-premises private clouds with public cloud services.
  • Familiarity with Continuous Integration/Continuous Deployment (CI/CD) pipelines and tools such as Jenkins and GitLab.
  • Understanding of observability practices, including experience with Grafana for visualization and Prometheus for monitoring metrics.
  • Strong understanding of virtualization technologies like VMware or KVM.
  • Relevant Linux certifications (e.g., RHCSA, RHCE) are highly preferred.

Language Skills: Elementary-level Japanese (JLPT N5) and business-level English.

Company Description:

The largest e-commerce company in Japan and the third-largest e-commerce marketplace company worldwide.

The organization provides a variety of consumer and business-focused services, including e-commerce, e-reading, travel, banking, securities, credit card, e-money, portal and media, online marketing, and professional sports.

[Measures against passive smoking]

Smoking is not allowed indoors

Designated smoking area