.
Site Reliability Engineer II (Apptio)
  • Kraków
Site Reliability Engineer II (Apptio)
Kraków, Kraków, Lesser Poland Voivodeship, Polska
IBM
16. 12. 2025
Informacje o stanowisku

Join to apply for the Site Reliability Engineer II (Apptio) role at IBM

2 days ago Be among the first 25 applicants

Introduction

At IBM, work is more than a job – it’s a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you’ve never thought possible. Are you ready to lead in this new era of technology and solve some of the world’s most challenging problems? If so, let’s talk. Curiosity and courageous thinking are vital when working in IBM, as we continue our dedication in guaranteeing that we are at the forefront of cloud technology. Our renowned legacy means we are leading the way in everything from analytics and security through to unmatched hardware & software designs. We provide our clients with the full end‑to‑end transformation as we build IBM’s next generation cloud platform focused around delivering performance and predictability at a global scale.

Your Role and Responsibilities

As a Site Reliability Engineer, you will play a crucial role in supporting, maintaining, and operationally improving the cloud infrastructure. Working closely with various teams, your focus will be on ensuring the health and reliability of production and test systems. Your proactive approach will be essential in responding promptly to issues and alerts, contributing to the development of new capabilities, and collaborating with other SRE teams and program managers to deliver mission‑critical services to the market.

Key Duties

  • Platform Engineering: Participate in development and maintenance of large‑scale Internal Developer Platform (IDP) based on Kubernetes.
  • Collaborative Partnership: Partner with development teams and program managers, contributing to the seamless delivery of mission‑critical services to the market.
  • Cross‑Functional Troubleshooting: Collaborate with engineering teams to provide initial assessments and possible workarounds for production issues. Troubleshoot and resolve production issues effectively.
  • Integration Planning: Work with support and development teams to identify and resolve issues. Discuss and plan integration tasks to enhance overall system performance.
  • Rapid Issue Response: Respond promptly to production issues and alerts, providing swift resolution and maintaining system availability.

Required Technical And Professional Expertise

  • Proven Experience: Expertise in large‑scale, distributed Linux/Unix environments and container orchestration technologies like Docker, Kubernetes, and Helm.
  • System Monitoring and Troubleshooting: Strong experience with observability tools (e.g., Prometheus, Grafana, DataDog) to ensure optimal system performance and uptime.
  • Automation Proficiency: Proficiency in using declarative infrastructure tools like Terraform and CloudFormation to automate production workflows efficiently.
  • Collaborative Mindset: Ability to work collaboratively across teams while adopting GitOps and CI/CD principles for seamless operations.
  • Effective Communication Skills: Excellent communication and mentoring skills, fostering knowledge sharing and team development in English.

Seniority Level

Mid‑Senior level

Employment Type

Full‑time

Job Function

Engineering and Information Technology

Industries

IT Services and IT Consulting

Referrals increase your chances of interviewing at IBM by 2x

Get notified about new Site Reliability Engineer jobs in Cracow, Małopolskie, Poland.

#J-18808-Ljbffr

  • Praca Kraków
  • Kraków - Oferty pracy w okolicznych lokalizacjach


    174 727
    24 535