Senior Site Reliability Engineer
Kraków, Lesser Poland Voivodeship, Poland
Kontakt.io
26. 5. 2025
Job details

technologies-expected :


  • Kubernetes
  • Docker
  • AWS
  • Kafka
  • Prometheus
  • Grafana
  • OpenTelemetry
  • Python
  • Java

technologies-optional :


  • Jenkins
  • ArgoCD
  • Go
  • Bash

about-project :


  • Kontakt.io is building the platform that care operations run on.
  • We reduce waste, cut costs, and increase revenue by improving throughput, asset utilization, and staff productivity. Our platform uses AI, RTLS, and EHR data to enable self-learning agents to automate workflows, adapt in real time, and orchestrate all care delivery operations.
  • Easy to deploy and scale, it gives a clear picture of spaces, equipment, and people, eliminating inefficiencies and enhancing the patient experience. With measurable 10X ROI and more than 20 use cases, Kontakt.io is the go-to platform for better and faster care delivery operations.
  • As a Site Reliability Engineer (SRE) at Kontakt.io, you will be responsible for ensuring the scalability, availability, and security of our cloud-based AI-driven healthcare platform. You will collaborate with software, data, and infrastructure teams to build highly resilient and automated systems, allowing hospitals and care facilities to operate seamlessly and without downtime.
  • Your expertise in cloud infrastructure, automation, monitoring, and performance optimization will directly impact how healthcare organizations leverage real-time data to enhance patient care and operational efficiency.
  • If you are passionate about highly available systems, automation, and making an impact in healthcare, join Kontakt.io and help us build the future of smart care operations!

responsibilities :


  • Design and maintain highly available, fault-tolerant, and scalable cloud infrastructure.
  • Implement SLOs, SLIs, and SLAs to track system reliability and optimize uptime.
  • Participate in 24/7 on-call rotation.
  • Oversee production platform deployments.
  • Monitor latency, traffic, errors, and system health using modern observability tools.
  • Conduct root cause analysis (RCA) and post-mortems to continuously improve system resilience.
  • Automate infrastructure provisioning using Terraform, Ansible, or Pulumi.
  • Implement CI/CD pipelines to ensure seamless and safe deployments.
  • Enable self-healing mechanisms using Kubernetes operators, auto-scaling, and fault detection.
  • Ensure compliance with HIPAA, GDPR, and other healthcare data regulations.
  • Define and execute disaster recovery (DR) and business continuity plans.
  • Manage and optimize AWS environments for cost-efficiency and performance.
  • Deploy and manage observability tools and build real-time alerting and response frameworks.
  • Establish best practices for logging, debugging, and performance monitoring.
  • Improve incident response automation through runbooks, AI-based anomaly detection, and predictive analytics.

requirements-expected :


  • 3+ years of experience as an SRE.
  • Software engineering experience.
  • Strong expertise in Kubernetes, Docker, and container orchestration.
  • Experience managing cloud-native environments (AWS).
  • Experience with event-driven architectures, Kafka, or real-time data streaming.
  • Knowledge of machine learning infrastructure.
  • Previous experience in healthcare, compliance (HIPAA), and highly regulated environments.
  • Proficiency in Infrastructure as Code (IaC) using Terraform.
  • Deep knowledge of networking, DNS, load balancing, and security best practices.
  • Experience with CI/CD pipelines (Jenkins, CI, or ArgoCD).
  • Hands-on experience with monitoring and logging tools (Prometheus, Grafana, ELK, OpenTelemetry).
  • Strong programming skills in Python, Golang, or Bash for automation.

offered :


  • Work on a mission-driven platform that improves healthcare operations and patient outcomes.
  • B2B contract or an employment agreement.
  • Competitive salary and stock option plan.
  • Collaborate with top engineers, data scientists, and AI experts.
  • Flexible remote or hybrid work options (office in Kraków).
  • Collaborative and self-organized environment.
  • Private medical care and a cafeteria system.

benefits :


  • sharing the costs of sports activities
  • private medical care
  • life insurance
  • remote work opportunities
  • flexible working time
  • fruits
  • integration events
  • dental care
  • no dress code
  • coffee / tea
  • drinks
  • parking space for employees
