.
Senior Site Reliability Engineer @ Kontakt.io
  • Kraków County
Senior Site Reliability Engineer @ Kontakt.io
Kraków, Kraków County, Lesser Poland Voivodeship, Polska
Kontakt.io
1. 3. 2025
Informacje o stanowisku

Kontakt.io is building the platform that care operations run on.


We reduce waste, cut costs, and improve revenue by improving throughput, asset utilization and staff productivity. Our platform uses AI, RTLS, and EHR data to enable self-learning agents to automate workflows, adapt in real-time, and orchestrate all of care delivery operations.
Easy to deploy and scale, it gives a clear picture of spaces, equipment, and people, eliminating inefficiencies and enhancing the patient experience. With measurable 10X ROI and over 20+ use cases, Kontakt.io is the go-to platform for better and faster care delivery operations.


As a Site Reliability Engineer (SRE) at Kontakt.io, you will be responsible for ensuring the scalability, availability, and security of our cloud-based AI-driven healthcare platform. You will collaborate with software, data, and infrastructure teams to build highly resilient and automated systems, allowing hospitals and care facilities to operate seamlessly and without downtime.Your expertise in cloud infrastructure, automation, monitoring, and performance optimization will directly impact how healthcare organizations leverage real-time data to enhance patient care and operational efficiency.


If you are passionate about highly available systems, automation, and making an impact in healthcare, join Kontakt.io and help us build the future of smart care operations!


  • 3+ years of experience as an SRE
  • Strong expertise in Kubernetes, Docker, and container orchestration.
  • Experience managing cloud-native environments (AWS).
  • Experience with event-driven architectures, Kafka, or real-time data streaming.
  • Knowledge of machine learning infrastructure.
  • Previous experience in healthcare, compliance (HIPAA), and highly regulated environments.
  • Proficiency in Infrastructure as Code (IaC) using Terraform.
  • Deep knowledge of networking, DNS, load balancing, and security best practices.
  • Experience with CI/CD pipelines (Jenkins, CI, or ArgoCD).
  • Hands-on experience with monitoring and logging tools (Prometheus, Grafana, ELK, OpenTelemetry).
  • Strong programming skills in Python, Golang, or Bash for automation.
  • Knowledge of machine learning infrastructure.

Kontakt.io is building the platform that care operations run on.


We reduce waste, cut costs, and improve revenue by improving throughput, asset utilization and staff productivity. Our platform uses AI, RTLS, and EHR data to enable self-learning agents to automate workflows, adapt in real-time, and orchestrate all of care delivery operations.
Easy to deploy and scale, it gives a clear picture of spaces, equipment, and people, eliminating inefficiencies and enhancing the patient experience. With measurable 10X ROI and over 20+ use cases, Kontakt.io is the go-to platform for better and faster care delivery operations.


As a Site Reliability Engineer (SRE) at Kontakt.io, you will be responsible for ensuring the scalability, availability, and security of our cloud-based AI-driven healthcare platform. You will collaborate with software, data, and infrastructure teams to build highly resilient and automated systems, allowing hospitals and care facilities to operate seamlessly and without downtime.Your expertise in cloud infrastructure, automation, monitoring, and performance optimization will directly impact how healthcare organizations leverage real-time data to enhance patient care and operational efficiency.


If you are passionate about highly available systems, automation, and making an impact in healthcare, join Kontakt.io and help us build the future of smart care operations!

,[Design and maintain highly available, fault-tolerant, and scalable cloud infrastructure., Implement SLOs, SLIs, and SLAs to track system reliability and optimize uptime., Participate in 24/7 on-call rotation, Oversee production platform deployments, Monitor latency, traffic, errors, and system health using modern observability tools., Conduct root cause analysis (RCA) and post-mortems to continuously improve system resilience., Automate infrastructure provisioning using Terraform, Ansible, or Pulumi., Implement CI/CD pipelines to ensure seamless and safe deployments., Enable self-healing mechanisms using Kubernetes operators, auto-scaling, and fault detection., Ensure compliance with HIPAA, GDPR, and other healthcare data regulations., Define and execute disaster recovery (DR) and business continuity plans., Manage and optimize AWS environments for cost-efficiency and performance., Deploy and manage observability tools and build real-time alerting and response frameworks, Establish best practices for logging, debugging, and performance monitoring., Improve incident response automation through runbooks, AI-based anomaly detection, and predictive analytics. Requirements: Site reliability engineering, Kubernetes, Docker, AWS, Kafka, healthcare, IaC, Terraform, Networking, Security, CI/CD, Prometheus, Grafana, Machine learning, Python, Golang, Bash Tools: Jira, Confluence, GitHub, GIT, Agile, Scrum. Additionally: Sport subscription, Private healthcare, Flat structure, Small teams, International projects, Free coffee, Bike parking, Shower, Free snacks, Free beverages, Free parking, Startup atmosphere, No dress code.

  • Praca Kraków
  • Kraków - Oferty pracy w okolicznych lokalizacjach


    132 347
    12 062