Informacje o stanowisku
Our Client ranks at the top of the leading companies providing services at the highest level, allowing us to build modern and efficient businesses. The clients objective is to change business operational and technological models and adapt them to a rapidly changing world.
Responsibilities:
- provide Level 3 production support for mission-critical applications such as payment and transaction systems,
- participate in a 24/7 on-call rotation, ensuring rapid incident response and minimal downtime,
- perform in-depth troubleshooting and diagnostics across applications, databases, virtual machines, and network layers,
- analyze logs, metrics, and traces using Splunk, Prometheus, and Grafana to identify and resolve performance issues,
- support and monitor Kafka, Cassandra, and other distributed messaging and data systems,
- collaborate with development teams to debug issues, validate fixes, and manage releases,
- provide architectural insights to enhance system performance, scalability, and reliability,
- troubleshoot multi-tier Java/J2EE applications built with Spring, Hibernate, JDBC, MQ, and Web Services,
- operate within Unix/Linux and Windows environments, maintaining system stability and availability,
- work with Oracle and MySQL databases to support incident resolution and performance tuning,
- conduct root cause analyses (RCA), implement corrective measures, and coordinate cross-team responses during production incidents.
Requirements:
- experience in production support, SRE, or reliability engineering roles,
- proven expertise in incident management, problem resolution, and maintaining high service availability,
- hands-on experience with observability and monitoring tools (Splunk, Prometheus, Grafana),
- strong proficiency in scripting languages such as Python or Bash for automation and diagnostics,
- familiarity with Java-based enterprise systems, including application servers and middleware components,
- solid understanding of databases (Oracle, MySQL) and distributed systems like Kafka and Cassandra,
- strong analytical and troubleshooting skills, with the ability to manage and resolve complex, multi-layered production issues,
- excellent communication and collaboration skills, with a proactive approach to reliability and continuous improvement.
Our client offers:
- Contract of Employment or a B2B contract,
- extensive benefits package on CoE: Multisport Card, Lux Med medical healthcare including dental care, life insurance, cafeteria benefits,
- hybrid working model from office in Warsaw,
- training and continuous learning and certification opportunities,
#J-18808-Ljbffr
Praca WrocławWrocław - Oferty pracy w okolicznych lokalizacjach