ALTIMETRIK POLAND SPÓŁKA Z OGRANICZONĄ ODPOWIEDZIALNOŚCIĄ
16. 9. 2024
Informacje o stanowisku
technologies-expected :
AWS
Docker
Kubernetes
Terraform
Splunk
Prometheus
Grafana
OpsGenie
Groovy
Java
Python
Kafka
Oracle
MySQL
Google Cloud Platform
about-project :
We are currently seeking a highly skilled SRE Senior Engineer with solid experience to help lead transformational initiatives within IT operations, encompassing development as well. As a crucial figure in this role, you will participate/help designing and implementing cutting-edge
SRE solutions, driving the transformation of IT operations organizations to adopt an engineering-centric approach.
responsibilities :
Participate in design, architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance
Evangelize SRE evolution within IT operations and promoting a culture of engineering excellence and best practices
Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation
Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind
Implement monitoring systems to assess the performance of applications and infrastructure, and proactively identifying areas for optimization
Understanding incident and problem management process, post-mortems, and driving improvements to prevent future incidents
Analyze resource utilization patterns and forecasting future capacity needs to ensure optimal performance and cost-efficiency
Ensure that SRE practices align with security and compliance requirements andimplementing measures to protect systems and data
Operational excellence with focus on automation and developing tools to streamline operational tasks and increase efficiency
Provide guidance and mentorship to SRE teams, fostering skill development, and building a strong and capable SRE practice
Ability to develop close relationship with other operational teams to integrate SRE practices and drive overall operational improvements across enterprise
Stay up to date on industry trends, new technologies, and best practices in SRE and applying relevant advancements to the organization
requirements-expected :
Around 8-10 years of SRE hands on experience with cloud technologies, development, SRE toolsets and automation
Strong hands-on experience with any Cloud Technology (AWS): Control Tower, Project Setup, Creating Accounts, RDS, SSO
Solid understanding and hands on experience with Docker/Kubernetes/Microservices
Should have good experience with Linux Commands, GitLab CICD Setup and Terraform (state management, etc)
Hands on APM Tool/s experience, preferably Datadog or AppDynamics or Dynatrace
Good understanding of Observability Framework leveraging programmatic SLI/SLO blueprints to standardize the collection of golden signals.
Should have automation (data refresh, releases, DB snapshots) experience using Ansible or any other scripting languages
Experience with following languages (Groovy-DSL, Java, Python, Yaml and microservices architecture)
Good understanding and hands on experience with MQ, Kafka
Experience with Databases (Oracle, MySQL)
Any of the relevant professional certifications – Certified Site Reliability Engineer (CSRE), Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer Professional, Google Cloud Professional; DevOps Enginee