Samsung Ads is an advanced advertising technology company in rapid growth that focuses on enabling brands to connect with Samsung TV audiences as they are exposed to digital media by using the industry’s most comprehensive data to build the world’s smartest advertising platform. Being part of an international company such as Samsung and doing business around the world means that we get to work on the most challenging projects with stakeholders and teams located around the globe.
The Engineering Platform (EP) team is a team that builds, operates, and offers products that benefit multiple engineering teams within the Samsung Ads project. These are typically foundational services and include runtime environments, scheduling, observability, monitoring, and more.
As an embedded Site Reliability Engineering (SRE) you’ll be part of a software development team and act as a subject matter expert on the challenges of usability, performance, reliability, scalability and observability.
The ideal candidate has deep knowledge and a strong interest in process automation, observability, software-defined infrastructure, and approaches it from the perspective of a software engineer. Challenges of globally distributed services, deciding what and when state should be shared, and simulating failure scenarios should drive you.
You will work with some incredibly talented and passionate developers with a solid technical background to bring products and services to a market with unique technical challenges.
Technologies in use
AWS
Kubernetes
Terraform
EKS, Rancher
HashiCorp Vault, Prometheus, Okta
GitHub Actions, ArgoCD, Argo Rollout
Grafana, Sloth, Loki, Tempo
responsibilities :
Co-architect new services, including failure tolerance and self-healing by-design, as well as establishing clear scaling-out paths
Act as a subject matter expert for the challenges of infrastructure and operation within your team
Translate Product Owner requirements into actionable technical tasks
Advise on tuning observability systems to represent the health of the systems your team is responsible for and glean insights to plan for growth
Contribute to the global SRE practice
Empower your development team with tooling and automation, including CI/CD
Continuously improve internal services for ease of packaging, configuration, and deployment
Participate in shared on-call rotation
Evaluating and benchmarking new solutions, establishing capacity and growth plans
Developing and supporting usable and maintainable tooling for the engineering organization
Administration of services, whether built in-house or from external vendors
Continuous optimization of services on all layers (hardware, software) for high performance
Monitoring of all critical services, sharing pager duty, troubleshooting, and addressing problems as they arise (including any needed changes in code, topology, resources, or configuration)
Backup/DR implementation, plans, documentation, and exercises
Co-own technical relationships with several service providers and vendors
requirements-expected :
Strong expertise administrating and scaling Kubernetes on AWS (CKA, CKAD, CKS are nice to have)
Strong understanding of distributed systems and client-server architectures
Strong Linux system administration and troubleshooting skills, including solid knowledge of how the various components work (kernel, CPU, memory, disk, network)
Experience with Infrastructure as Code tools (Terraform and custom modules)
Experience working in microservices environments
Capacity and willingness to work in an agile multi-team environment
Demonstrated ability to prioritize tasks and promptly resolve problems
Ability to work autonomously, multi-task, and work in a fast-paced environment
You have a track record of making things better and leading solutions that remove technical pain points and facilitate growth
You enjoy working with others who are intelligent and passionate about building practical, reliable, high-performance products
Excellent communication skills in English
Experience in Observability Platforms.
Relevant software engineering experience with at least one language (Go, Ruby, Python, Erlang or Java)
offered :
Friendly working atmosphere
Wide range of trainings (technical / soft-skills / e-learning platform)
Opportunity to work in multiple projects
Multidisciplinary and multicultural team
Working with the latest technologies on the market
Monthly integration budget
Possibility to attend local and foreign conferences
Opportunity to participate in science research (scientific papers, project proposals, patents applications, development of own side-projects)
Office in Warsaw Spire near metro station
Attractive relocation package
Hybrid model of work – 3 days from the office per week
benefits :
sharing the costs of sports activities
private medical care
sharing the costs of foreign language classes
life insurance
corporate products and services at discounted prices
integration events
dental care
no dress code
leisure zone
pre-paid cards
redeployment package
baby layette
employee referral program
charity initiatives
unlimited free access to Copernicus Science Center