Overview
Visa is a world leader in payments and technology, with a mission to connect the world through an innovative, convenient, reliable, and secure payments network. Join a purpose-driven organization and experience Life at Visa.
The Staff Site Reliability Engineer (SRE) is a critical part of our Visa Cloud platform strategy. This role focuses on ensuring Visa’s development platform and tooling enable engineers to innovate by reducing infrastructure management. You will drive observability, instrument automation for recurring issues, and collaborate with software engineering teams to ensure security, availability, and performance of the platform. The role requires hands-on engineering to develop reliability for the Visa Cloud Platform.
This is a hybrid position with days in the office to be confirmed by the Hiring Manager. Visa Cloud SRE operates on a 24/7/365 model and may require shift or on-call coverage, including weekends.
Responsibilities
- Guide the instrumentation of monitoring for the DevTools (IaaS/PaaS/Container as a Service).
- Ensure platform target SLAs are met and implement appropriate SLIs for supporting services.
- Collaborate with developers during service transitions to evaluate reliability and operability, ensuring adequate monitoring, alerting, and observability.
- Partner with peers within Operations & Infrastructure to support ongoing maintenance and enhancements of the platform.
- Set standards for automating routine tasks and workflows in support of the larger DevEx SRE team.
- Support multiple internal stakeholders with a variety of technical challenges and analyze patterns to propose effective solutions.
- Contribute to 24/7/365 operations, including on-call or shift work as required.
Qualifications
Basic Qualifications:
- 5+ years of relevant work experience with a Bachelor’s Degree or 2+ years with an Advanced degree, or 0 years with a PhD, OR 8+ years of relevant work experience.
Preferred Qualifications:
- 5+ years of relevant work experience with a Bachelor’s Degree or 2 years with an Advanced degree or 0 years with a PhD, OR 8+ years of relevant work experience.
- Master’s Degree in IT, CS, or a related field and/or 5+ years of relevant work experience.
- Hands-on experience with Linux and Windows systems and a good understanding of distributed computing environments.
- Intermediate programming/scripting in Python, Java, Go, PowerShell, JavaScript, Terraform, Ansible, Helm, Chef, or CloudFormation.
- 2+ years of experience managing CI/CD tooling (e.g., Jenkins, GitHub, Bitbucket, ArgoCD, Artifactory, Azure DevOps) in a large-scale environment.
- 3+ years of experience managing observability tooling (e.g., Grafana, Prometheus, Splunk, Datadog, New Relic, Dynatrace, Sentry) in a large-scale environment.
- Advanced understanding of YAML, JSON, HTML, XML.
- 2+ years of experience supporting relational and non-relational databases (e.g., MySQL, MongoDB, PostgreSQL), including queries, performance, and scaling.
- Experience managing container infrastructure and enabling a container-first transformation.
Additional Information
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status. Visa will consider qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.