We are seeking a highly motivated and experienced Team Leader to manage and mentor our talented Admins/DevOps team. In this role, you will be responsible for:
- overseeing the day-to-day operations of the team,
- ensuring the smooth and efficient running of our IT infrastructure,
- driving the adoption of DevOps/FinOps best practices, as well as improving our security posture.
The ideal candidate possesses a solid technical background with specific expertise in GCP, Terraform, and Grafana, excellent leadership skills, and a passion for fostering a collaborative and high-performing team environment.
Responsibilities:
- Team Leadership & Management:
  - Lead, mentor, and coach a team of 10 system administrators and DevOps engineers.
  - Foster a positive and collaborative team environment that encourages knowledge-sharing and innovation.
  - Own and actively develop the technical improvements roadmap.
  - Delegate tasks effectively, ensuring team members have the resources and support they need to succeed.
- Infrastructure Management:
  - Oversee the maintenance and administration of our IT infrastructure, including servers, networks, databases, and GCP services.
  - Ensure high availability, performance, and security of all systems.
  - Implement monitoring and alerting systems to proactively identify and resolve issues.
  - Develop and maintain comprehensive documentation for all infrastructure components.
- DevOps Implementation:
  - Manage the company’s infrastructure using Terraform and Ansible.
  - Champion the adoption of DevOps principles and practices across the organisation.
  - Design and implement organisation-wide CI/CD pipelines to automate the build, test, and deployment of software applications.
  - Collaborate with development teams to optimise application performance and scalability.
- Monitoring & Observability:
  - Implement and maintain monitoring tools and dashboards (using Grafana) to track system health, performance, and availability.
  - Define and track key performance indicators (KPIs) for infrastructure and applications.
  - Proactively identify and resolve performance bottlenecks and system issues.
  - Participate in incident response and post-mortem analysis to prevent future occurrences.
Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- 7+ years of proven experience, including at least 2 years in a similar role leading and managing a team of IT professionals.
- Strong understanding of Linux server administration.
- Extensive experience with Google Cloud Platform (GCP) services and infrastructure.
- Proficiency in Terraform for infrastructure provisioning and management.
- Experience with Grafana for building dashboards and visualising system metrics.
- Proficiency in scripting languages (e.g., Bash, Python) and automation tools (e.g., Ansible, Puppet, Chef).
- Excellent communication, interpersonal, and leadership skills.
- Strong problem-solving and analytical abilities.
Bonus Points:
- Deep understanding of SRE principles and practices.
- Knowledge of CI/CD tools (e.g., Jenkins, GitLab CI, CircleCI) and containerisation technologies (e.g., Docker, Kubernetes).
- Experience with database administration (e.g., MySQL, PostgreSQL).
- Knowledge of security best practices and compliance frameworks (e.g., ISO 27001, SOC 2).
- Certifications in relevant technologies (e.g., Google Cloud Certified Professional Cloud Architect, HashiCorp Certified: Terraform Associate).