Site Reliability Engineering (SRE) Team Lead
Hybrid: Once a week in office
About Akixi
Akixi is a fast-growing, profitable, privately owned company based in West Sussex, UK. Our portfolio of cloud-based, real-time call and contact analytics software is delivered through our network of IT and telecoms partners around the world, and we have over 7,000 active customer sites.
We are proud to have been recognised within the industry, winning 'Best Analytics Platform' at the UC Awards 2020 and 'Best Call Management Solution' at the Comms National Awards 2020.
Akixi is part of the Cisco Partner Ecosystem and a member of the Cisco Solution Partner Program.
Job Summary
We are seeking an experienced and highly motivated Site Reliability Engineering (SRE) Team Lead / Technical Lead to join our dynamic team. This is a pivotal role responsible for leading and developing a team of SREs, improving operational excellence, and shaping the future of our infrastructure and reliability engineering practices.
The ideal candidate will have hands-on experience with AWS infrastructure and automation, and will be able to drive strategic initiatives while fostering a high-performing, collaborative, and resilient engineering culture.
Key Responsibilities
· Lead, mentor, and develop a team of DevOps Engineers.
· Establish clear goals, provide regular feedback, and foster professional growth within the team.
· Set performance standards and deliver regular performance evaluations.
· Define and drive the SRE strategy aligned to the organization's broader technology and business goals.
· Collaborate with engineering, security, and product teams to align priorities and ensure seamless service delivery.
· Develop and advocate for best practices in reliability, performance, and scalability.
· Architect, implement, and manage highly available, scalable, and secure AWS infrastructure.
· Drive the automation of infrastructure and operational tasks using Ansible and similar tools (e.g., Terraform, CloudFormation).
· Foster a cloud-native mindset and promote design patterns such as microservices, containerization (e.g., ECS, EKS, Fargate), and serverless technologies.
· Oversee incident response practices, drive root cause analysis, and champion continual improvement processes.
· Monitor and improve system reliability, availability, and performance.
· Establish and enforce SLAs, SLOs, and error budgets.
· Implement robust monitoring, logging, and alerting strategies (e.g., CloudWatch, Datadog, Prometheus).
· Effectively communicate technical concepts to non-technical stakeholders and senior leadership.
· Represent the SRE function across the organization and in technical leadership forums.
Required Skills & Experience
· Proven experience leading, managing, and developing technical teams.
· Deep expertise in AWS services (EC2, ECS, RDS, S3, Fargate, IAM, VPC, etc.).
· Strong proficiency in automation and configuration management tools (especially Ansible; familiarity with Terraform is useful).
· Solid understanding of DevOps and SRE principles (e.g., CI/CD pipelines, IaC, version control, GitOps).
· Good knowledge of modern monitoring and observability practices.
· Strategic and critical thinker capable of balancing technical, business, and operational needs.
· Excellent planning, organizational, and communication skills.
· Experience in cloud-native application design and architecture.
Desirable Skills
· Knowledge of and experience with Microsoft Azure cloud services.
· Kubernetes and container orchestration expertise.
· Experience with security best practices in cloud environments.
· Exposure to compliance and regulatory requirements (e.g., ISO 27001, SOC 2).
What We Offer
· Competitive salary and benefits package.
· A dynamic, supportive work environment focused on innovation and growth.
· Flexibility in working hours and remote working options.
· 25 days' holiday (increasing by 1 day for each year of service, up to 30 days), plus 1 day off for your birthday.
· Pension
· EV salary sacrifice car scheme
· Training opportunities
· Private medical insurance, including cover for a spouse.
- Department: Engineering
- Remote status: Hybrid
- Location: Crawley
Our Perks & Benefits
- 🏄🏻♀️ 25 days’ leave (increasing by one day for each year of service, to a maximum of 30 days)
- 🎂 Day off for your birthday
- 💊 Private healthcare
- 💹 Company pension
- 👓 Financial contribution towards eye test/glasses
- 🎉 Social events
Workplace, Culture & Diversity
Corporate culture is essential to how an organization differentiates itself. It strengthens the company's image both internally and externally. Internally, it is a source of cohesion and motivation for employees and helps limit conflict. With customers, it conveys a positive image, can build a sense of closeness to the company, and can even become a deciding factor in choosing it.