Have an account?
  • Personalized content
  • Your products and support
Forgot password?
Register
Need an account?
Create an account

Site Reliability Engineer

Investigo Change Solutions

More jobs from this company

Site Reliability Engineer

Site Reliability Engineer

Permanent

£70,000

Midlands based/Hybrid working

As the Site Reliability Engineer you will be joining the clients Platform Engineering Team to help build, manage, and support some of the clients core infrastructure.

Key areas of responsibilities:

  • Ensuring the platform services meet high standards for availability, reliability, and performance
  • Defining and promoting best practices for observability, incident management, and operational processes
  • Leading incident management efforts
  • Partner with platform engineers and product teams
  • Develop and maintain monitoring, logging, and alerting solutions to provide actionable insights into platform health and performance

Key Skills

  • You will have a deep understanding of concepts such as SLAs, SLOs, and error budget
  • You will have expertise in tools such as Prometheus, Grafana, Loki, or similar
  • You will have experience in leading incident response processes, including root cause analysis and implementing preventative measures
  • You will be proficient in Scripting languages (eg, Python, Bash)
  • You will need to work effectively with cross functional teams
  • You will be a problem solver
About the Company

Job Specification

30 Jan 2025

Job Location

Job type

Full time

Job category

Information Technology, Telecommunications

Monthly salary