Job Description
Your new day-to-day will see you proactively monitoring critical infrastructure and supporting the live production environment. These systems underpin the entire efficiency of the business's operations, downtime costs the company serious cash. You’ll ensure things are fixed quickly and good monitoring is in place to make sure issues are identified and solved before they can cause too much damage. You will be working with tools such as Grafana, Splunk, and New Relic.
You will be monitoring key infrastructure using bespoke tools and responding to alerts from the Network Operations Centre (NOC). You’ll investigate incidents, resolving what you can and escalating when necessary. Communication is key, working with different teams, stakeholders, and the senior team in order to ensure the operations continue smoothly.
Performing daily checks to keep production systems in top shape, coordinating planned maintenance, and managing potential scheduling conflicts will be your r...