Develop software to make infrastructure services self-managing and self-service
Deliver continuous service improvement by developing Infrastructure as Code
Eliminate manual, repetitive, automatable, tactical tasks that are devoid from value
Improve system performance, make effective use of resources, distribute load and reduce latency
Identify SLOs (Service Level Objectives) to meet availability and latency objectives
Develop pro-active monitoring solutions that alert on symptoms and not just on outages
Perform detailed root cause analysis (RCAs) on incidents and outages to prevent future
Partner with development teams to improve services via rigorous testing and release procedures
Identify technical debt and partner with application teams to build remediation plans
Develop standard operational procedures and produce effective documentation
Analyse workloads and devise suitable cloud migration strategies where appropriate
Ensure all project / investment workloads are delivered according to plans and budget defined
Liaise with Infrastructure Control and IT Risk teams to satisfy internal and external audit requests
Deputise for team lead when required to do so and act-up accordingly
Identify cost saving and optimisation opportunities across the group
Build strong working relationships across the organisation
Adhere to the core values of the bank
Perform daily health and compliance checks for all systems as required
Ensure all systems are backed up successfully and any issues are promptly resolved
Validate monitoring alerts and batch job failures are detected promptly and satisfactorily resolved
Ensure sufficient capacity is available to accommodate drive growth
Respond to emails sent to the team distribution list / mailboxes in a timely manner
Handle incidents and requests with efficiency and a "customer first" mindset
Maintain infrastructure in a highly available, reliable, secure and performant manner
General Server / Database / Virtualisation Administration maintenance activities
Provide technical support to application support and development teams
Provide consultancy to application support and development teams
Take part in On-Call weekend work rotation; triaging and addressing production issues as they arise