Job Overview

We are seeking an experienced AWS L5 Engineer to lead our Managed Services division as a Technical Head. You will be instrumental in overseeing the architecture, security, availability, and operational efficiency of cloud infrastructure for enterprise clients. This role demands an expert in AWS cloud services with proven experience in large-scale, mission-critical environments. You will drive automation, ensure high availability, and deliver continuous improvements to support our clients' business needs.

Key Responsibilities

Operational Leadership:
- Lead the daily management of AWS infrastructure, focusing on operational excellence and cost optimization.
- Manage 24/7 cloud operations, ensuring performance, uptime, and SLA adherence.
- Conduct incident management, troubleshooting, and root cause analysis (RCA) to resolve escalations efficiently.
- Implement disaster recovery (DR) strategies and perform regular failover testing to ensure business continuity.

Proactive Monitoring & Automation:
- Develop and implement automated monitoring solutions using AWS CloudWatch, ELK Stack, and other monitoring tools to address alerts proactively.
- Create and maintain automation scripts for backups, patching, and routine tasks using AWS Systems Manager, Lambda, or custom scripts.
- Regularly audit AWS environments to enforce security policies, best practices, and regulatory compliance.

Cost Optimization:
- Continuously monitor AWS usage and recommend optimizations to improve cost efficiency without sacrificing performance.
- Manage Reserved Instances, Spot Instances, and S3 lifecycle policies for cost savings.
- Collaborate with finance teams to forecast cloud spend and ensure adherence to budgetary constraints.

Security & Compliance Management:
- Oversee security configurations, including VPC setups, IAM roles, and encryption standards, to ensure adherence to best practices.
- Perform regular security audits and vulnerability assessments, ensuring compliance with frameworks such as GDPR, SOC 2, and ISO 27001.
- Lead initiatives in patch management, security remediations, and adherence to governance protocols.

Infrastructure Automation & IaC:
- Lead the implementation of Infrastructure-as-Code (IaC) using AWS CloudFormation or Terraform to automate provisioning and scaling.
- Develop automation frameworks for monitoring, backup, scaling, and recovery, leveraging AWS services like Lambda, CloudWatch, and CloudFormation.

Client Relationship & Service Management:
- Act as the primary technical lead for client engagements, managing escalations and providing timely resolutions.
- Collaborate with account management teams to deliver monthly and quarterly reports on infrastructure performance, cost, and security metrics.
- Engage with clients to identify opportunities for service improvement and ensure optimal service delivery.

Continuous Improvement & Innovation:
- Drive innovation by staying up to date on AWS technologies and industry trends, making recommendations to enhance client environments.
- Lead service improvement plans (SIPs) based on key performance indicators (KPIs) and SLA reviews.
- Spearhead initiatives to improve automation, incident response times, and infrastructure reliability.

Qualifications

Experience:
- 15+ years of hands-on experience in managing cloud environments, with 5+ years in AWS-focused roles.
- Proven experience in managing large-scale, high-availability, mission-critical workloads on AWS.
- Deep understanding of ITIL and other service management methodologies for operational excellence.

Technical Expertise:
- Expertise in core AWS services such as EC2, S3, RDS, Lambda, CloudWatch, and VPC networking.
- Strong knowledge of monitoring tools such as AWS CloudWatch, ELK Stack, Prometheus, or Datadog.
- Advanced scripting skills in Python, Bash, or similar for automating tasks and infrastructure management.
- Deep experience with IAM, security governance, and cost management tools.
- Thorough understanding of network security concepts, including VPC management, security groups, and firewall configurations.

Soft Skills:
- Excellent problem-solving abilities with a focus on root cause analysis and proactive issue management.
- Strong communication skills for effectively collaborating with technical and non-technical teams.
- Proven ability to lead in high-pressure environments, balancing multiple priorities and delivering under tight deadlines.

Preferred Qualifications:
- AWS Certifications such as AWS Certified SysOps Administrator or AWS Certified Solutions Architect.
- Hands-on experience with multi-cloud environments, including Azure or GCP.
- Familiarity with automation tools like AWS Systems Manager, AWS Lambda, and third-party cloud management tools.
- Working knowledge of compliance frameworks such as GDPR, SOC 2, ISO 27001, and ITIL.
Employment Category:
Employment Type: Full time
Industry: IT Services & Consulting
Role Category: Not Specified
Functional Area: Not Specified
Role/Responsibilities: AWS L5 Engineer - Managed Services Job in