Position Name: Junior AWS SRE Total Experience : 3 to 5 years Job Location: Mumbai Setting up world class observability platform for Multi Cloud Infrastructure services. Reviewing and contributing to setting up observability for infrastructure of new/existing cloud apps. Analyzing, troubleshooting, and designing vital services, platforms, and infrastructure while always thinking about reliability, scalability, resilience, automation, security, and performance Continue improving cloud product reliability, availability, maintainability & cost/benefitincl. developing fault-tolerant tools to ensure general robustness of the cloud infra. Responsible for availability, performance, monitoring, and incident response, among other things, of the platforms and services of cloud Landing zone. Manage capacity across public and private cloud resource poolsincl. automating scale down/up of environments. Ensuring that everything that goes to production complies with a set of general requirements like diagrams, documents, security compliance, dependencies of other services, monitoring and logging plans, backups, and possible high availability setups. Ensuring the efficient functioning of cloud resources and functions in accordance with company security policies and best practices in cloud security Employ exceptional problem-solving skills, with the ability to see and solve issues before they affect business productivity. Support developers in optimizing and automating cloud engineering activities, e.g. real-time migration, provisioning and deployment, etc. Monitoring and action of hardware degradation, networking problems, high usage of resources, or slow responses on cloud Landing zone. Preparing and managing runbook having procedures necessary for getting services up and running again quickly in case of any issues. Enable automation for some of key functions like CI/CD across SDLC phases, monitoring, alert, incident response, infra provisioning, and patching. As Site Reliability Engineers focus on system reliability, they reduce operational expenses, lessen and mitigate failure points, while automate monotonous time and resource-wasting tasks resulting in economic savings both in terms of effort and money. Failure resolution is preemptive, as SRE Engineers, identify failure causes early while mitigating faults more holistically. Developing and maintaining cloud solutions in accordance with best practices. Perform Incident Analysis on a regular basis with the intention of preventing and finding a long term solve for Incidents *Interested candidates can drop their CVs on hidden_email,
Employement Category:
Employement Type: Full time Industry: IT Services & Consulting Role Category: Not Specified Functional Area: Not Specified Role/Responsibilies: Senior Aws Sre - Devops Job In Blazeclan
Contact Details:
Company: Blazeclan Technologies Location(s): Other Maharashtra