Job Description
Hello Job Seekers,
Greetings from SoftSol India Limited,
We are hiring for the position below.
Location: Hyderabad (preferred), Bangalore (optional)
Program Overview:
Site Reliability Engineering (SRE) is what we get when we treat operations as a software problem. The mission is to keep services available 24x7, meet performance targets with low latency, and maintain capacity for future growth. Treating operations as code opens up broad scope for automated provisioning, deployment, and patching, and for faster response to critical issues.
As a member of a small cross-functional squad, you'll own a particular infrastructure challenge. You'll design and document systems, including writing and reviewing code, to automate away problems. You'll undertake measured, methodical troubleshooting of complicated systems under pressure, and take part in an on-call rotation alongside the engineers who build our production backend.
Key responsibilities:
- Draw on experience with agile continuous delivery and DevOps to champion the culture, processes, and tools required to maintain a frictionless, high-quality development environment
- Dive deep into technology, stay at the forefront of the latest tools, technologies, and strategies, and help evaluate, prototype, and introduce them to our team
- You will assist in building, maintaining and debugging state-of-the-art engineering technical
- Develop strategies for using automation tools to provision infrastructure hardware, software, and other technical components as part of the orchestration
- Communicate technical ideas to non-technical team members
- Participate in defining microservices infrastructure from inception and design, through deployment, operation and continuous refinement
- Support services before they go live through activities such as system design, deployment automation, capacity planning and launch reviews
- Maintain services once they are live by measuring and monitoring availability, latency and overall system health
- Scale systems through mechanisms like automation
- Evolve systems by pushing for changes that improve reliability and velocity
- Provide input to a Risk Management Plan that anticipates reliability-related and non-reliability-related risks that could adversely impact plant operation
- Develop engineering solutions to repetitive failures and all other problems that adversely affect plant operations, including capacity, quality, cost, or regulatory compliance issues
- Other parameters that define operating condition, reliability and costs of assets
- Provide technical support to production, maintenance management, and technical personnel
- Apply value analysis to repair/replace, repair/redesign, and make/buy decisions
Eligibility:
- Minimum 5 years' experience in Linux/UNIX/Windows systems administration
- 2-4 years' experience in a role supporting cloud-based solutions or as an SRE
- Bachelor's degree in Computer Science or equivalent practical experience
- Experience in one or more of the following: Python, Go, Perl, PowerShell, or shell scripting
- Experience with Unix/Linux operating systems internals and administration
- Extensive experience with GitLab and Jenkins
- Familiarity with data analysis techniques, which can include:
  - Reliability modeling and prediction
  - Fault Tree Analysis
  - Root Cause Analysis and Root Cause Failure Analysis
  - Failure Reporting, Analysis and Corrective Action Systems
- Extensive experience with cloud platforms, Kubernetes, and Docker
- Expertise in designing, analyzing and troubleshooting large-scale distributed systems
- Ability to debug and optimize code and automate routine tasks
- Ability to apply a systematic approach to solving problems, with a sense of ownership and focus
- Strong communication skills with the ability to articulate technical details to different audiences
Please reply or send your updated CV to the contact details below.
Ashwin Kumar
As*********r@so****l.com
Technical Recruiter || Talent Acquisition
SOFTSOL INDIA LIMITED
Mobile: +91 6305972***
Job Classification
Industry: Miscellaneous
Functional Area: Engineering - Software
Role Category: DevOps
Role: DevOps
Employment Type: Contract
Education
Under Graduation: Any Graduate
Post Graduation: MBA/PGDM in HR/Industrial Relations
Contact Details:
Company: Softsol India Ltd.
Address: SEZ IT/ITES, HILL NO.2, PLOT NO.6, RUSHIKONDA, MADHURAWADA, Visakhapatnam, Andhra Pradesh, India
Location(s): Hyderabad
Keyskills:
Terraform
Site Reliability Engineering
DevOps