We are looking for a Site reliability Engineer with BlueConch Technologies. Please follow the below mentioned process if you are looking out for an opportunity.
If interested, share below details at the earliest along with your CV.
Current CTC:
Expected CTC:
What is your notice period:
Are you ready for Pune as a job location:
SRE
Skill Devops, Cloud (GCP)
Engage in and improve the software development lifecycle from inception and design, through development, deployment, operation and refinement for greater reliability.
Influence and design infrastructure, architecture, standards and methods for the systems.
Support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews.
Maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health.
Automate system scalability and continually work to improve system resiliency, performance and efficiency
Practice sustainable incident response and blameless postmortems
Remediate tasks within corrective action plan via sustainable, preventative, and automated measures whenever possible
Provide technical guidance or support for the development or troubleshooting of systems
Binding and orchestrating the system infrastructure with the application layer to enable High Availability/Clustering load balancing and integration
Responsible for establishing end-to-end monitoring and alerting on all critical aspects to ensure SLOs, SLIs, and SLAs and get proactive notifications of possible issues for all systems
Establish performance baseline, capacity thresholds, correlate events, and define monitoring/alerting criteria
Create Run Books, first level of support for Production Incidents, Conduct Post Incident reviews
Administering software in public cloud
Build Tool Creation and Access (Bitbucket/Jenkins)
Setting up bitbucket repos and laying out the code management and review process
Continuous Integration (Jenkins/Rundeck) for CICD Pipelines
Setting up infrastructure and application monitoring
Provision service accounts and all required roles/permissions through Terraform scripts
Provisioning all required cloud services through Terraform scripts
Confluence page creation and access provision to team members
Jira board creation and access provision to team members for Requirements Management/Defect Tracking.
Setup of Code scan tools and integration of SonarQube and Fortify
Assist in required platform setup in both on-prem and in cloud for DEV/QA/UAT/PROD and access provision to team members
Ensuring a non-disrupted operation after Go Live. This will be in terms of L2 support, app monitoring, log analysis, and coordination with DEV team for L3
Deployment/ Migration of applications to required environments.
Raising Change Request and getting approval from CAB.
Working with Security advisement team on getting necessary approvals
GCP specific
Responsibilities:
GCP Certification preferred
Two plus years of experience in designing and deploying enterprise solutions in Google Cloud
Experience with cloud services such as GKE, VPC, Subnets, Load-balancers, Interconnect, Cloud storage, BigTable, CloudSQL, BigQuery, IAM, Stack driver monitoring
Experience with one or more Infrastructure as Code (IaC) tools such as Terraform (preferred)
Best Regards,
Ne***********a@bl**********h.com
Keyskills: Site Reliability Engineer GCP sre
With over Six years of IT experience, combining extensive capabilities in technology with deep domain expertise, we deliver seamless solutions that bring tangible business value to leading organizations around the world.Simplicity is a virtue. That's how we would like to describe our people and work...