Senior Site Reliability Engineer – SRE, Python
Site Reliability Engineering, DevOps, Jenkins, Docker, Kubernetes, Datadog, New Relic, Splunk, Grafana
Bangalore
We are seeking a talented and motivated Senior Site Reliability Engineer (SRE) to join our organization.
The SRE will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure and applications, including capacity planning.
Responsibilities
- Play a key role in monitoring and observability to ensure optimal performance
- Set up and maintain SLAs, SLOs, and SLIs for various services
- Manage and enhance microservices architectures
- Handle alert management and incident response effectively
- Develop and maintain code, including writing application logic
- Ensure smooth network communication within microservices environments
- Author and manage database queries
- Create and update YAML configuration files
- Design and implement robust CI/CD pipelines
Requirements
- 5 to 8 years of hands-on experience in site reliability engineering
- Proficiency in Python programming and scripting
- Background in microservices and familiarity with their network communication
- Expertise in alert management and incident management systems
- Strong coding skills and the ability to write sound program logic
- Ability to write and optimize database queries
- Experience designing and maintaining CI/CD pipelines
- Understanding of monitoring and observability, including setting up SLAs, SLOs, and SLIs