Job brief.
Role: Lead Site Reliability Engineer
Location: Pune and Noida
Culture: Hybrid
Experience – 6 -9 Years
Requirements.
Looking for candidates having exposure for monitoring tools, python, linux, terraform, Jenkins, Kubernetes.
- Proficient in Splunk/ELK, and Datadog.
- Experience with observability tools such as Prometheus/InfluxDB, and Grafana.
- Possesses strong knowledge of at least one scripting language such as Python, Bash, Powershell or any other relevant languages.
- Design, develop, and maintain observability tools and infrastructure.
- Collaborate with other teams to ensure observability best practices are followed.
- Develop and maintain dashboards and alerts for monitoring system health.
- Troubleshoot and resolve issues related to observability tools and infrastructure.
Bachelor’s Degree in information systems or Computer Science or related discipline with relevant experience.
- Proficient in Splunk/ELK, and Datadog.
- Experience with Enterprise Software Implementations for Large Scale Organizations
- Exhibit extensive experience about the new technology trends prevalent in the market like SaaS, Cloud, Hosting Services and Application Management Service
- Monitoring tools like : Grafana, Prometheus, Datadog,
- Experience in deployment of application & infrastructure clusters within a Public Cloud environment utilizing a Cloud Management Platform
- Professional and positive with outstanding customer-facing practices
- “Can-do” attitude, willing to go the extra mile
- Consistently follows-up and follows-through on delegated tasks and actions