Job brief.

Role: Lead Site Reliability Engineer
Location: Pune and Noida
Culture: Hybrid
Experience – 6 -9 Years

Requirements.

Looking for candidates having exposure for monitoring tools, python, linux, terraform, Jenkins, Kubernetes.

  • Proficient in Splunk/ELK, and Datadog.
  • Experience with observability tools such as Prometheus/InfluxDB, and Grafana.
  • Possesses strong knowledge of at least one scripting language such as Python, Bash, Powershell or any other relevant languages.
  • Design, develop, and maintain observability tools and infrastructure.
  • Collaborate with other teams to ensure observability best practices are followed.
  • Develop and maintain dashboards and alerts for monitoring system health.
  • Troubleshoot and resolve issues related to observability tools and infrastructure.

Bachelor’s Degree in information systems or Computer Science or related discipline with relevant experience.

  • Proficient in Splunk/ELK, and Datadog.
  • Experience with Enterprise Software Implementations for Large Scale Organizations
  • Exhibit extensive experience about the new technology trends prevalent in the market like SaaS, Cloud, Hosting Services and Application Management Service
  • Monitoring tools like : Grafana, Prometheus, Datadog,
  • Experience in deployment of application & infrastructure clusters within a Public Cloud environment utilizing a Cloud Management Platform
  • Professional and positive with outstanding customer-facing practices
  • “Can-do” attitude, willing to go the extra mile
  • Consistently follows-up and follows-through on delegated tasks and actions

Apply for this job

Use the form below to submit your job application.

Allowed Type(s): .pdf, .doc, .docx