Jade Global Logo

Jade Global

Senior Site Reliability Engineer (SRE) – Datadog Observability

Posted 3 Days Ago
Be an Early Applicant
In-Office
5 Locations
Senior level
In-Office
5 Locations
Senior level
Lead SRE implementation initiatives with a focus on Datadog, ensuring system reliability and performance. Collaborate with teams to improve observability and implement automation for incident management.
The summary above was generated by AI
Senior Site Reliability Engineer (SRE) – Datadog Observability1

Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability
Experience Required: 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in Datadog
Location: Hyderabad preferable but open for Pune and remote
Job Summary:
We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on Datadog Observability. The ideal candidate will bring deep technical expertise in building reliable, scalable, and observable systems, with hands-on experience in integrating enterprise applications and middleware
Key Responsibilities:
 

  • Drive end-to-end SRE implementation, ensuring system reliability, scalability, and performance.

  • Design, configure, and manage Datadog dashboards, monitors, alerts, and APM for proactive issue detection and resolution.

  • Utilize the Datadog Roles API to create and manage user roles, global permissions, and access controls for various teams.

  • Collaborate with product managers, engineering teams, and business stakeholders to identify observability gaps and design solutions using Datadog.

  • Implement automation for alerting, incident response, and ticket creation to improve operational efficiency.

  • Work closely with business and IT teams to support critical Financial Month-End, Quarter-End, and Year-End closures.

  • Leverage Datadog AI

  • Provide technical leadership in observability, reliability, and performance engineering practices

Required Skills and Experience:
 

  • 8+ years of experience in Site Reliability Engineering, Observability

  • Minimum 3+ years of hands-on experience with Datadog (dashboards, APM, alerting, log management, Roles API, and monitoring setup).

  • Proven experience implementing SRE best practices—incident management, postmortems, automation, and reliability metrics

  • Excellent stakeholder management and communication skills; experience collaborating with business and IT teams.

  • Strong problem-solving mindset and ability to work in high-pressure production support environments.

Preferred Qualifications:
 

  • Certification in Datadog or related observability platforms.

  • Knowledge of CI/CD tools and automation frameworks.

  • Experience in cloud platforms (AWS, Azure, or OCI).

  • Exposure to ITIL-based production support processes.

Top Skills

AWS
Azure
Datadog
Oci

Jade Global Pune, Mahārāshtra, IND Office

Vadgaon Sheri Road, Nyati Tech Park, 7th Floor, , , Pune, Maharashtra , India, 411014

Jade Global Pune, Mahārāshtra, IND Office

7th Floor M Agile, Baner Road Baner, , , Pune, Maharashtra , India, 411045

Similar Jobs

18 Days Ago
In-Office
5 Locations
Senior level
Senior level
Information Technology • Consulting
The role involves leading SRE initiatives focusing on Datadog observability, ensuring system reliability and scalability, and implementing best practices for incident management and automation.
Top Skills: Automation FrameworksAWSAzureCi/Cd ToolsDatadogOci
11 Hours Ago
Remote or Hybrid
18 Locations
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Engineering Manager will lead the Linux sensor development team, manage engineers, drive technical strategy, and ensure high code quality for cybersecurity features.
Top Skills: CC++EbpfKubernetesLinuxUnix
3 Days Ago
Remote or Hybrid
16 Locations
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Sr. Software Engineer will create file format parsers, collaborate on machine learning features, and maintain software systems. Responsibilities include testing, optimization, and documentation.
Top Skills: AWSAzureBitbucketC++GCPGitJenkinsJIRAPythonRust

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account