Jade Global Logo

Jade Global

Senior Site Reliability Engineer (SRE) – Datadog Observability

Reposted Yesterday
Be an Early Applicant
In-Office
5 Locations
Senior level
In-Office
5 Locations
Senior level
The role involves leading SRE initiatives focusing on Datadog observability, ensuring system reliability and scalability, and implementing best practices for incident management and automation.
The summary above was generated by AI
Senior Site Reliability Engineer (SRE) – Datadog Observability1

Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability
Experience Required: 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in Datadog
Location: Hyderabad preferable but open for Pune and remote
Job Summary:
We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE implementation initiatives with a strong focus on Datadog Observability. The ideal candidate will bring deep technical expertise in building reliable, scalable, and observable systems, with hands-on experience in integrating enterprise applications and middleware
Key Responsibilities:

 

  • Drive end-to-end SRE implementation, ensuring system reliability, scalability, and performance.
  • Design, configure, and manage Datadog dashboards, monitors, alerts, and APM for proactive issue detection and resolution.
  • Utilize the Datadog Roles API to create and manage user roles, global permissions, and access controls for various teams.
  • Collaborate with product managers, engineering teams, and business stakeholders to identify observability gaps and design solutions using Datadog.
  • Implement automation for alerting, incident response, and ticket creation to improve operational efficiency.
  • Work closely with business and IT teams to support critical Financial Month-End, Quarter-End, and Year-End closures.
  • Leverage Datadog AI
  • Provide technical leadership in observability, reliability, and performance engineering practices

Required Skills and Experience:
 

  • 8+ years of experience in Site Reliability Engineering, Observability
  • Minimum 3+ years of hands-on experience with Datadog (dashboards, APM, alerting, log management, Roles API, and monitoring setup).
  • Proven experience implementing SRE best practices—incident management, postmortems, automation, and reliability metrics
  • Excellent stakeholder management and communication skills; experience collaborating with business and IT teams.
  • Strong problem-solving mindset and ability to work in high-pressure production support environments.

Preferred Qualifications:
 

  • Certification in Datadog or related observability platforms.
  • Knowledge of CI/CD tools and automation frameworks.
  • Experience in cloud platforms (AWS, Azure, or OCI).
  • Exposure to ITIL-based production support processes.

Top Skills

Automation Frameworks
AWS
Azure
Ci/Cd Tools
Datadog
Oci

Jade Global Pune, Mahārāshtra, IND Office

Vadgaon Sheri Road, Nyati Tech Park, 7th Floor, , , Pune, Maharashtra , India, 411014

Jade Global Pune, Mahārāshtra, IND Office

7th Floor M Agile, Baner Road Baner, , , Pune, Maharashtra , India, 411045

Similar Jobs

Yesterday
Remote or Hybrid
19 Locations
Expert/Leader
Expert/Leader
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Lead and mentor a team of threat researchers focusing on malware analysis. Oversee technical guidance, team growth, and hands-on contributions during critical projects. Requires advanced knowledge in reverse engineering and threat research automation.
Top Skills: Automation WorkflowsGenerative AiLarge Language ModelsThreat Intelligence PlatformsYara Rule Generation
7 Days Ago
Hybrid
Jaipur, Rajasthan, IND
Senior level
Senior level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Unit Manager will lead training delivery and governance in GOSC, manage certification processes, create training materials, and mentor training teams.
Top Skills: MS OfficePower BI
9 Days Ago
Remote or Hybrid
18 Locations
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Engineering Manager will lead the Linux sensor development team, manage engineers, drive technical strategy, and ensure high code quality for cybersecurity features.
Top Skills: CC++EbpfKubernetesLinuxUnix

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account