Iron Mountain Logo

Iron Mountain

Lead Site Reliability Engineer

Reposted 3 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
Senior level
In-Office or Remote
2 Locations
Senior level
Responsible for implementing and enhancing enterprise observability and automation platforms, ensuring optimal network performance and compliance with governance standards.
The summary above was generated by AI

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways. 

Are you curious about being part of our growth stor​y while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

Job Summary

Iron Mountain is seeking a proactive and skilled Observability Automation & Integration Lead Engineer to join our Infrastructure Transformation team.

In this role, you will be responsible for implementing, managing, and enhancing enterprise observability and automation platforms to ensure optimal network and application performance across a global ecosystem.

The Infrastructure Transformation team is a dynamic group dedicated to modernizing our technical infrastructure, driving efficiency through automation, and ensuring the continuous availability of critical systems.

What You'll Do (Responsibilities)

In this role, you will:

  • Responsibility 1: Drive Enterprise Platform Engineering - Design, implement, and maintain highly available, 24x7 continuous monitoring solutions using platforms like Datadog and SolarWinds, including configuring alerts, creating dashboards, and conducting data trend analysis.

  • Responsibility 2: Champion Automation & Integration - Collaborate with Enterprise Architects and operations teams to automate infrastructure operations, integrate monitoring data with platforms like Configuration Management Database (CMDB)/ServiceNow, and identify opportunities for proactive monitoring solutions.

  • Responsibility 3: Ensure Design and Operational Adherence - Ensure compliance with architectural governance and security standards in all designs, drive process improvements, and provide on-call support for critical issues outside of normal business hours.

What You'll Bring (Skills & Qualifications)

The ideal candidate will have:

  • 10+ years of experience in monitoring platform engineering with tools such as Datadog, SolarWinds, Prometheus, or Grafana.

  • Strong knowledge of network and application performance monitoring, including configuring monitors using protocols like Simple Network Management Protocol (SNMP), Secure Shell (SSH), Windows Remote Management (WinRM), Windows Management Instrumentation (WMI), or Java Management Extensions (JMX).

  • Proven ability in automating infrastructure operations using tools like Ansible and Python and integrating systems via Representational State Transfer (REST) Application Programming Interface (API)/scripting.

  • Bachelor's degree in Computer Science, Information Technology, or a related field.

What We Offer (Benefits)

This section lists benefits specific to the role and region. Since this information was not included in the original job description, I will include the standard Iron Mountain offerings from the template as a placeholder, which can be modified based on the role and location requirements:

  • Competitive compensation and benefits aligned with the experience.

  • Flexible work options/alternative work options to support work-life balance.

  • Comprehensive health, wellness, and retirement plans.

  • Opportunities for continuous learning and professional growth.

Call to Action

If you are passionate about building scalable, high-performance systems and enhancing enterprise observability, apply today to join the Iron Mountain Infrastructure Transformation team!

Category: Information Technology

Top Skills

Ansible
Datadog
Grafana
Jmx
Prometheus
Python
Rest Api
Snmp
Solarwinds
Ssh
Winrm
Wmi

Similar Jobs

52 Minutes Ago
Remote or Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
Senior level
Senior level
Cloud • Software
As a Lead Site Reliability Engineer, you'll ensure cloud and big data platform reliability, collaborating with teams to design solutions, optimize infrastructure, and mentor others.
Top Skills: AirflowAWSAws BedrockAws SagemakerCloudwatchElk StackEmrGoGobblinGrafanaHdfsHiveHudiIcebergKafkaKubernetesMapreduceOpentelemetryOrcPrometheusPysparkPythonSparkTerraformThanosYarn
18 Days Ago
Easy Apply
Remote
India
Easy Apply
Mid level
Mid level
eCommerce
The Site Reliability Engineer ensures scalability and reliability of infrastructure and applications through automation and incident response while collaborating with DevOps teams.
Top Skills: AnsibleBashCi/CdDockerGCPGoogle Cloud Operations SuiteGoogle Cloud PlatformGrafanaJenkinsKubernetesPrometheusPythonTerraform
13 Days Ago
Remote
Shri Bhrigukshetra, BLR, Uttar Pradesh, IND
Senior level
Senior level
Fintech • Analytics
As a Lead Site Reliability Engineer, you will oversee system availability, automation, incident response, and support cloud migration and observability practices.
Top Skills: AWSAzureBigpandaC#DatadogEntra IdGCPGoJavaPythonTerraformUnix/LinuxWindows

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account