Iron Mountain Logo

Iron Mountain

Lead Site Reliability Engineer

Reposted 5 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
Senior level
In-Office or Remote
2 Locations
Senior level
Responsible for implementing and enhancing enterprise observability and automation platforms, ensuring optimal network performance and compliance with governance standards.
The summary above was generated by AI

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways. 

Are you curious about being part of our growth stor​y while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

Job Summary

Iron Mountain is seeking a proactive and skilled Observability Automation & Integration Lead Engineer to join our Infrastructure Transformation team.

In this role, you will be responsible for implementing, managing, and enhancing enterprise observability and automation platforms to ensure optimal network and application performance across a global ecosystem.

The Infrastructure Transformation team is a dynamic group dedicated to modernizing our technical infrastructure, driving efficiency through automation, and ensuring the continuous availability of critical systems.

What You'll Do (Responsibilities)

In this role, you will:

  • Responsibility 1: Drive Enterprise Platform Engineering - Design, implement, and maintain highly available, 24x7 continuous monitoring solutions using platforms like Datadog and SolarWinds, including configuring alerts, creating dashboards, and conducting data trend analysis.

  • Responsibility 2: Champion Automation & Integration - Collaborate with Enterprise Architects and operations teams to automate infrastructure operations, integrate monitoring data with platforms like Configuration Management Database (CMDB)/ServiceNow, and identify opportunities for proactive monitoring solutions.

  • Responsibility 3: Ensure Design and Operational Adherence - Ensure compliance with architectural governance and security standards in all designs, drive process improvements, and provide on-call support for critical issues outside of normal business hours.

What You'll Bring (Skills & Qualifications)

The ideal candidate will have:

  • 10+ years of experience in monitoring platform engineering with tools such as Datadog, SolarWinds, Prometheus, or Grafana.

  • Strong knowledge of network and application performance monitoring, including configuring monitors using protocols like Simple Network Management Protocol (SNMP), Secure Shell (SSH), Windows Remote Management (WinRM), Windows Management Instrumentation (WMI), or Java Management Extensions (JMX).

  • Proven ability in automating infrastructure operations using tools like Ansible and Python and integrating systems via Representational State Transfer (REST) Application Programming Interface (API)/scripting.

  • Bachelor's degree in Computer Science, Information Technology, or a related field.

What We Offer (Benefits)

This section lists benefits specific to the role and region. Since this information was not included in the original job description, I will include the standard Iron Mountain offerings from the template as a placeholder, which can be modified based on the role and location requirements:

  • Competitive compensation and benefits aligned with the experience.

  • Flexible work options/alternative work options to support work-life balance.

  • Comprehensive health, wellness, and retirement plans.

  • Opportunities for continuous learning and professional growth.

Call to Action

If you are passionate about building scalable, high-performance systems and enhancing enterprise observability, apply today to join the Iron Mountain Infrastructure Transformation team!

Category: Information Technology

Top Skills

Ansible
Datadog
Grafana
Jmx
Prometheus
Python
Rest Api
Snmp
Solarwinds
Ssh
Winrm
Wmi

Similar Jobs

8 Days Ago
Remote or Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
Senior level
Senior level
Cloud • Software
Lead SRE responsible for designing, building, and optimizing cloud and big-data infrastructure to ensure availability, scalability, and security of ML/AI systems. Provide technical leadership, mentor teams, troubleshoot production issues, drive automation, and define the platform roadmap while collaborating with cross-functional stakeholders.
Top Skills: AirflowAlertmanagerAWSCloudwatchEksElkEmrGoGobblinGrafanaHadoopHdfsHiveKubernetesLinuxOpentelemetryPrometheusPythonSagemakerSparkTerraformThanos
13 Days Ago
In-Office or Remote
2 Locations
Senior level
Senior level
Fintech • Financial Services
Lead SRE/Platform Engineer to design, build, and operate enterprise monitoring, observability, and automation platforms. Maintain Grafana/Prometheus/Splunk ecosystems, deploy containerized workloads on OpenShift, develop automation scripts, integrate with ServiceNow, implement SLI/SLOs, support CI/CD, and ensure security and governance across platforms.
Top Skills: Grafana,Prometheus,Splunk,Dynatrace,Opsramp,Solarwinds,Openshift,Docker,Python,Shell,Powershell,Servicenow,Ci/Cd,Git,Linux,Melt,Terraform,Ansible,Power Automate,Jenkins,Api Integration,Cloud
4 Days Ago
Remote
India
Senior level
Senior level
Information Technology • Consulting
The Senior SRE Engineer will ensure the reliability of microservices, manage incidents, and collaborate on automation, security, and infrastructure projects while participating in a 24/7 on-call rotation.
Top Skills: AWSAzureCi/CdContainersGCPJavaOrchestrationPython

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account