At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.
We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways.
Are you curious about being part of our growth story while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.
Qualifications:
- Minimum 6 years of experience as a Global Operations and Support Engineer
- Support experience in Records Management Applications and Data Management Applications that deliver records, boxes, files and data management capabilities such as storing, archiving, shredding, asset transfer, permanent withdrawal, destruction, etc.
- Build and/or manage the operational processes and procedures for Cloud/Web/4GL/Data Center application systems that encompass key business functions such as inventory transfer, billing, reporting, pricing updates, warehouse moves, location and data analysis.
- Experience in Support Operations such as triaging, optimization, performance improvement, status communication and workflow redesign.
- Responsible for leading the technical strategy for our underpinning infrastructure, alerting and monitoring, and incident resolution to meet MTTR targets.
- Accountable for the performance and results of multiple applications
- Works on issues where analysis of situations or data requires an in-depth knowledge of organizational objectives and processes.
- Experience in managing & supporting monthly/quarterly/annual Billing Cycles that are critical for the company’s financial health.
- Supporting cloud-native applications on Google Cloud Platform (GCP), with prior experience in:
  - Building automation services and instrumentation for the Observability Program
  - Implementing an application reliability strategy for Iron Mountain warehouse applications
- Experience in defining log-based metrics, monitors, and thresholds for Error Budgets, Service Level Objectives (SLOs), and Service Level Indicators (SLIs), and in creating event dashboards for cloud-native Iron Mountain applications.
- Team collaboration with a ‘no blame’ mindset is a key differentiator.
Required Experience and Skills:
- Records Center Applications and Data Management Applications that deliver records and data management capabilities such as storing, archiving, shredding, asset transfer, permanent withdrawal, etc.
- Application Sustainment Management & Global Service Delivery.
- Managing the Observability workstream, working with Engineering/Development and SRE teams to improve the availability, performance and reliability of the applications.
- Managing escalations from customers, Customer Care, Global Account Management and handling triage with technical teams
- Qualifying new work orders from customers requesting Account Consolidation and Single Sign-On capabilities, to increase revenue from sustainment services.
- Site Reliability Engineering (SRE) management
- As an Application SRE Engineer, focus on streamlining exception handlers to build and support log-based metric definitions.
- Understanding of Datadog log management and alert management features to define log-based metrics, alerts and dashboards.
- Support of applications built with Google Cloud Logging, Identity and Access Management (IAM), cloud networking and projects.
- Understanding of GitLab code repository management: roles, projects, groups, merge requests, container registry management, and reporting of DevOps metrics and analytics.
- Managing the Data Center Applications built on Linux/Windows, Apache/Tomcat & Java
- Bachelor of Science in Computer Science & Engineering (4-year degree) and 6+ years of working experience.
Preferred skills:
- Experience with Datadog, ServiceNow, and open-source options such as OpenTelemetry, Grafana Mimir, Tempo, Loki, Grafana dashboards, Prometheus and Jaeger.
- GitLab Agile: epics, features, stories, product management support, program management, boards, reporting metrics, KPIs.
- Experience as an SRE Engineer in defining application-level, log-based business metrics such as successful orders, failed shipping notices, etc.
- Experience with Google Cloud Functions, Workflows, and Google Kubernetes Engine (GKE).
- Agile SAFe certification / ITIL framework experience