Mastercard Logo

Mastercard

Lead Site Reliability Engineer

Posted 2 Days Ago
Be an Early Applicant
Hybrid
Pune, Maharashtra
Senior level
Hybrid
Pune, Maharashtra
Senior level
The role involves leading operations for enterprise storage platforms, focusing on Software Defined Storage. Responsibilities include collaborating on storage solutions, improving infrastructure availability, mentoring staff, and ensuring compliance.
The summary above was generated by AI

Our Purpose

Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.

Title and Summary

Lead Site Reliability Engineer

==
Lead Site Reliability Engineer (Storage) – Pune, India
Our Purpose
We work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team – one that makes better decisions, drives innovation and delivers better business results.

Role Summary
We’re seeking a Lead Site Reliability Engineer to advance our SRE capabilities across enterprise storage platforms, with a focus on Software Defined Storage (Ceph). This role involves managing and leading software defined CEPH storage (Object, Block, File) efforts, building automation and monitoring solutions, improving infrastructure availability, collaborating across global teams

Key Responsibilities
• Lead day-to-day operations of Mastercard’s enterprise storage platforms.
• Represent the storage team in project meetings, offering technical support and guidance.
• Collaborate with internal teams to understand monitoring and automation needs.
• Design and implement storage solutions using tools like Ansible, Bitbucket, CHEF, Jenkins.
• Administer and maintain enterprise monitoring tools in a multi-tier storage environment.
• Troubleshoot issues across networking, Linux/Unix systems, and applications.
• Maintain documentation for all solutions and processes.
• Lead and mentor team of engineers and drive cross-training efforts.
• Participate in disaster recovery planning and yearly audits
• Continuously learn and integrate emerging technologies.
• Lead vulnerability management, patching and compliance efforts

About You
• Proven experience resolving complex availability issues through automation and monitoring.
• Self-starter with minimal need for supervision.
• Comfortable working with geographically distributed teams.
• Strong expertise in UNIX/Red Hat Linux, Ceph storage, and networking/security.
• Proficient in scripting languages like Python and Bash.
• Hands-on experience with Grafana, Prometheus, HAProxy, and Pacemaker.
• Ability to identify and automate repetitive tasks.
• Strong analytical and problem-solving skills.
• Excellent communication, documentation, and time management abilities.
• Experience leading workstreams and mentoring technical talent.
• Familiarity with ITSM processes, incident/change management, and vendor coordination.
• Willingness to provide 3rd-line out-of-hours operational support.

Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
• Abide by Mastercard’s security policies and practices.
• Ensure the confidentiality and integrity of the information being accessed.
• Report any suspected information security violation or breach, and
• Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.

Corporate Security Responsibility


All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:

  • Abide by Mastercard’s security policies and practices;

  • Ensure the confidentiality and integrity of the information being accessed;

  • Report any suspected information security violation or breach, and

  • Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.




Top Skills

Ansible
Bash
Bitbucket
Ceph
Chef
Grafana
Haproxy
Jenkins
Linux/Unix
Pacemaker
Prometheus
Python

Mastercard Pune, Mahārāshtra, IND Office

Poona Club Road, Pune, Maharashtra, India, 411001

Similar Jobs

20 Days Ago
Hybrid
Mumbai, Maharashtra, IND
Expert/Leader
Expert/Leader
Financial Services
The Lead Software Engineer focuses on site reliability, collaborates with teams to implement SRE practices, and mentors engineers to enhance system resilience and observability.
Top Skills: C++DatadogDynatraceGrafanaJavaOpen TelemetryPrometheusPythonSplunk
23 Days Ago
In-Office
Pune, Maharashtra, IND
Senior level
Senior level
Cloud • Security • Software • Cybersecurity
The Senior Site Reliability Engineer will guide teams in reliability engineering, promote observability, mentor engineers, and enhance the performance of Veeam's cloud infrastructure.
Top Skills: C#GoGrafanaJavaJavaScriptKubernetesNode.jsOpentelemetryPrometheusPulumiTerraformTypescript
13 Days Ago
In-Office
Pune, Maharashtra, IND
Mid level
Mid level
AdTech • Digital Media • Healthtech • Marketing Tech • Analytics
The Site Reliability Engineer will manage production systems focusing on reliability and performance, automate processes, and maintain cloud services.
Top Skills: AirflowAnsibleAWSBashCi/CdDockerGitGrafanaHelmKubernetesMySQLPrometheusPythonRedisTerraform

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account