Zeta Logo

Zeta

Senior Site Reliability Engineer

Reposted Yesterday
Be an Early Applicant
In-Office
Hyderabad, Telangana
Mid level
In-Office
Hyderabad, Telangana
Mid level
The Senior Site Reliability Engineer will enhance application performance through monitoring solutions, automate tasks, troubleshoot incidents, and develop scalable system management solutions.
The summary above was generated by AI
About us
 
Build the future of banking.
 
Zeta is a next-generation banking technology company providing cloud-native, fully stackable processing and core banking platforms for issuers. With a focus on scalability, compliance, and innovation, Zeta empowers financial institutions to modernize their technology infrastructure and deliver secure, seamless digital banking experiences. Our impact runs at real-world scale. Today, over 25 million cards are live on Zeta-powered platforms across 7 countries, supported by a passionate team of 1,700+ Zetanauts across India, the US, EMEA, and Asia. Backed by SoftBank Vision Fund, Mastercard, and other reputed strategic investors, we reached a valuation of $2 billion in 2025.
 
Our focus is on establishing product lines that focus on key outcomes by addressing real customer pain points, modernizing legacy systems, and strengthening core fundamentals. As a result, our systems and platforms support a wide range of banking and payments capabilities, including:
 
1. Tachyon, our cloud-native banking stack built for population-scale systems
2. Cipher, our unified authentication platform for secure, high-volume banking environments
3. Digital Credit as a Service, enabling banks to launch credit lines on UPI
4. Elena, our intelligent and conversational AI platform for banking
5. Pixel, India’s first digital-native credit card, launched in partnership with HDFC Bank, for whom we also revamped their PayZapp mobile app: Winner of the Celent Model Bank Award for Payments Innovation 2024
6. Sparrow, the leading card experience for non-prime cardholders in the US
…and more across cards, payments, lending, and core banking.
 
We are an engineering-first organization that values ownership, bias for action, and long-term thinking. Together, we solve some of the hardest problems in banking tech. Our culture is built around trust, collaboration, and creating the conditions for you to drive impact proportionate to your potential. Reinforcing our commitment to creating an inclusive and supportive workplace, we have been consistently recognized as a Great Place to Work. If you want to build cutting-edge banking tech that enables banks to serve millions reliably, securely, and at a population scale, Zeta is your playground.
 
If you would like to learn more about how we have grown and evolved over the years, watch our journey here. You can also explore our website and follow us on LinkedIn, Instagram,YouTube, and X.
 
 

Responsibilities

  • Work to understand any arising issues and overall application performance by enacting monitoring solutions.
  • Conduct consistent and thorough analysis of current systems and work to reduce the quantity of existing problems, suggesting new solutions to help upgrade & refine such systems.
  • Provide support across a broad range of areas including monitoring, processes & tools, architecture, and Root Cause Analysis.
  • Develop and maintain monitoring and alerting systems to proactively detect and resolve issues.
  • Automate routine tasks to improve system efficiency and reduce downtime.
  • Troubleshoot and resolve incidents and outages.
  • Develop and implement automation scripts and tools to improve the efficiency and effectiveness of system management tasks.
  • Identifying areas for improvement, and designing solutions that are scalable, reliable, and easy to maintain.
  • Monitoring & acting on Alerts to avoid production outages, Incidents.
  • Upkeeping of Run books for the Alerts

Skills

  • 4-6 years of sysadmin experience in handling large-scale distributed system software deployments in cloud or in an on-premises environment.
  • Strong cloud management foundation.
  • Unix shells, Python & Go programming proficiency.
  • Experience in MySQL or PostgresQL in database.
  • Outstanding teammate who can collaborate and influence in a multifaceted environment.
  • Excellent interpersonal, and written communication skills.
  • Excellent debugging and troubleshooting skills.
  • Ability to define standard operating procedures for supported platform features.
  • Experience working with observability tools and practices(Prometheus, Grafana).
  • Experience in troubleshooting and resolving incidents.
  • Cloud experience in AWS (preferred) including hands-on experience with AWS-CLI.
  • Hands-on experience in the Orchestration and Containerisation like Kubernetes, Containers.
  • Experience with CI /CD (i. e. Jenkins, ArgoCD).
  • Solid Understanding of Networking (firewall, connectivity, routing, iptables, subnet config, etc.).
  • Experience with Linux OS and Shell/Python Scripting.
  • Experience in programming with Python, Go
  • Experience with API Gateway like Kong, Nginx based systems.
  • Experience with security best practices and technologies.
  • BS degree in Computer Science or a related technical field involving coding, or equivalent practical experience

Experience and Qualification

  • 4-6 years of sysadmin experience in handling large-scale distributed system software deployments in cloud or in an on-premises environment.

Zeta is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We encourage applicants from all backgrounds, cultures, and communities to apply and believe that a diverse workforce is key to our success.

Top Skills

Api Gateway
Argocd
AWS
Go
Grafana
Jenkins
Kubernetes
MySQL
Nginx
Postgres
Prometheus
Python
Unix

Similar Jobs

4 Days Ago
Hybrid
Senior level
Senior level
Information Technology • Insurance • Software
The Senior Manager, SRE oversees reliability efforts, manages teams, implements monitoring tools, and coordinates incident responses to ensure operational excellence.
Top Skills: AWSCi/CdGitlabInfrastructure-As-CodeJenkinsOctopusdeploy
3 Days Ago
In-Office
Senior level
Senior level
Software
The Senior Site Reliability Engineer will oversee cloud product reliability, collaborate with development teams, manage deployments, and drive operational excellence in Azure environments.
Top Skills: Application InsightsArmAzure Entra IdAzure MonitorBashBicepCosmos DbDatadogInfrastructure As CodeKeycloakKqlKustoLog AnalyticsAzureMicrosoft Defender SuiteOktaOpen TelemetryPingfederatePlaywrightPostgres SqlPowershellSQLTerraform
6 Days Ago
In-Office
Senior level
Senior level
Software
As a Senior Site Reliability Engineer, you will enhance cloud native products' reliability, performance, and automation, collaborating with various teams to ensure operational excellence in Azure environments.
Top Skills: Application InsightsArmAzure Entra IdAzure MonitorBashBicepCosmos DbDatadogInfrastructure As CodeKeycloakKqlLog AnalyticsAzureMicrosoft Defender SuiteOauthOidcOktaOpen TelemetryPingfederatePlaywrightPostgres SqlPowershellSAMLSQLTerraform

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account