Xenon Seven Logo

Xenon Seven

Site Reliability Engineer (SRE)-Mobile and Internet Platform

Posted 4 Days Ago
Be an Early Applicant
Remote
6 Locations
Mid level
Remote
6 Locations
Mid level
The Site Reliability Engineer ensures the stability, performance, and reliability of banking applications, implementing automation and monitoring solutions, providing 24/7 support, and optimizing infrastructure.
The summary above was generated by AI
Description

About us:

Where elite tech talent meets world-class opportunities!

At Xenon7, we work with leading enterprises and innovative startups on exciting, cutting-edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources allows us to partner with clients on transformative initiatives, driving innovation and business growth. Whether it's empowering global organizations or collaborating with trailblazing startups, we are committed to delivering advanced, impactful solutions that meet today’s most complex challenges.

About the Client:

Join one of Egypt’s premier financial institutions, renowned for its extensive suite of banking services, including Institutional Banking, Personal Banking, and Islamic Banking. With a global presence through over 50 branches and correspondents, we serve a diverse and dynamic clientele. As we embark on a groundbreaking digital transformation journey, we are committed to leveraging the latest technologies to establish a state-of-the-art data architecture that will redefine our performance and service delivery.


RequirementsPosition Overview

The Site Reliability Engineer (SRE) is responsible for ensuring the stability, performance, and reliability of Bank's critical applications, particularly Mobile Banking and Internet Banking platforms. This role bridges development and operations teams, implementing automation solutions, monitoring system health, and providing 24/7 operational support to maintain seamless banking services for customers on on-premise infrastructure.

Key Responsibilities

·       Monitor and maintain the reliability and performance of Mobile Banking and Internet Banking applications using Prometheus and Grafana dashboards

·       Manage and support OpenShift/Kubernetes infrastructure for containerized banking applications on on-premise servers

·       Respond to and resolve production incidents with minimal mean time to resolution (MTTR)

·       Implement and maintain centralized logging solutions using ELK Stack (Elasticsearch, Logstash, Kibana) for application troubleshooting

·       Develop and execute runbooks and automation scripts to reduce manual operational toil in OpenShift environments

·       Provide 24/7 production support and on-call rotation for critical banking services

·       Analyze logs and metrics from Prometheus and EFK to identify performance bottlenecks and reliability issues

·       Conduct root cause analysis (RCA) on incidents and implement preventive measures

·       Optimize Kubernetes/OpenShift deployments, pod management, and resource allocation on-premise

·       Implement alerting strategies and threshold management in Prometheus and Grafana

·       Support infrastructure scaling, capacity planning, and load balancing in production environments

·       Implement security best practices and compliance requirements for financial systems in containerized environments

·       Manage on-premise data center infrastructure and server resources

·       Document operational procedures, troubleshooting guides, and create knowledge base articles


Qualifications

·       BSc in Computer Science, Information Technology, Software Engineering, or related field

·       2+ years of hands-on experience in SRE, DevOps, or Production Engineering roles

·       Hands-on experience supporting production applications in Kubernetes/OpenShift environments

·       Strong experience with OpenShift container platform administration and troubleshooting on on-premise infrastructure

·       Proficiency with Prometheus for metrics collection and monitoring

·       Proficiency with Grafana for dashboard creation and visualization

·       Experience with ELK Stack (Elasticsearch, Logstash, Kibana) for centralized logging

·       Strong understanding of Linux/Unix operating systems and networking fundamentals

·       Practical experience with CI/CD tools and automation frameworks

·       Proficiency in at least one programming/scripting language (Python, Go, or Bash)

·       Experience with database management (SQL and NoSQL) on-premise

·       Excellent troubleshooting and analytical skills for production support

·       Strong communication skills and ability to work in cross-functional teams

·       Experience in 24/7 production support environments

·       Experience with on-premise data center infrastructure management

·       Previous experience in financial services or banking sector is a plus

Top Skills

Bash
Elasticsearch
Elk Stack
Go
Grafana
Kibana
Kubernetes
Logstash
NoSQL
Openshift
Prometheus
Python
SQL

Similar Jobs

4 Hours Ago
Easy Apply
Remote or Hybrid
India
Easy Apply
Mid level
Mid level
Consumer Web • HR Tech
As an Applied AI Engineer, you will design intelligent systems, develop chat-based UIs, and work on AI workflows using LLMs and orchestration tools.
Top Skills: APIsLangchainLlm ApisOpenaiPythonSQLTransformers
4 Hours Ago
Remote or Hybrid
India
Expert/Leader
Expert/Leader
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Lead the design and development of payment solutions on the OpenWay Way4 platform, guiding engineers in an Agile environment and optimizing SQL/PLSQL performance.
Top Skills: PlsqlSQLWay4
4 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Team Leader manages the review and investigation of Group Life claims, interprets policy provisions, ensures compliance, and supports team performance through coaching.
Top Skills: AccurintBiosCalligoCdfEdcsGlif ProductionGroupfactsExcelMicrosoft WordNetviewWorkdesk

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account