JPMorganChase Logo

JPMorganChase

Lead Site Reliability Engineer

Sorry, this job was removed at 02:13 p.m. (IST) on Tuesday, Sep 09, 2025
Be an Early Applicant
Hybrid
Hyderabad, Telangana
Hybrid
Hyderabad, Telangana

Similar Jobs at JPMorganChase

53 Minutes Ago
Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Financial Services
The Lead Site Reliability Engineer leads initiatives to enhance application reliability and stability, mentors engineers, and manages major incidents while applying data-driven analytics to improve service levels.
Top Skills: .NetDatadogDockerDynatraceEcsGitlabGrafanaJava Spring BootJenkinsKubernetesPrometheusPythonSplunkTerraform
53 Minutes Ago
Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Financial Services
The Lead Site Reliability Engineer at JPMorgan Chase leads site reliability initiatives, mentors engineers, and improves application reliability and stability using best practices and data-driven analytics.
Top Skills: .NetDatadogDockerDynatraceEcsGitlabGrafanaJava Spring BootJenkinsKubernetesPrometheusPythonSplunkTerraform
53 Minutes Ago
Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Financial Services
As a Lead Site Reliability Engineer, you will lead reliability initiatives, mentor engineers, solve technical issues, and drive performance improvements across applications and platforms.
Top Skills: .NetDatadogDockerDynatraceEcsGitlabGrafanaJava Spring BootJenkinsKubernetesPrometheusPythonSplunkTerraform
Job Description
Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.
As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking Team, you will take the lead in conducting resiliency design reviews, break down complex problems into manageable tasks for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to your fellow engineers.
Job responsibilities
  • Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team
  • Leads initiatives to improve the reliability and stability of your team's applications and platforms using data-driven analytics to improve service levels
  • Collaborates with team members to identify comprehensive service level indicators and stakeholders to establish reasonable service level objectives and error budgets with customers
  • Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology-related bottlenecks in your areas of expertise
  • Acts as the main point of contact during major incidents for your application and demonstrates the skills to identify and solve issues quickly to avoid financial losses
  • Documents and shares knowledge within your organization via internal forums and communities of practice

Required qualifications, capabilities, and skills
  • Formal training or certification on site reliability Engineering concepts and 5+ years applied experience
  • Expertise in application development and support with multiple technologies and design techniques.
  • Experience in developing AI/ML solutions using public cloud architecture, specifically Azure and AWS and experience in Python for AI/ML modeling.
  • Experience in automation and continuous delivery methods.
  • Familiarity with agile methodologies, including CI/CD, application resiliency, and security.
  • Experience in implementing GenAI services using Azure OpenAI models and AWS Bedrock service. Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices with the ability to implement these practices within an application or platform
  • Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
  • Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
  • Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.)
  • Experience with troubleshooting common networking technologies and issues
  • Ability to identify and solve problems related to complex data structures and algorithms

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account