The Site Reliability Engineer III is responsible for designing, implementing, and maintaining data streaming technologies using Kafka and other tools, ensuring security and stability in cloud environments.
Job Description
As an Experienced SRE at JPMorgan Chase within the Global Technology team, you serve as member of an agile team to design and deliver trusted market-leading data streaming technology products in a secure, stable, and scalable way.
Job Responsibilities
Required qualifications, capabilities, and skills
Preferred qualifications, capabilities, and skills
As an Experienced SRE at JPMorgan Chase within the Global Technology team, you serve as member of an agile team to design and deliver trusted market-leading data streaming technology products in a secure, stable, and scalable way.
Job Responsibilities
- Demonstrates strong knowledge of Kafka technology, Kafka connect framework and distributed infrastructure technologies with the ability to operate in and migrate across public and private clouds
- Is accountable for Installation, configuration, and maintaining Kafka Clusters including performance tuning, monitoring health and availability
- Ensures streaming infrastructure platform is adhering to the firm wide security & controls standards and addresses any drift at priority
- Applies technical expertise and problem-solving methodologies to projects of moderate scope
- Works with other platforms to architect and implement changes required to resolve issues and modernize the organization and its technology processes
- Executes creative solutions for the design, development, and technical troubleshooting for problems of moderate complexity
- Troubleshoots complex priority incidents by upstream/downstream data and systems or technical implications, able to advise on mitigation actions, while balancing the communication needs of each case.
- Adds to team culture of diversity, equity, inclusion, and respect
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and 3+ years applied experience
- Deep knowledge of one or more areas of SRE such as hardware, networking terminology, storage engineering, deployment practices, integration, automation, scaling, resilience, or performance assessments
- Deep knowledge of at least one specific messaging/streaming technology (ex. Kafka, Kinesis) and one programming/scripting language (Java, Python, Unix Shell Scripting)
- Drives to continue to develop technical and cross-functional knowledge outside of the product
- Deep knowledge of cloud infrastructure and multiple cloud technologies with the ability to operate in public and private clouds (ex. AWS, Cloud Foundry, Kubernetes
- Experience with working in a large distributed system across a range of technologies including compute, databases, messaging, observability, and telemetry
- Deep knowledge of incident, change, and problem management processes
Preferred qualifications, capabilities, and skills
- Significant programming background in any applicable language, with familiarity in Linux and development using Linux platforms.
- Knowledge and certifications in Kubernetes, Azure, GCP, Terraform, and AWS, along with experience in public cloud platforms like AWS, GCP, and Azure.
- Proficiency in automation technologies such as Puppet and Terraform, as well as database platforms and SQL.
- Experience with CICD processes and technologies (e.g., Jenkins, Jira, Git) and authentication and authorization technologies (e.g., OAuth, Kerberos).
Top Skills
AWS
Azure
Cloud Foundry
GCP
Git
Java
Jenkins
JIRA
Kafka
Kubernetes
Puppet
Python
Terraform
Unix Shell Scripting
Similar Jobs at JPMorganChase
Financial Services
As a Site Reliability Engineer III, you will improve applications and infrastructure, implement automated deployment, resolve complex issues, and adopt site reliability practices.
Top Skills:
.NetDatadogDockerDynatraceEcsGitlabGrafanaJavaJenkinsKubernetesPrometheusPythonSplunkSpring BootTerraform
Financial Services
As an SRE III, you'll lead reliability initiatives, mentor engineers, and manage incidents while enhancing application stability using data analytics.
Top Skills:
.NetDatadogDockerDynatraceEcsGitlabGrafanaJavaJenkinsKubernetesPrometheusPythonSplunkSpring BootTerraform
Financial Services
As a Site Reliability Engineer III, you will optimize applications and infrastructure, implement best practices, and enhance reliability through collaboration.
Top Skills:
.NetDatadogDockerDynatraceEcsGitlabGrafanaJavaJenkinsKubernetesPrometheusPythonSplunkSpring BootTerraform
What you need to know about the Pune Tech Scene
Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.