Jump Trading Group

Site Reliability Engineer

Posted 4 Days Ago
In-Office
Mumbai, Maharashtra
Senior level
As a Site Reliability Engineer, you'll manage production environments, monitor trading systems, implement improvements, and mentor other SREs.
The summary above was generated by AI

Jump Trading Group is committed to world-class research. We empower exceptional talents in Mathematics, Physics, and Computer Science to seek scientific boundaries, push through them, and apply cutting-edge research to global financial markets. Our culture is unique. Constant innovation requires fearlessness, creativity, intellectual honesty, and a relentless competitive streak. We believe in winning together and unlocking unique individual talent by incenting collaboration and mutual respect. At Jump, research outcomes drive more than superior risk-adjusted returns. We design, develop, and deploy technologies that change our world, fund start-ups across industries, and partner with leading global research organizations and universities to solve problems.

Production Infrastructure is a global organization of Engineers who architect, build, and maintain our world-class infrastructure. The team runs a global operation to monitor and troubleshoot, reliably deploy changes to our production environment, and build the orchestration, configuration management, and monitoring automation for the production trading system. This role will require deep technical and operational knowledge across all areas of the trading platform in order to proactively monitor and troubleshoot our trading system, deploy changes to our production environment while minimizing operational risk, and implement tools and processes to drive continuous improvement. This team works with traders, operations, exchanges, and developers to optimize the trading environment and investigate and solve system issues.

What You'll Do:

  • Own the production environment, driving performance, reliability, and operability through continuous improvement
  • Proactively monitor and troubleshoot large-scale trading systems and exchange connectivity
  • Build and maintain the DevOps toolkit for the production trading system, including configuration management, process management, deployment, monitoring, data collection, and analysis
  • Leverage firm-wide metrics to improve scalability and system performance
  • Collaborate across the technology organization to analyze and troubleshoot complex system problems
  • Work closely with Risk Management and Operational Trading Support teams to coordinate changes and manage incidents
  • Interact directly with traders to communicate and drive technology changes, manage incidents, and troubleshoot problems
  • Work with the Clearing team to reconcile trades and position breaks
  • Assess and manage the operational risk of changes to the production environment
  • Define and document processes and procedures
  • Provide mentorship and cross training to other technical operations SREs
  • Other duties as assigned or needed

Skills You'll Need:

  • Degree in Computer Science, a related field, or equivalent professional experience
  • At least 5 years of relevant work experience in an IT ops role, such as DevOps, SRE, Linux Systems Engineering, or Network Engineering
  • At least 3 years of experience in Python and shell scripting
  • Familiarity with C++ is helpful but not required
  • A rigorous, detail-oriented approach to operations
  • Strong understanding of the Linux operating system, including network and system configuration, kernel internals, scheduling, and performance tuning
  • Strong understanding of networking concepts such as routing, multicast, LLDP, VLAN tagging, and Ethernet
  • A deep sense of ownership and urgency
  • Ability to handle shared operational and periodic on-call duties
  • Reliable and predictable availability

If you are currently a student or recent graduate, please see our Campus postings which offer both Summer and Full-Time opportunities.

Top Skills

C++
Python
Shell Scripting


