OpenTable

Site Reliability Engineer II

Reposted 3 Days Ago

Remote

Hiring Remotely in India

Senior level

The Site Reliability Engineer II role involves supporting OpenTable's container infrastructure, collaborating on projects, mitigating incidents, and participating in on-call duties.

The summary above was generated by AI

This job is 100% remote across India.

About OpenTable: With millions of diners, 60,000+ restaurant partners and 25+ years of experience, OpenTable, part of Booking Holdings, Inc. (NASDAQ: BKNG), is an industry leader with a passion for helping restaurants thrive. Our world-class technology empowers restaurants to focus on what matters most – their team, their guests, and their bottom line – while enabling diners to discover and book the perfect restaurant for every occasion.

Every employee at OpenTable has a tangible impact on what we do and how we do it. You’ll also be part of a global team and its portfolio of metasearch brands. Hospitality is all about taking care of others, and it defines our culture.

Site Reliability Engineer (SRE)

The Serving Platforms team is a DevOps organization integral to OpenTable’s Infrastructure Engineering group. We are responsible for the entire lifecycle of the container stack that powers OpenTable business applications. We are a group of engineers with skills across Kubernetes, Service Mesh, Puppet, Networking, Linux, DNS, and Kafka. To provide a platform-as-a-service, we automate our processes using in-house and third-party tools. We value reliability, efficiency, and security while providing service to our customers, the OpenTable application engineering group.

We provide the following services across the company:

Administering our container platform (Kubernetes/Istio)
Config management infrastructure administration (Puppet, Ansible)
SSL certificate management
Vault and Consul
CDN & Message Bus (Akamai, Kafka)
Cloud service operation (AWS)
Various security initiatives

About This Role

The Site Reliability Engineer II is responsible for the health, tuning, and lifecycle of the underlying Linux OS and the Kubernetes control plane in bare-metal/self-hosted environments, and supports OpenTable’s development and production container infrastructure. In this role, you will work with multiple engineering teams across the globe as an SME for Kubernetes and other technologies owned by the Serving Platforms team. You will participate in high-impact projects, work closely with other team members, support team priorities, and help foster good communication with stakeholders. You can expect to help build greenfield projects, mitigate infrastructure incidents, and participate in the on-call rotation. Our infrastructure is self-hosted in our own data centers, so you will get extensive low-level exposure and experience. We're seeking an exceptional individual to join our team.

About You

You love working in a small, agile, highly productive, and focused environment. You enjoy building automation and self-service tools. You are curious and like learning. Picking up new languages or skills and sharing your findings with others is second nature to you. You’re detail-oriented, enjoy writing code, and implement DevOps principles via automation. You aren't satisfied with a tool "just working"; you want to understand how it interacts with the kernel, manages memory, and uses the network stack. You like to create repeatable processes that do not require human babysitting. When asked, you always have an opinion or can quickly form one. You seek to understand multiple perspectives and points of view and find the optimal solution for everyone.

Does this sound like something you'd excel at? If so, keep reading.

Required Experience:

5+ years of hands-on Linux experience (Ubuntu, CentOS, etc.) with deep expertise in Linux internals, including kernel tuning (sysctl), process management (cgroups/namespaces), and filesystem optimization. You should be comfortable troubleshooting performance bottlenecks at the system call level.
Understanding of systems administration concepts and patterns, including Linux networking (iptables/nftables, routing, load balancing) and custom package management. You should understand how to build, wrap, and deploy software at the OS level (RPM/DEB) beyond just Dockerizing it.
Experience and proficiency with programming and scripting languages such as Go, Python, Ruby, Perl, or Bash
Proven experience building and maintaining Kubernetes clusters from the ground up (the "Hard Way"). Expertise in managing the K8s control plane, etcd clusters, and CNI plugins in a non-managed, self-hosted environment
Experience in incident response and root cause analysis of service disruptions
3+ years of experience with config management tools such as Puppet, Chef, Ansible, or SaltStack
Ability to quickly learn new technologies, frameworks, and architectures, as well as participate in technical conversations with external stakeholders and your team
Experience with operating messaging systems such as Kafka or RabbitMQ in production

Nice to have:

Understanding or experience with cloud computing - AWS, GCE, Azure
Familiarity with CI/CD Pipelines using tools like GitHub, Artifactory, CircleCI, Jenkins, TeamCity, Docker registry, etc.
Experience working with K/V stores such as Zookeeper, Redis, etcd, or Consul in production
Experience with virtualization technologies such as Proxmox, VMware, ESX, Xen, and OpenStack
Experience working with monitoring and alerting systems such as Sensu, Graphite, Prometheus, and Nagios
Applied knowledge of working and communicating with a globally distributed team
Experience with Windows Server OSs

Benefits and Perks:

Work from (almost) anywhere for up to 20 days per year
Focus on mental health and well-being
Company-paid therapy sessions through Spring Health
Company-paid subscription to Headspace
Annual company-wide week off - the whole team fully recharges (and returns without a pile-up of work!)
Paid parental leave
Generous paid vacation + time off for your birthday
Paid volunteer time
Focus on your career growth
Development Dollars
Leadership development
Access to thousands of on-demand e-learnings
Travel Discounts
Employee Resource Groups
Quarterly team offsite
Tax optimisation options
Generous health insurance
Pension fund

Work Environment & Flexibility
At OpenTable, we pride ourselves on fostering a global and dynamic work environment. As a team member with us, you will benefit from a schedule tailored to accommodate a global workforce operating across multiple time zones. While the majority of your responsibilities may align with conventional business hours, there will be instances where you are expected to manage communications - via calls, Slack messages, or emails - outside of regular working hours to effectively collaborate with international colleagues, respond to restaurant partners, and/or address urgent matters. OpenTable will always abide by and consider local laws and regulations.

Inclusion
We’re committed to creating a workplace where everyone feels they belong and can thrive. We know the best ideas come when we bring different voices to the table, so we're building a team as dynamic as the diners and restaurants we serve, and fostering a culture where everyone feels welcome to be themselves. If you need accommodations during the application or interview process, or on the job, we’re here to support you. Please reach out to your recruiter to request any accommodations.

Top Skills: Ansible, AWS, Bash, Docker, Kafka, Kubernetes, Linux, Perl, Puppet, Python, Ruby
