OpenTable Logo

OpenTable

Site Reliability Engineer II

Reposted 3 Days Ago
Easy Apply
Remote
Hiring Remotely in India
Senior level
Easy Apply
Remote
Hiring Remotely in India
Senior level
The Site Reliability Engineer II role involves supporting OpenTable's container infrastructure, collaborating on projects, mitigating incidents, and participating in on-call duties.
The summary above was generated by AI

This job is 100% remote across India.

About OpenTable: With millions of diners, 60,000+ restaurant partners and 25+ years of experience, OpenTable, part of Booking Holdings, Inc. (NASDAQ: BKNG), is an industry leader with a passion for helping restaurants thrive. Our world-class technology empowers restaurants to focus on what matters most – their team, their guests, and their bottom line – while enabling diners to discover and book the perfect restaurant for every occasion.

Every employee at OpenTable has a tangible impact on what we do and how we do it. You’ll also be part of a global team and its portfolio of metasearch brands. Hospitality is all about taking care of others, and it defines our culture.

 

Site Reliability Engineer (SRE)

The Serving Platforms team is a DevOps organization integral to OpenTable’s Infrastructure Engineering group. We are responsible for the entire lifecycle of the container stack that powers OpenTable business applications. We are a group of engineers with skills across Kubernetes, Service Mesh, Puppet, Networking, Linux, DNS, and Kafka. To provide a platform-as-a-service, we automate our processes using in-house and third-party tools.  We value reliability, efficiency, and security while providing service to our customers, the OpenTable application engineering group.       

We provide the following services across the company

  • Administering our container platform (Kubernetes/Istio)
  • Config management infrastructure administration (Puppet, Ansible)
  • SSL certificate management
  • Vault and Consul 
  • CDN & Message Bus (Akamai, Kafka)
  • Cloud service operation (AWS)
  • Various security initiatives

About This Role

The Site Reliability Engineer II is responsible for the health, tuning, and lifecycle of the underlying Linux OS and the Kubernetes control plane in bare-metal/self-hosted environments, and supports OpenTable’s development and production container infrastructure. In this role, you will work with multiple engineering teams across the globe as an SME for Kubernetes and other technologies owned by the Serving Platforms team. You will participate in high-impact projects, work closely with other team members, support team priorities, and help foster good communication with stakeholders. You can expect to help build greenfield projects, mitigate infrastructure incidents, and participate in on-call rotation. Our infrastructure is self-hosted in our data centers.  You will get maximum low-level exposure and experience.  We're seeking an exceptional individual to join our team.

About You

You love working in a small, agile, highly productive, and focused environment. You enjoy building automation and self-service tools. You are curious and like learning.  Picking up new languages or skills and sharing your findings with others is second nature to you. You’re detail-oriented, enjoy writing code, and implement DevOps principles via automation.  You aren't satisfied with a tool "just working"; you want to understand how it interacts with the kernel, manages memory, and uses the network stack. You like to create repeatable processes that do not require human babysitting.  When asked, you always have an opinion or can quickly form one.  You seek to understand multiple perspectives and points of view and find the optimal solution for everyone. 

Does this sound like something you'd excel at? If so, keep reading.

Required Experience:

  • Minimum 5+ years of hands-on Linux experience (Ubuntu, CentOS, Etc.) with deep expertise in Linux internals, including kernel tuning (sysctl), process management (cgroups/namespaces), and filesystem optimization. You should be comfortable troubleshooting performance bottlenecks at the system call level.
  • Understanding of systems administration concepts and patterns, including Linux networking (IPtables/NFTables, routing, load balancing) and custom package management. You should understand how to build, wrap, and deploy software at the OS level (RPM/DEB) beyond just Dockerizing it.
  • Experience and proficiency with scripting languages such as GoLang, Python, Ruby, Perl, or Bash
  • Proven experience building and maintaining Kubernetes clusters from the ground up (the "Hard Way"). Expertise in managing the K8s control plane, etcd clusters, and CNI plugins in a non-managed, self-hosted environment
  • Experience in incident response and root cause analysis of service disruptions
  • 3+ years of experience with config management tools such as Puppet, Chef, Ansible, or SaltStack
  • Ability to quickly learn new technologies, frameworks, and architectures, as well as participate in technical conversations with external stakeholders and your team
  • Experience with operating messaging systems such as Kafka or RabbitMQ in production

Nice to have:

  • Understanding or experience with cloud computing - AWS, GCE, Azure
  • Familiarity with CI/CD Pipelines using tools like GitHub, Artifactory, CircleCI, Jenkins, TeamCity, Docker registry, etc.
  • Experience working with K/V stores such as Zookeeper, Redis, etcd, or Consul in production
  • Experience with virtualization technologies such as Proxmox, VMware, ESX, Xen, and OpenStack
  • Experience working with monitoring and alerting systems such as Sensu, Graphite, Prometheus, and Nagios
  • Applied knowledge of working and communicating with a globally distributed team
  • Experience with Windows Server OSs

Benefits and Perks: 

  • Work from (almost) anywhere for up to 20 days per year
  • Focus on mental health and well-being
  • Company-paid therapy sessions through SpringHealth
  • Company-paid subscription to Headspace
  • Annual company-wide week off a year - the whole team fully recharges (and returns without a pile-up of work!)
  • Paid parental leave
  • Generous paid vacation + time off for your birthday
  • Paid volunteer time
  • Focus on your career growth
  • Development Dollars
  • Leadership development
  • Access to thousands of on-demand e-learnings
  • Travel Discounts
  • Employee Resource Groups
  • Quarterly team offsite
  • Tax optimisation options
  • Generous health insurance
  • Pension fund

Work Environment & Flexibility
At OpenTable, we pride ourselves on fostering a global and dynamic work environment. As a team member with us, you will benefit from a schedule tailored to accommodate a global workforce operating across multiple time zones. While the majority of your responsibilities may align with conventional business hours, there will be instances where you are expected to manage communications - via calls, Slack messages, or emails - outside of regular working hours to effectively collaborate with international colleagues, respond to restaurant partners, and/or address urgent matters. OpenTable will always abide by and consider local laws and regulations.

Inclusion
We’re committed to creating a workplace where everyone feels they belong and can thrive. We know the best ideas come when we bring different voices to the table, so we're building a team as dynamic as the diners and restaurants we serve—and fostering a culture where everyone feels welcome to be themselves.If you need accommodations during the application or interview process, or on the job, we’re here to support you. Please reach out to your recruiter to request any accommodations.

Top Skills

Ansible
AWS
Bash
Docker
Go
Kafka
Kubernetes
Linux
Perl
Puppet
Python
Ruby

Similar Jobs

5 Days Ago
Easy Apply
Remote
India
Easy Apply
Mid level
Mid level
Food • Mobile
Provide 24/7 support for OpenTable's global data platforms, focusing on database operations, maintaining high availability, backups, and performance optimization.
Top Skills: CloudwatchDockerGitGitGoGrafanaKubernetesMongoDBPostgresPrometheusPuppetPythonRedisShell BashSQL Server
18 Days Ago
Easy Apply
Remote
India
Easy Apply
Senior level
Senior level
Natural Language Processing • Software • Conversational AI
Maintain and scale cloud-native platform infrastructure: manage Kubernetes (GKE/EKS), build Terraform modules, standardize deployments with Helm, implement GitLab CI/CD, enhance observability (Prometheus/Grafana/Datadog), automate tooling in Python/Go/Shell, participate in on-call rotation, perform RCAs, and collaborate with development teams to improve reliability.
Top Skills: Api GatewayArgocdAWSDatadogEksFluxGCPGitlab Ci/CdGkeGoGrafanaHelmIstioKubernetesLinkerdLinuxPagerdutyPrometheusPythonServicenowShellTerraform
18 Days Ago
Remote
India
Mid level
Mid level
Fintech • Real Estate • Software
Automate infrastructure as code, evolve deployment pipelines, improve observability/SLOs, troubleshoot production incidents, enable secure scalable AWS/Kubernetes environments, mentor regional engineers, and support global developer experience.
Top Skills: Aws,Postgresql,Document Databases,Kubernetes,Helm,Docker,Docker-Compose,Tilt,Github Actions,Argocd,Gitops,Ansible,Terraform,Cloudformation,Cdk,Python,Typescript,Service Meshes,Prometheus,New Relic,Datadog,Cloudwatch,Elk,Splunk,Cspm,Cdr,Cwpp,Siem

What you need to know about the Pune Tech Scene

Once a far-out concept, AI is now a tangible force reshaping industries and economies worldwide. While its adoption will automate some roles, AI has created more jobs than it has displaced, with an expected 97 million new roles to be created in the coming years. This is especially true in cities like Pune, which is emerging as a hub for companies eager to leverage this technology to develop solutions that simplify and improve lives in sectors such as education, healthcare, finance, e-commerce and more.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account