OpenTable

Site Reliability Engineer II

Sorry, this job was removed at 03:03 p.m. (IST) on Thursday, Sep 19, 2024
Remote

With millions of diners, tens of thousands of restaurants, and 23+ years of experience, OpenTable, part of Booking Holdings, Inc. (NASDAQ: BKNG), is an industry leader with a unique insight into the world of hospitality. We champion restaurants, bars, wineries, and other venues around the world, helping them attract guests, manage capacity, improve operations and maximize revenue.

Every employee at OpenTable has a tangible impact on what we do and how we do it. You’ll also be part of a global network that includes OpenTable and KAYAK's portfolio of travel brands including Swoodoo, checkfelix, momondo, Cheapflights, Mundi and HotelsCombined.

Hospitality is all about taking care of others, and it defines our culture. You’ll work in a welcoming and inclusive environment, and get the benefits, flexibility, and support you need to succeed.

In this role, you will:

  • Design, implement, and maintain multiple observability systems in OpenTable's on-prem data centers.
  • Be responsible for the uptime of logging and metrics systems that hold hundreds of terabytes of data across multiple regions.
  • Be customer-focused, and help OpenTable's global engineering teams get the most value out of our metrics and logs.
  • Switch context between different systems when troubleshooting.
  • Communicate with peers around the world in different time zones over Zoom and Slack.
  • Collaborate with engineering and product leadership to define priorities and set delivery goals.
  • Be a member of a 12-hour on-call rotation that changes over weekly.
  • Be encouraged to automate in order to reduce the number of emergency calls.

Please apply if you have:

  • Good verbal communication and written documentation skills.
  • Proven experience in a DevOps or related role.
  • Excitement to learn new technologies and stacks.
  • Solid understanding of DNS, TCP/IP, Linux server administration (CentOS, RHEL, or Ubuntu), and shell scripting (Bash, Zsh).
  • Experience supporting services across VMs, Docker Containers, and Kubernetes.
  • Experience with coding in a programming language such as Python, Ruby, or Golang.
  • An understanding of how to use vendor-provided REST APIs to automate tasks.
  • Experience using configuration management tools such as Puppet, Chef, or Ansible.
  • Experience with automation tools such as Terraform or Pulumi is desirable.
  • 4+ years of experience with the Elastic Stack (Elasticsearch, Logstash, Kibana) or with metrics backends such as Prometheus, Graphite, or VictoriaMetrics.

Benefits:

  • Paid Vacation
  • One Celebration Day per calendar year
  • Focus on mental health and well-being
  • Company-wide weeks off a year - the whole team fully recharges (and returns without a pile-up of work!)
  • Generous paid parental leave
  • Focus on your career growth
  • Work from (almost) anywhere: wherever you do your best work
  • Employee Assistance Program (EAP)
  • Pension Fund

Diversity, Equity, and Inclusion

OpenTable aspires to be a workplace that reflects the diverse communities we serve and a culture that is inclusive and welcoming. Hiring people with different backgrounds, experiences, perspectives, and ideas is critical to innovation and to how we deliver great experiences for our users and our partners. Representation matters.

We ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform job responsibilities, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
