About Checkmate
Checkmate is a restaurant technology solution provider that has continually evolved. We started in 2017 by integrating third-party platforms with restaurants' POS systems. At that time, there were multiple third-party platforms, such as GrubHub, UberEats, DoorDash, Postmates, Caviar, and even Amazon!
This was the photograph that started it all!
Since then, we have added multiple products to our portfolio, primarily first-party ordering solutions such as web and app ordering, kiosks, and catering. We have recently moved into three exciting new products: Digital Menu Boards, Phone Ordering AI, and Drive-Thru AI. We are a core part of the restaurant technology ecosystem and keep adding digital solutions that help restaurant brands increase their sales.
Our revolutionary enterprise menu management system, Everyware, unlocks the potential of menus and how they can be customized for each individual digital channel. As you can see, this is a company that continually evolves and adapts, and today we power digital ordering solutions for some of the largest brands in the world.
We have been called the "north star of vendors" because we truly believe that technology is just one way we serve our customers; it is not the whole of what we offer. Service is a big component of what we provide, and every single team member here believes in it. We are doing a lot of exciting things, including applying AI in our products and systems, using experimentation at scale to determine what works for our clients, and using ML to analyze and productize the massive amount of data we have. Each individual here makes a difference and a valuable contribution. Key traits here are ownership and drive. Join us if you think you have them.
We're looking for an experienced Senior DevOps Engineer to join our infrastructure and platform team. As part of a small, collaborative group, you'll take shared ownership of the systems that keep our high-traffic, consumer-facing platform running reliably, securely, and efficiently.
This is not a role for someone who wants to impose a new vision or rebuild from scratch. We have established patterns, tooling, and conventions that work—and we need someone who can learn them quickly, operate confidently within them, and over time contribute thoughtful improvements that make the whole team better. You'll bring a strong technical foundation, a practical mindset, and the communication skills to collaborate effectively across engineering, product, and leadership.
What You'll Do
Infrastructure Provisioning & Management
- Design, implement, and maintain cloud infrastructure on AWS using Terraform and Ansible, following existing conventions and extending them thoughtfully.
- Manage and support AWS services across our stack including EC2, ECS, RDS, S3, IAM, VPC, CloudFront, and related services.
- Maintain and improve infrastructure-as-code practices, ensuring consistency, reproducibility, and auditability across environments.
- Participate in capacity planning and cost optimization, identifying opportunities to improve resource efficiency without compromising reliability.
CI/CD & Deployment
- Build, maintain, and improve CI/CD pipelines (GitHub Actions or equivalent) to support reliable, automated delivery across development, staging, and production environments.
- Work with engineering teams to improve build speed, deployment safety, and rollback capabilities.
- Support blue/green and canary deployment strategies as appropriate for our platform needs.
Reliability & Incident Response
- Participate in on-call rotation and own production incidents end-to-end — from detection through root cause analysis, resolution, and post-mortem.
- Use observability tooling (Datadog, CloudWatch, or equivalent) to monitor system health, establish alerting thresholds, and proactively surface issues before they impact customers.
- Contribute to runbooks, incident documentation, and process improvements that reduce mean time to resolution over time.
Security & Compliance
- Apply security best practices across infrastructure — IAM policy scoping, secrets management, network segmentation, vulnerability patching, and access controls.
- Support compliance and audit requirements by maintaining clear documentation and ensuring infrastructure changes are tracked and reviewable.
Collaboration & Continuous Improvement
- Work closely with the senior engineer on the team to learn existing systems deeply and contribute to architectural improvements over time.
- Proactively identify areas for improvement — tooling, automation gaps, manual processes, reliability risks — and raise them constructively with the team.
- Document infrastructure clearly so that other engineers can understand and operate the systems they depend on.
Requirements
- 8+ years of professional DevOps, infrastructure, or platform engineering experience in production environments.
- Hands-on proficiency with Terraform for infrastructure provisioning — writing modules, managing state, and working across environments.
- Deep familiarity with AWS — including compute (EC2, ECS), storage (S3, RDS), networking (VPC, Route 53, CloudFront), and IAM.
- Experience with Ansible for configuration management and automation across server fleets or container environments.
- Strong understanding of CI/CD principles and hands-on experience building or maintaining pipelines (GitHub Actions, GitLab CI, CircleCI, or equivalent).
- Experience with Linux system administration, shell scripting (Bash), and general infrastructure debugging.
- Demonstrated ability to work within an established infrastructure — understanding existing design decisions, following conventions, and improving incrementally rather than replacing wholesale.
- Solid grasp of security fundamentals: IAM least-privilege, secrets management, network access controls, and patching hygiene.
- Strong written and verbal communication skills in English — able to collaborate asynchronously across time zones and document work clearly.
- BSc in Computer Science, Engineering, or a related field — or equivalent professional experience.
- Must be comfortable working from 2 PM to 11 PM IST.
Nice to Have
- Experience with container orchestration and managed Kubernetes services, particularly in cloud environments.
- Familiarity with observability tooling such as Datadog, Prometheus, Grafana, or New Relic.
- Experience with database operations on AWS RDS — backups, replication, failover, and performance tuning.
- Familiarity with Redis, Sidekiq, or background job infrastructure in production environments.
- Exposure to AI/ML infrastructure or familiarity with deploying and serving ML models — not required today, but increasingly relevant as our platform evolves.
- Experience with cost allocation, tagging strategies, and AWS cost optimization practices.
- Background in Agile or Scrum environments.
- Familiarity with Microsoft Azure, including core compute, networking, and deployment services.
- Experience with cloud cost management and FinOps practices — including tagging strategies, cost allocation, rightsizing, Reserved Instance or Savings Plan management, and working with tools like AWS Cost Explorer or equivalent.



