Centific

Senior Site Reliability Engineer

Posted Yesterday

Be an Early Applicant

In-Office

Hyderabad, Telangana, IND

Senior level

In-Office

Hyderabad, Telangana, IND

Senior level

The Senior Site Reliability Engineer will enhance system reliability and oversee observability, incident response, and operational excellence on Azure, collaborating across teams.

The summary above was generated by AI

About Centific

Centific is a frontier AI data foundry that curates diverse, high-quality data, using our purpose-built technology platforms to empower the Magnificent Seven and our enterprise clients with safe, scalable AI deployment. Our team includes more than 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We harness the power of an integrated solution ecosystem—comprising industry-leading partnerships and 1.8 million vertical domain experts in more than 230 markets—to create contextual, multilingual, pre-trained datasets; fine-tuned, industry-specific LLMs; and RAG pipelines supported by vector databases. Our zero-distance innovation™ solutions for GenAI can reduce GenAI costs by up to 80% and bring solutions to market 50% faster.

Our mission is to bridge the gap between AI creators and industry leaders by bringing best practices in GenAI to unicorn innovators and enterprise customers. We aim to help these organizations unlock significant business value by deploying GenAI at scale, helping to ensure they stay at the forefront of technological advancement and maintain a competitive edge in their respective markets.

About Job

- Role Summary
  We are hiring a Senior Site Reliability Engineer (SRE) with strong expertise in observability and solid DevOps experience to support and scale our cloud‑native platform on Microsoft Azure.
  This role focuses on improving system reliability, monitoring and alerting, incident response, and operational excellence, while partnering closely with Product, DevOps, and Engineering teams.
  Key Responsibilities
- Design and operate end‑to‑end observability (metrics, logs, and traces) for microservices.
- Define and improve alerting strategies, focusing on noise reduction and actionable alerts.
- Drive reliability outcomes through SLIs/SLOs, monitoring standards, and operational readiness.
- Lead and participate in production incident response, perform root cause analysis (RCA), and drive preventive improvements.
- Collaborate on CI/CD pipelines, release safety, and automation for operational workflows.
- Mentor engineers and contribute to knowledge sharing, including runbooks, best practices, and documentation.
- Required Skills & Experience
- 5+ years of experience in SRE, DevOps, or Production Engineering roles.
- Hands‑on experience with observability tools such as Prometheus, Grafana, OpenTelemetry, Azure Monitor, or similar (required).
- Strong incident management experience, including on‑call support and RCAs.
- Nice to Have
- Experience operating systems in cloud environments, including cloud architecture design and security best practices.
- Experience with performance tuning, capacity planning, and cost optimization.
- Development background with the ability to read, understand, and debug application code to support production issue investigation
- What We’re Looking For
- A senior, hands‑on engineer with strong ownership and problem‑solving skills.
- Strong communication skills and a cross‑team collaboration mindset.
- Passion for reliability, observability, and automation.
- Hands‑on experience with Azure services, including AKS, PostgreSQL, MySQL, Azure Storage Accounts, and Azure networking.
- DevOps experience, including CI/CD pipeline setup and maintenance.
- Proficiency in automation and scripting (e.g., Python, Bash, PowerShell).
- Experience collaborating across distributed and global teams.

Centific is an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, ancestry, citizenship status, age, mental or physical disability, medical condition, sex (including pregnancy), gender identity or expression, sexual orientation, marital status, familial status, veteran status, or any other characteristic protected by applicable law. We consider qualified applicants regardless of criminal histories, consistent with legal requirements.

Top Skills

Azure Monitor

Bash

Ci/Cd

Grafana

Azure

Opentelemetry

Powershell

Prometheus

Python

Similar Jobs

Zuora

Senior Site Reliability Engineer

9 Days Ago

In-Office

Senior level

Fintech • Internet of Things • Payments • Software

The Senior Site Reliability Engineer at Zuora will lead reliability architecture, design AI-driven automation, and enhance cloud infrastructure while mentoring other engineers.

Top Skills: AWSJenkinsKafkaKubernetesLinuxMicroservicesPuppetPythonTerraform

Tyk

Senior Site Reliability Engineer

20 Days Ago

In-Office or Remote

India

Senior level

Cloud • Software

The Senior Site Reliability Engineer at Tyk will optimize and maintain cloud platforms, enhance automation, and ensure high reliability across systems while collaborating cross-functionally and driving continuous improvements.

Top Skills: AWSEksGoGrafanaHelmKubernetesLinuxMongoDBPrometheusPythonRedisTerraform

LexisNexis

Senior Site Reliability Engineer

18 Hours Ago

In-Office

Senior level

Information Technology • Legal Tech • Professional Services • Analytics • Business Intelligence

The Senior Site Reliability Engineer I ensures system reliability, collaborates on deployment strategies, and promotes DevOps and site reliability best practices.

Top Skills: Amazon Web ServicesBashDockerEcsGitlabGrafanaJenkinsKubernetesPowershellPrometheusPythonTerraform

What you need to know about the Hyderabad Tech Scene

Because of its proximity to leading research institutions and a government committed to the city's growth, Hyderabad's tech scene is booming. With plans to establish India's first "AI city," the city is on track to become one of the world's most anticipated tech hubs, with companies like TransUnion, Schrödinger and Freshworks, among others, already calling the city home.