CloudRaft

Site Reliability Engineer (SRE)

Remote
Hiring Remotely in India
Mid level
The Site Reliability Engineer (SRE) will manage cloud-native infrastructure, develop CI/CD pipelines, and ensure system reliability using best practices and automation tools.
Only immediate joiners or candidates who can join within 7 days should apply.
Folks who can start working with us from 1st June 2026 will be given priority.
If you have already applied to CloudRaft in the last 90 days, we already have your CV/resume on file. Multiple applications from the same candidate will not be considered.

About CloudRaft
CloudRaft is a dynamic company specializing in advanced AI and cloud-native solutions. We foster creativity, collaboration, and innovation, enabling our team to address complex challenges and deliver exceptional results. Join us to contribute to an organization that prioritizes professional growth, operational excellence, and technological advancement.

Job Description
We seek an experienced Site Reliability Engineer (SRE) to join our team. In this role, you will scale our operations, design and maintain resilient infrastructure, and apply best practices for reliability and efficiency within our cloud-native environment.

Responsibilities
  • Manage and maintain Kubernetes clusters across cloud platforms, including OpenShift, Amazon EKS, Azure AKS, and Google GKE.
  • Implement and manage CI/CD pipelines using tools such as Jenkins, GitHub Actions, Argo CD, or GitLab CI/CD.
  • Design and maintain observability stacks with tools including Prometheus, Grafana, Loki, OpenTelemetry, and related technologies.
  • Optimize system performance and resolve production issues.
  • Implement SRE principles, including Service Level Indicators (SLIs) and Service Level Objectives (SLOs), to uphold system reliability.
  • Automate infrastructure and operational tasks using programming languages such as Go or Python, and Infrastructure as Code (IaC) tools like Terraform.
  • Apply AI skills to engineering tasks, including Vibe Coding, AIOps and automation, Prompt Engineering, and a working understanding of Large Language Models (LLMs) and AI Agents.
  • Remain current with emerging technologies, including AI, MLOps, and Edge Computing.
  • Contribute to knowledge sharing through technical writing and presentations.

Qualifications
  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 2-5 years of experience in SRE, Platform Engineering, or DevOps roles.
  • Strong expertise in Kubernetes, cloud-native technologies, and major cloud platforms (AWS, Azure, GCP).
  • Proficiency in programming with Python, Go, or Node.js.
  • Familiarity with CI/CD tools and contemporary deployment practices.
  • Knowledge of observability tools and Infrastructure as Code.
  • AI skills, including experience with Vibe Coding, AIOps and automation, and Prompt Engineering, plus an understanding of LLMs and AI Agents.
  • CKA certification (brownie points!).
  • Excellent problem-solving abilities and communication skills.
  • Inclination toward open-source contributions is advantageous.

Benefits
  • Competitive salary.
  • Premium health insurance and various health and wellness benefits.
  • Opportunity to work on cutting-edge technologies.
  • Collaborative and supportive work environment.
  • Chance to make a real impact on the company's success.
