The Cloud Systems Administrator will be responsible for managing and maintaining our AWS (Amazon Web Services) cloud infrastructure. This role involves ensuring the availability, performance, and security of our cloud systems and providing support and troubleshooting for cloud-related issues. The ideal candidate will have a strong background in AWS and cloud system administration, with a proactive approach to problem-solving and a passion for continuous improvement.
Key Responsibilities:
- Manage and maintain our AWS cloud infrastructure, ensuring optimal performance and reliability.
- Monitor system performance and availability, implementing measures to improve efficiency and prevent downtime.
- Perform regular system maintenance, updates, and patches to ensure security and stability.
- Troubleshoot and resolve cloud-related issues, providing timely support to internal teams on Web servers, Virtual deployments, databases, and other hosting infrastructure that supports web application services.
- Implement and manage backup and recovery processes for cloud-based data and applications.
- Collaborate with the development and operations teams to design and implement scalable and secure cloud solutions.
- Develop and maintain documentation for cloud systems, configurations, and procedures.
- Ensure compliance with relevant security standards and best practices.
- Assist in automating routine tasks and processes to improve efficiency.
- Manage and maintain datacenter infrastructure hosted on Rackspace, ensuring optimal performance and reliability.
- Monitor system performance and availability, implementing measures to improve efficiency and prevent downtime.
- Perform regular system maintenance, updates, and patches to ensure security and stability.
- Troubleshoot and resolve datacenter-related issues, providing timely support to internal teams.
- Stay up to date with the latest AWS services and technologies and recommend improvements to existing systems.
- Perform Kubernetes maintenance and support the Kubernetes ecosystem; including upgrading/migrating Istio service mesh.
Qualifications:
- Highly skilled in working with Kubernetes, service mesh implementations, GitOps tools (ArgoCD, Flux), and knowledge of connecting Kubernetes with APM and secrets management (ESO).
- Proven experience as a Systems Administrator, Cloud Administrator, or similar role with a focus on AWS and Linux-based deployments.
- Exceptional diagnostic skills to identify and resolve complex system issues efficiently.
- Analytical thinking and a methodical approach to troubleshooting system and network problems.
- Ability to perform root cause analysis and implement preventive measures to avoid recurrence.
- Strong knowledge of AWS services and tools (e.g., EC2, S3, RDS, CloudFormation, Lambda).
- Experience with infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation).
- Strong working knowledge of Linux distributions (CentOS, Ubuntu, RedHat), including system boot processes, file systems, and user/group management.
- Skill in creating and managing cron jobs for automated task scheduling.
- Proficiency in scripting and automation (e.g., Python, Bash, PowerShell).
- Familiarity with containerization and orchestration technologies (e.g., Docker, Kubernetes) is a plus.
- Solid understanding of networking concepts and protocols.
- Excellent troubleshooting and problem-solving skills.
- Effective communication and teamwork abilities.
- AWS certifications (e.g., AWS Certified SysOps Administrator, AWS Certified Solutions Architect) are a plus.
- Diagnosing and resolving issues with AWS services and infrastructure.
- Use of AWS Support and documentation for problem-solving.
- Familiarity with common troubleshooting tools and techniques.
- In-depth knowledge of Linux distributions (e.g., Ubuntu, CentOS, Red Hat, Debian).
- Proficiency in using the Linux command line interface (CLI).
Core Required Skills:
- EC2 (Elastic Compute Cloud)
- S3 (Simple Storage Service)
- RDS (Relational Database Service)
- VPC (Virtual Private Cloud)
- IAM (Identity and Access Management)
- Lambda
- CloudFormation
- CloudWatch
- Understanding of VPCs, subnets, routing tables, and security groups
- Knowledge of DNS services, such as Route 53
- Familiarity with VPNs
- CLI: Bash, grep, awk, find, and other key commands.
Bounteous Hyderabad, Telangana, IND Office
Floor 4, Survey Numbers: 27/1, 27/2, 27/3 and 27/4, Fairfield by Marriott No: 2 , Nanakramguda, Gachibowli, Hyderabad, Telangana, India, 500032
What you need to know about the Hyderabad Tech Scene
Because of its proximity to leading research institutions and a government committed to the city's growth, Hyderabad's tech scene is booming. With plans to establish India's first "AI city," the city is on track to become one of the world's most anticipated tech hubs, with companies like TransUnion, Schrödinger and Freshworks, among others, already calling the city home.