Manage cloud infrastructure, develop CI/CD pipelines, and support Kubernetes stacks across AWS, GCP, and Azure. Collaborate with teams to enhance cloud operations.
Are you ready to join a fun, nimble team that thrives on collaboration and innovation?
At Azul, we are dedicated to advancing our technology and infrastructure, and we are looking for passionate individuals to be part of our journey. As a member of our team, you will have the opportunity to work alongside talented Engineers who are committed to building and maintaining a secure and high-performance cloud infrastructure.
What You'll Do (aka the Responsibilities)
- Manage connectivity within and across multiple cloud providers (AWS, GCP, and Azure)
- Support Cloud and on-premise Kubernetes stacks
- Design and implement IT infrastructure
- Develop and support CI/CD pipelines
- Develop and support observability and alerting infrastructure
- Work with a team of Cloud Operations Engineers to help build and maintain the systems and code that allow us to provide an always-available, secure, and performant cloud infrastructure
- Work with internal Engineering Teams to support the deployment and monitoring of their products
- Automate monitoring of cloud infrastructure using OpenTelemetry, Prometheus, Grafana, and other observability tools
- Deploy and provision new cloud infrastructure using automation tools such as Terraform, ArgoCD, Helm, Ansible, and boto3 (Python)
- Develop automated remediation for system faults to remove points of failure in cloud infrastructure
- Evaluate and make recommendations about stacks, tooling, and engineering best practices
What You'll Bring (aka Education and Experience)
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent work experience.
- 5+ years of experience in a DevOps or Site Reliability Engineering (SRE) role, with a proven track record of managing large-scale infrastructure.
- Linux proficiency
- Familiarity with OpenStack
- A strong understanding of networking and the ability to diagnose network issues (BGP, IPsec, VXLAN, Geneve, 802.1Q, etc.).
- Expertise in AWS, Azure, GCP, and cloud-native technologies.
- In-depth knowledge of CI/CD tools (Jenkins, GitLab CI, ArgoCD, etc.) and best practices.
- Experience with infrastructure-as-code tools, such as Terraform, CloudFormation, Ansible, etc.
- Experience with containerization (Docker) and orchestration tools (Kubernetes, OpenShift).
- Familiarity with observability tools (OpenTelemetry, Prometheus, Grafana, Loki, ELK, Splunk, etc.).
- Proficiency in scripting and programming languages, e.g. Python, Bash, Go, and Rust.
- Experience with microservices architecture.
- Familiarity with serverless technologies.
- Certifications in relevant cloud platforms (e.g., AWS Certified DevOps Engineer, Azure DevOps Engineer).
- Knowledge of infrastructure security practices (e.g., IAM, security groups, etc.).
What You'll Bring (aka Skills)
- Ability to adapt to different teams and priorities and to juggle multiple tasks
- Confidence in decision-making to pursue company goals
- A desire to learn and continually develop and expand your skillset
- Curiosity
- Excellent problem-solving skills and the ability to troubleshoot complex issues in production environments.
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.
What We Offer
- Equity Program - be part of the company's success.
- Annual bonus based on company performance.
- Referral Program - earn referral bonuses and bring your colleagues.
- IT Equipment - MacBook Pro or any other hardware according to your preferences.
- Work-life balance - generous holidays, sick time, flexible working hours, 100% work from home also possible.
- Most importantly, you will work with top experts worldwide who contribute to the Java ecosystem!
Top Skills
Ansible
ArgoCD
AWS
Azure
Bash
Boto3
Docker
GCP
GitLab CI
Go
Grafana
Jenkins
Kubernetes
OpenStack
OpenTelemetry
Prometheus
Python
Rust
Terraform