nomiso Logo

nomiso

SRE Architect

Posted 6 Hours Ago
Be an Early Applicant
Easy Apply
In-Office
Hyderabad, Telangana
Expert/Leader
Easy Apply
In-Office
Hyderabad, Telangana
Expert/Leader
Design and own infrastructure and observability for large-scale services. Automate operations, implement SLOs/SLIs, run incident response, perform chaos testing, mentor SREs, and drive reliability improvements across cloud and data-center environments.
The summary above was generated by AI

About Company: 

Nomiso is a product and services engineering company. We are a team of Software Engineers, Architects, Managers, and Cloud Experts with expertise in Technology and Delivery Management. 

Our mission is to Empower and Enhance the lives of our customers through simple solutions for their complex business problems. 

At NomiSo, we encourage entrepreneurial spirit - to learn, grow and improve. A great workplace thrives on ideas and opportunities. That is a part of our DNA. We’re in pursuit of colleagues who share similar passions, are nimble, and thrive when challenged. We offer a positive, stimulating, and fun environment – with opportunities to grow, a fast-paced approach to innovation, and a place where your views are valued and encouraged.

We invite you to push your boundaries and join us in fulfilling your career aspirations!


Position Overview


We are looking for a SRE Architect who will work with technology experts to design optimal solutions to requirements for our customers. This is achieved through interactive requirements gathering, determination of best fit solutions based on problem solving approaches, integrated solution design based on multiple technology types, and a strong ability to present and articulate solutions to senior members of the customer teams.


Roles and Responsibilities:

  • Own the Infrastructure, APM and work with Developers and Systems engineers to Build, Release, Monitor and run the services reliability exceeding the agreed SLAs..
  • Write software to automate API-driven tasks at scale and contribute to the product codebase in Java, JS, React, Node, Go and Python
  • Write automation to reduce toil and eliminate manual tasks that are repeatable.
  • Work with Ansible, Puppet, Chef, Terraform or another config management / orchestration suite, know where it's broken, work towards fixing them and explore new alternatives
  • Define and accelerate implementation of support processes, tools and best practices
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system reliability
  • Handle cross team performance issues from identification of the cause, determining the areas of improvement and driving those actions to closure
  • Performance and maturity baselining of Systems, tools maturity & coverage, metrics, technology and engineering practices
  • Define, Measure and improve Reliability Metrics (SLO/SLI), Observability (Monitoring, Logging-Tracing solutions), Ops process (Incident, Problem Mgmt) and streamline – automate release management. Build dashboards to provide visibility into performance of the applications. 
  • Create chaos in the production environment purposefully in a controlled manager to validate reliability of systems.
  • Mentor and coach other SREs in the organization
  • Provide written and verbal updates to executives and the stakeholders of the application in the organization.
  • Understand the current process, system setup and propose the improvements needed in the processes, and  technology so that the application exceeds the desired Service Level Objective.
  • Strong believer of automation to bring in sustained continuous improvement by automating Toil, Runbooks, improving ability of the applications to auto heal leading to improved reliability 

Must Have Skills:


The successful candidate will have the following attributes/qualifications:

  • 15+ years of experience in Development and Operations of applications/services in production that has uptime over 99.9%.  
  • 8+ years of experience as a SRE in handling applications that are web scale
  • Strong hands-on coding experience in one or more programming languages such as Python, Golang, Java, Bash, etc.
  • Good understanding of Observability (monitoring, logging, tracing, metrics), Chaos engineering concepts.
  • Proficiency in using Observability tools (example: New Relic, Datadog, etc) for monitoring, logging, tracing.
  • Expert level hands on knowledge in public cloud platform AWS and/or Google Cloud Platform. Professional level certificate on one of the public clouds is highly desirable.
  • Must have hands-on experience in using configuration management systems such as Ansible or SaltStack and infrastructure automation tools like Terraform or CloudFormation.
  • Should have used altering systems such as Pager Duty.
  • Should have implemented solutions around Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for services. Measurement should have been within a system and across systems in distributed systems 
  • Should have supported Production Incidents (PIs) on critical applications of a company. Troubleshoot, debug, and diagnose operational issues and drive them to closure. 
  • Understanding of software delivery life cycles, particularly Agile/Lean & DevOps
  • Proven experience in handling large scale and growing infrastructure across Data Centers and heterogeneous Cloud platforms
  • Experience as a service owner in managing large – geographically diverse stakeholders 
  • Ability to work with creative – fast growing engineering team and motivate them to deliver their best work
  • History of driving innovation.

Good to Have Skills:

  • Familiarity with handling:
  • Containerization – Kubernetes, Docker, Rancher, etc
  • Kafka, Yarn, ElasticSearch etc.
  • Source code management and Implementation of Security best practices. 
  • Tech Stack - Python, Falcon, Elastic Search, MongoDB, AWS (SQS S3), Map Reduce.
  • Networking knowledge
  • Understanding of software delivery life cycles, particularly Agile/Lean & DevOps
  • Contribution to open source community

Qualification: 

Master’s or Bachelor’s degree in Computer Science Engineering, or a related technical degree.

Website: https://www.nomiso.io/

Location:

Bangalore/Hyderabad

About Nomiso:

Nomiso is a product and services engineering company. We are a team of Software Engineers, Architects, Managers, and Cloud Experts with expertise in Technology and Delivery Management. 

Our mission is to Empower and Enhance the lives of our customers, through efficient solutions for their complex business problems. 

At Nomiso we encourage entrepreneurial spirit - to learn, grow and improve. A great workplace, thrives on ideas and opportunities. That is a part of our DNA. We’re in pursuit of colleagues who share similar passions, are nimble and thrive when challenged. We offer a positive, stimulating and fun environment – with opportunities to grow, a fast-paced approach to innovation, and a place where your views are valued and encouraged.

We invite you to push your boundaries and join us in fulfilling your career aspirations!

We are an equal opportunity employer and are committed to diversity, equity, and inclusion. We do not discriminate on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other protected characteristics.

Top Skills

Java,Javascript,React,Node,Go,Python,Bash,Ansible,Puppet,Chef,Terraform,Saltstack,Cloudformation,New Relic,Datadog,Aws,Google Cloud Platform,Pagerduty,Kubernetes,Docker,Rancher,Kafka,Yarn,Elasticsearch,Mongodb,Sqs,S3,Mapreduce,Falcon

Similar Jobs

5 Minutes Ago
Hybrid
Hyderabad, Telangana, IND
Senior level
Senior level
Fintech • Mobile • Payments • Software • Financial Services
Lead reviews and decisions for high-risk customers across APAC, advise 1st-line teams on KYC/CDD/EDD, own escalation criteria, monitor decision quality, report to MLROs, and drive CDD process improvements and training.
2 Hours Ago
Easy Apply
Hybrid
4 Locations
Easy Apply
Senior level
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
Perform static and dynamic malware analysis, develop detections and protections, analyze IOCs, validate AI/ML outputs, build automation tools, and publish threat research.
Top Skills: X86 Assembly,Ida,X64Dbg,Ollydbg,Windows Internals,Tcp/Ip,Ids/Ips,Yara,Python,Perl,Ruby,Genai,Machine Learning
2 Hours Ago
Remote or Hybrid
India
Junior
Junior
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Design, implement, and maintain Java/Spring Boot applications and microservices. Optimize database design and queries (Oracle/SQL), conduct code reviews and mentoring, implement CI/CD, collaborate cross-functionally, adopt developer productivity tools, and troubleshoot production issues.
Top Skills: Java,Spring Boot,Restful Apis,Oracle,Sql,Relational Databases,Git,Github,Github Copilot,Intellij Idea,Code Linters,Ci/Cd,Microservices

What you need to know about the Hyderabad Tech Scene

Because of its proximity to leading research institutions and a government committed to the city's growth, Hyderabad's tech scene is booming. With plans to establish India's first "AI city," the city is on track to become one of the world's most anticipated tech hubs, with companies like TransUnion, Schrödinger and Freshworks, among others, already calling the city home.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account