Pfizer Logo

Pfizer

Senior Manager, Site Reliability Engineer (SRE)

Posted Yesterday
Be an Early Applicant
Hybrid
Chennai, Tamil Nadu
Senior level
Hybrid
Chennai, Tamil Nadu
Senior level
The Senior Manager of Site Reliability Engineering leads the Hosting Operations team, ensuring infrastructure reliability, scalability, and performance. Responsibilities include mentorship, process improvement, automation, stakeholder engagement, incident management, and monitoring while applying SRE principles across technology domains.
The summary above was generated by AI
ROLE SUMMARY
The IT Operations and Global Command Center organization delivers excellence in the pursuit of breakthroughs that change patients' lives through industry-leading infrastructure operations performance. . We ensure optimal performance of global application hosting and network services that power Pfizer's business processes. We strive to revolutionize service dependability by applying advanced analytics to drive predictive detection, identifying potential issues with our services and intervening before they disrupt our business. We place data at the heart of what we do and apply a relentless focus on continuous improvement to enable Pfizer's business processes and patient outcomes.
We are seeking a highly-skilled and experienced Site Reliability Engineer (SRE) Hosting Operations team. This role will be accountable for the reliability, scalability, and performance of our Hosting infrastructure services for all Pfizer business units globally. This includes Server, Storage, Data Protection, Database, Middleware, HCI operations. The successful candidate will apply SRE principles to drive operational excellence, continuous improvement, and deliver tangible business outcomes that support the organization's strategic goals.
ROLE RESPONSIBILITIES
  • Act as the primary point of guidance the India Hosting SRE team, providing mentorship, coordinating shift coverage, and acting as a reliable escalation point to ensure operational continuity across domains and regions.
  • Foster a culture of reliability and continuous improvement by proactively identifying operational gaps, supporting peer development, and reinforcing best practices across the Hosting SRE team in India
  • Strong hands-on support across two technology domains listed below:

Virtualization platforms ( VMware, HCI )
Operating Systems (RedHat, Windows)
If you don't meet every requirement but have relevant experience in similar technologies, and can demonstrate the ability to succeed in this role, we encourage you to apply
  • Identify areas for improvement & automation opportunities by using data analysis skills and develop proactive solutions to enhance system reliability & reduce toil (manual effort).
  • Automate everything: Build and maintain IaC/CM ( Ansible etc.) and scripting (PowerShell, Bash, Python) to administer, patch, and configure VMs, HCI stack, servers end‑to‑end.
  • Be comfortable leveraging AI tools and platforms to enhance operational efficiency, with a proactive mindset to rethink and transform traditional workflows through intelligent automation and innovation.
  • Stakeholder Engagement: Act as a point of contact for technical and audit queries from internal and external stakeholders, ensuring timely and accurate responses that reflect deep system understanding and compliance awareness.
  • Lead root cause analysis (RCA) events including assisting the addressing identified corrective actions and service/process improvements with an SRE mindset.
  • Ensure strong observability across Hosting infrastructure by developing effective monitoring and alerting, enabling predictive operations in partnership with the Command Center. Act as an escalation point for L2/L3 teams on complex issues, resolving tickets within SLA in coordination with clients.
  • Cross-Functional Readiness: While domain expertise is required, the role requires a flexible mindset and readiness to support adjacent domains such as Unix , Database, Storage etc . This ensures operational continuity and resilience across Hosting SRE function.
  • Provide technical leadership for the Hosting domain during major incident response by actively participating & managing on-call and shift rotations, working closely with the Command Center to ensure timely resolution and maintain infrastructure reliability.

BASIC QUALIFICATIONS
  • Bachelor's degree in a technical field or equivalent practical experience
  • 10 year+ of experience in HCI & Virtualization administration / engineering roles
  • Demonstrate strong leadership and communication skills by taking ownership in driving projects and technical issues, as well as mentoring junior team members.
  • Solid experience with the following technologies: HCI Technologies ( VxRail, vSan, VMware) , Operating systems knowledge. Certifications in any of the following areas. Hosting technologies (DB, Mid-Tier, OS), AI, Observability or Cloud are considered a plus
  • Development skills in one of the programming language such as : Java, Python, C/C#/C++, PowerShell, GitHub, CI/CD with Coding best practices to design, develop, and maintain tools and scripts for system monitoring, automation, and troubleshootingHosting technologies.
  • Strong data literacy and analytical skills to interpret system metrics, identify trends, and support predictive operations
  • Knowledge of configuration management and infrastructure as code tools like Ansible, Terraform, and others.

PHYSICAL/MENTAL REQUIREMENTS
Data Literacy - the ability to analyze, interpret and use data to provide actionable insights
NON-STANDARD WORK SCHEDULE, TRAVEL OR ENVIRONMENT REQUIREMENTS
  • Occasional travel (
  • Willingness to work in split shifts (morning/evening) and participate in on-call rotations to support the 24/7 nature of the Operations environment

Work Location Assignment: Hybrid
Pfizer is an equal opportunity employer and complies with all applicable equal employment opportunity legislation in each jurisdiction in which it operates.
Information & Business Tech

Top Skills

Ansible
C/C#/C++
Ci/Cd
Git
Hci Technologies (Vxrail
Java
Powershell
Python
Redhat
Terraform
Vmware)
Vsan
Windows

Similar Jobs at Pfizer

9 Hours Ago
Hybrid
3 Locations
Senior level
Senior level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The Senior Manager, DevOps Engineer oversees AI/ML platforms, defines DevOps best practices, manages CI/CD pipelines, and ensures infrastructure security and reliability.
Top Skills: AWSBashCloudFormationCloudwatchDockerDynatraceGithub ActionsGrafanaPythonSnykSonarqubeTerraform
9 Hours Ago
Hybrid
3 Locations
Mid level
Mid level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The DevOps Engineer will manage AWS infrastructure, CI/CD pipelines, security compliance, and collaborate with development teams to optimize AI platforms.
Top Skills: AWSBashCloudFormationCloudwatchDockerDynamoDBDynatraceEcrEcsGithub ActionsGrafanaLambdaPythonS3SnykSonarqube
2 Days Ago
Hybrid
Chennai, Tamil Nadu, IND
Senior level
Senior level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The SRE Manager will oversee hosting infrastructure reliability, drive automation, handle cross-functional team coordination, and lead incident responses, ensuring optimal performance across global operations.
Top Skills: AnsibleC/C#/C++Data ProtectionEnterprise StorageGitJavaOperating SystemsPowershellPythonTerraform

What you need to know about the Hyderabad Tech Scene

Because of its proximity to leading research institutions and a government committed to the city's growth, Hyderabad's tech scene is booming. With plans to establish India's first "AI city," the city is on track to become one of the world's most anticipated tech hubs, with companies like TransUnion, Schrödinger and Freshworks, among others, already calling the city home.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account