Design, build, and maintain scalable Databricks data pipelines using Python (and optionally Java). Implement ETL/ELT, CI/CD with GitHub, deploy on Kubernetes/AWS, monitor via Splunk/Grafana, and contribute to RAG/GenAI pipelines integrating structured and unstructured data.
Requisition Number: 2367324
Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.
We are seeking a highly skilled Data Engineer with strong hands-on experience in building data pipelines on Databricks using Python. The role involves working on modern data platforms, cloud-native technologies, and contributing to advanced GenAI use cases such as RAG implementations.
Primary Responsibilities:
Builder Responsibilities:
Design, develop, and deploy AI-powered solutions using no-code, low-code, and advanced platforms, translating business needs into scalable applications that enhance products, workflows, and decision-making.
- Design, develop, and maintain scalable data pipelines on Databricks
- Write efficient and optimized Python code for data engineering workflows
- Build and manage ETL/ELT processes for large-scale datasets
- Deploy and manage applications using Kubernetes and AWS services
- Set up and maintain CI/CD pipelines using GitHub
- Ensure code quality through unit testing (JUnit or equivalent frameworks)
- Monitor systems using Splunk and Grafana for logging and observability
- Collaborate with cross-functional teams including data scientists and AI engineers
- Contribute to RAG-based pipelines integrating structured and unstructured data
- Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so
Required Qualifications:
- Solid experience in:
  - Databricks and Apache Spark
  - Developing data pipelines using Python
- Hands-on experience with AWS (S3, EC2, Lambda, etc.)
- Experience with Kubernetes (containerization & orchestration)
- Experience implementing CI/CD pipelines using GitHub
- Experience with monitoring/logging tools such as Splunk and Grafana
- Knowledge of testing frameworks (JUnit or similar)
- Solid understanding of data engineering best practices and distributed systems
Preferred Qualifications:
- Undergraduate degree or equivalent experience
- Experience in RAG (Retrieval-Augmented Generation) implementations
- Experience in developing data pipelines on Databricks using Python and Java
- Experience in RAG implementations using vector databases such as MongoDB
- Experience with or knowledge of Kubernetes and AWS, CI/CD pipelines (GitHub), JUnit, Splunk, and Grafana
- Hands-on experience with vector databases (MongoDB - Vector Search preferred)
- Exposure to LLMs and GenAI architectures
- Knowledge of handling unstructured datasets for AI/ML use cases
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone, of every race, gender, sexuality, age, location and income, deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes, an enterprise priority reflected in our mission.
Optum Hyderabad, Telangana, IND Office
Hyderabad, India

