Morningstar Logo

Morningstar

Data Scientist

Posted 2 Days Ago
Hybrid
Navi Mumbai, Thane, Maharashtra
Mid level
Hybrid
Navi Mumbai, Thane, Maharashtra
Mid level
The Data Scientist at PitchBook will leverage machine learning, deep learning, and natural language processing to analyze vast data sets and derive insights that enhance user experience on the PitchBook Platform. Responsibilities include developing models, improving data decision-making processes, and collaborating with teams to integrate AI solutions while ensuring efficient project completion.
The summary above was generated by AI

At PitchBook, we are always looking forward. We continue to innovate, evolve, and invest in ourselves to bring out the best in everyone. We're deeply collaborative and thrive on the excitement, energy, and fun that reverberates throughout the company.
Our extensive learning programs and mentorship opportunities help us create a culture of curiosity that pushes us to always find new solutions and better ways of doing things. The combination of a rapidly evolving industry and our high ambitions means there's going to be some ambiguity along the way, but we excel when we challenge ourselves. We're willing to take risks, fail fast, and do it all over again in the pursuit of excellence.
If you have a good attitude and are willing to roll up your sleeves to get things done, PitchBook is the place for you.
About the Role:
As a member of the Product and Engineering team at PitchBook, you will be part of a team of big thinkers, innovators, and problem solvers who strive to deepen the positive impact we have on our customers and our company every day. We value curiosity and the drive to find better ways of doing things. We thrive on customer empathy, which remains our focus when creating excellent customer experiences through product innovation.
We know that greatness is achieved through collaboration and diverse points of view, so we work closely with partners around the globe. As a team, we assume positive intent in each other's words and actions, value constructive discussions, and foster a respectful working environment built on integrity, growth, and business value. We invest heavily in our people, who are eager to learn and constantly improve. Join our team and grow with us!
PitchBook's Data Science and Machine Learning team has a clear mission: leverage cutting-edge machine learning and cloud technologies to automatically research private markets and improve the navigability of our platform. The Data Scientist is responsible for using machine learning, deep learning, GenAI and natural language processing (NLP) to collect a high-volume of data for the PitchBook Platform and surface insights for our users. This role requires a strong desire to learn and deliver value through scalablemachine learning/AI systems. A strong motivation to succeed is critical and everyone has the opportunity to shape the long-term direction of our team.
Primary Job Responsibilities:

  • Leverage a sound understanding of data to improve decision-making, and deliver insights through various data analysis techniques, including advanced skills in acquiring, processing, and wrangling data from multiple sources using tools and techniques like ETL, APIs, and data visualization.


  • Apply various modeling techniques, while fusing data sources using pre-processing methods like transformation and normalization including employing data cleaning techniques for both structured and unstructured data, conducting exploratory data analysis, and extracting insights to inform business decisions through iterative exploration and hypothesis testing.


  • Source additional information and solutions through research and relevant libraries, including assisting in applying best practice model fit testing, tuning, and validation techniques to assess model performance, considering data attributes like dataset size and partitioning.


  • Develop and optimize machine learning & deep learning models such as regression, classifiers, ensemble techniques (bagging, boosting, stacking etc.), transformers based Open Source LLMs and Chat GPT including prompt engineering for NLP tasks such as Summarization, Sentiment Analysis, Information Extraction and collaborate with cross-functional teams to integrate solutions into products, besides staying updated with AI/ML advancements especially in GenAI.


  • Proficiency in advanced statistical analysis and methods like regression analysis, time series analysis, and hypothesis testing.


  • Creatively solve problems and develop effective solutions, along with an understanding of business goals and industry trends to align data projects with strategic objectives.


  • Strong verbal and written communication skills to convey complex data insights to stakeholders, combined with efficient project management to ensure timely and within-budget completion.


  • Familiarity and practical skills with various data science and analytics tools, programming skills in languages like Python, and the ability to interpret moderately complex scripts.


Skills and Qualifications:

  • Bachelor's/Master's degree in Computer Science, Machine Learning, Statistics, or related fields


  • 2+ years of experience building predictive models (Classical ML models, Neural Network based Deep Learning models, Transformer based GenAi models etc.) using Python in a production environment


  • 2+ years of demonstrated experience with natural language processing (NLP)


  • 2+ years of demonstrated experience with Python


  • Demonstrate experience using complex SQL queries for data extraction and transformation


  • Strong communication and data presentation skills


  • Strong problem-solving ability


  • Ability to communicate complex analysis in a clear, precise, and actionable manner


  • Experience working closely with software development team(s) is a plus


  • Experience building scalable systems in production is a plus


  • Knowledge of financial services, investment, banking, venture capital, or private equity is desired but not required


Working Conditions
This employee will work in a standard office setting. Some flexibility with work-from-home is a possibility. Employees in this position use a computer on an ongoing basis throughout the day.
I10_MstarIndiaPvtLtd Morningstar India Private Ltd. (Delhi) Legal Entity
Morningstar's hybrid work environment gives you the opportunity to work remotely and collaborate in-person each week. We've found that we're at our best when we're purposely together on a regular basis, at least three days each week. A range of other benefits are also available to enhance flexibility as needs change. No matter where you are, you'll have tools and resources to engage meaningfully with your global colleagues.

Top Skills

Python
SQL

Similar Jobs at Morningstar

Be an Early Applicant
17 Days Ago
Navi Mumbai, Thane, Maharashtra, IND
Hybrid
12,700 Employees
Mid level
12,700 Employees
Mid level
Enterprise Web • Fintech • Financial Services
The Senior Data Scientist will design and develop AI/ML solutions, promote best practices in object-oriented designs, and ensure continuous integration. This role requires expertise in Python and data science principles, as well as experience in AWS services, with responsibilities spanning data cleaning, modeling, and deployment.
Be an Early Applicant
12 Hours Ago
Navi Mumbai, Thane, Maharashtra, IND
Hybrid
12,700 Employees
Entry level
12,700 Employees
Entry level
Enterprise Web • Fintech • Financial Services
The Associate Quantitative Analyst will work with the rating team, conducting quantitative analysis and managing project deliverables in the structured finance domain, specifically in residential mortgage-backed securities. Responsibilities include performing cash flow analysis, database management, and enhancing analytical tools while collaborating with operations and technology teams.
Be an Early Applicant
4 Days Ago
Navi Mumbai, Thane, Maharashtra, IND
Hybrid
12,700 Employees
Mid level
12,700 Employees
Mid level
Enterprise Web • Fintech • Financial Services
As a Machine Learning Engineer, you will develop features and services using Python, applying machine learning and NLP techniques to analyze data and provide insights. Collaborating with product and engineering teams, you will automate processes for data management and stay updated on industry trends to contribute to best practices.

What you need to know about the Hyderabad Tech Scene

Because of its proximity to leading research institutions and a government committed to the city's growth, Hyderabad's tech scene is booming. With plans to establish India's first "AI city," the city is on track to become one of the world's most anticipated tech hubs, with companies like TransUnion, Schrödinger and Freshworks, among others, already calling the city home.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account