S&P Global

Sr Data Scientist – NLP, LLM and GenAI

New York, New York, United States

Compensation: Not Specified
Experience Level: Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Financial Services

Employment Type: Full time

Position Overview:

S&P is a leader in risk management solutions that leverage automation and AI/ML. This role is a unique opportunity for hands-on ML scientists and NLP/Gen AI/LLM scientists to take the next step in their careers, applying their technical expertise in NLP, deep learning, Gen AI, and LLMs to drive business value for multiple stakeholders while conducting cutting-edge applied research on LLMs, Gen AI, and related areas.

Grade Level (for internal use): 10

Responsibilities:

ML, Gen AI, NLP, LLM Model Development:

  • Design and develop custom ML, Gen AI, NLP, and LLM models for batch and stream processing-based AI/ML pipelines.
  • Model components will include data ingestion, preprocessing, search and retrieval, Retrieval Augmented Generation (RAG), NLP/LLM model development, fine-tuning, and prompt engineering.
  • Ensure the solution meets all technical and business requirements.
  • Work closely with other members of data science, MLOps, and technology teams in the design, development, and implementation of the ML model solutions.

ML, NLP, LLM Model Evaluation:

  • Work closely with other data science team members to develop, validate, and maintain robust evaluation solutions and tools to evaluate model performance, accuracy, consistency, and reliability during development and UAT.
  • Implement model optimizations to improve system efficiency.

NLP, LLM, Gen AI Model Deployment:

  • Work closely with the MLOps team for the deployment of machine learning models into production environments, ensuring reliability and scalability.

Internal Collaboration:

  • Collaborate closely with product teams, business stakeholders, MLOps, machine learning engineers, and software engineers to ensure smooth integration of machine learning models into production systems.

Documentation:

  • Write and maintain comprehensive documentation of ML modeling processes and procedures for reference and knowledge sharing.

Develop Models Based on Standards and Best Practices:

  • Ensure that models are designed and developed in line with the standards, governance, and best practices for ML model development specified by senior Data Science and MLOps leads.

Assist in Problem Solving:

  • Troubleshoot complex issues related to machine learning model development and data pipelines and develop innovative solutions.

What We’re Looking For:

  • Bachelor’s/Master’s degree in Computer Science, Mathematics or Statistics, Computational Linguistics, Engineering, or a related field.
  • 1+ years of professional hands-on experience leveraging large sets of structured and unstructured data to develop data-driven tactical and strategic analytics and insights using ML, NLP, and computer vision solutions.
  • 1+ years of demonstrated hands-on experience with Python, Hugging Face, TensorFlow, Keras, PyTorch, Spark, or similar tools. Expert in Python programming.
  • 1+ years of hands-on experience developing natural language processing (NLP) models, ideally with transformer architectures.
  • 1+ years of experience with implementing information search and retrieval at scale, using a range of solutions from keyword search to semantic search using embeddings.
  • Knowledge of developing or tuning Large Language Models (LLMs) and Generative AI (GAI).
  • Knowledge of NLP, LLMs (extractive and generative), fine-tuning, and LLM model development.
  • Familiarity with higher-level trends in LLMs and open-source platforms.

Nice to have:

  • Experience contributing to GitHub and open-source initiatives, working on research projects, or participating in Kaggle competitions.

About S&P Global Ratings:

At S&P Global Ratings, our analyst-driven credit ratings, research, and sustainable finance opinions provide critical insights that are essential to translating complexity into clarity so market participants can uncover opportunities and make decisions with conviction. By bringing transparency to the market through...

Skills

NLP
Large Language Models
GenAI
Deep Learning
ML Pipelines
Model Fine-tuning
Prompt Engineering
Model Evaluation
Model Deployment
MLOps
Data Ingestion
Data Preprocessing
Search and Retrieval
Retrieval Augmented Generation (RAG)

S&P Global

Provides financial information and analytics services

About S&P Global

S&P Global provides financial information and analytics to a wide range of clients, including investors, corporations, and governments. The company offers services such as credit ratings, market intelligence, and indices, which help clients understand and navigate the global financial market. S&P Global's products work by utilizing advanced data analytics and research to deliver insights that assist clients in making informed decisions and managing risks. Unlike many competitors, S&P Global has a diverse range of divisions, including S&P Global Ratings and S&P Dow Jones Indices, which allows it to cater to various financial needs. The company's goal is to support clients in driving growth while also committing to corporate responsibility and positive societal impact.

Headquarters: New York City, New York
Year Founded: 1917
Company Stage: IPO
Industries: Data & Analytics, Financial Services
Employees: 10,001+

Benefits

Health Insurance
Unlimited Paid Time Off
Professional Development Budget
401(k) Company Match
Family Planning Benefits
Employee Discounts

Risks

Integration challenges with new acquisitions like ProntoNLP may cause operational issues.
Increased competition from AI-driven platforms like Brooklyn Investment Group.
Dependence on volatile credit ratings market could impact revenue stability.

Differentiation

S&P Global integrates advanced AI tools for superior financial analytics capabilities.
The company offers comprehensive ESG solutions, meeting growing sustainability demands.
S&P Global's diverse divisions provide a wide range of financial services globally.

Upsides

Acquisition of ProntoNLP boosts data analytics and sentiment scoring capabilities.
Rising demand for ESG data enhances S&P Global's market position.
Expansion into India strengthens S&P Global's research and insights offerings.
