ML PhD Intern - LLMs & Generative AI at Truveta

Seattle, Washington, United States

Truveta Logo
Not SpecifiedCompensation
InternshipExperience Level
InternshipJob Type
UnknownVisa
Healthcare, TechnologyIndustries

Requirements

  • Currently pursuing a Ph.D. in Computer Science, Electrical Engineering, or a related field, with a focus on machine learning, natural language processing (NLP), Large Language Models (LLMs), multi-modal foundation models, and generative AI
  • Strong theoretical and practical background in NLP including experience with state-of-the-art architectures
  • Proficiency in deep learning frameworks (e.g., PyTorch, TensorFlow, etc.) and libraries commonly used in NLP and Generative AI
  • Solid programming skills in Python and the ability to write clean, efficient, and well-documented code
  • Excellent problem-solving and troubleshooting abilities
  • Finished classes and working only on research work to complete PhD
  • Within 1 year of PhD graduation date
  • Physically present in the United States for the duration of the internship

Responsibilities

  • Collaborate with researchers and engineers to design, develop, and refine large language models and generative models for various applications
  • Utilize expertise in machine learning and natural language processing to develop novel algorithms and methodologies for generative modeling tasks
  • Implement, train, and fine-tune LLM and GPT-like models on large-scale datasets to ensure optimal performance and accuracy
  • Stay up to date with the latest research advancements and techniques in the field of language modeling, generative modeling, and machine learning
  • Deliver the next generation of innovation in trustworthy healthcare

Skills

Machine Learning
Large Language Models
LLMs
Generative AI
Clinical Data Analysis
AI Research

Truveta

Healthcare data platform for research analytics

About Truveta

Truveta provides a platform that allows researchers to access and analyze patient data to enhance patient care and study the safety and effectiveness of treatments. The platform, known as Truveta Studio, offers immediate and compliant access to patient-level data, which is sourced from over 30 health systems and includes information from more than 100 million patients across the United States. This data is updated daily and comes from over 800 hospitals and 20,000 clinics. Truveta Studio is designed to simplify the data access process, making it cost-effective for researchers by charging them only for the data and analytics they use. Unlike many competitors, Truveta focuses on providing transparent pricing and efficient access to comprehensive healthcare data. The company's goal is to empower researchers in the healthcare and life sciences sectors to gain valuable insights that can lead to improved patient outcomes.

Seattle, WashingtonHeadquarters
2020Year Founded
$189.7MTotal Funding
LATE_VCCompany Stage
Data & Analytics, Biotechnology, HealthcareIndustries
201-500Employees

Benefits

Competitive Compensation
Comprehensive Benefits
401(k)
Professional Development
Work/Life Autonomy
Flexible Time Off
Generous Parental Leave
Team Activities

Risks

Data privacy concerns may arise from expanding datasets with sensitive information.
Rapid community expansion could strain resources and affect data quality.
Non-peer-reviewed studies may expose Truveta to criticism on scientific rigor.

Differentiation

Truveta offers the most comprehensive EHR data from over 100 million patients.
Truveta Studio provides cost-effective, compliant access to patient-level data and analytics.
Truveta's AI extracts complex clinical concepts from unstructured data, enhancing research capabilities.

Upsides

Truveta's partnership with Panalgo accelerates insights through integrated regulatory-grade EHR data.
The mother-child EHR dataset positions Truveta as a leader in maternal health research.
Truveta's real-world EHR data enables valuable drug comparisons ahead of clinical trials.

Land your dream remote job 3x faster with AI