ML PhD Intern - LLMs & Generative AI at Truveta

Seattle, Washington, United States

Truveta Logo
Not SpecifiedCompensation
InternshipExperience Level
InternshipJob Type
UnknownVisa
Healthcare, TechnologyIndustries

Requirements

  • Currently pursuing a Ph.D. in Computer Science, Electrical Engineering, or a related field, with a focus on machine learning, natural language processing (NLP), Large Language Models (LLMs), multi-modal foundation models, and generative AI
  • Strong theoretical and practical background in NLP including experience with state-of-the-art architectures
  • Proficiency in deep learning frameworks (e.g., PyTorch, TensorFlow, etc.) and libraries commonly used in NLP and Generative AI
  • Solid programming skills in Python and the ability to write clean, efficient, and well-documented code
  • Excellent problem-solving and troubleshooting abilities
  • Finished classes and working only on research work to complete PhD
  • Within 1 year of PhD graduation date
  • Physically present in the United States for the duration of the internship

Responsibilities

  • Collaborate with researchers and engineers to design, develop, and refine large language models and generative models for various applications
  • Utilize expertise in machine learning and natural language processing to develop novel algorithms and methodologies for generative modeling tasks
  • Implement, train, and fine-tune LLM and GPT-like models on large-scale datasets to ensure optimal performance and accuracy
  • Stay up to date with the latest research advancements and techniques in the field of language modeling, generative modeling, and machine learning
  • Deliver the next generation of innovation in trustworthy healthcare

Skills

Key technologies and capabilities for this role

Machine LearningLarge Language ModelsLLMsGenerative AIClinical Data AnalysisAI Research

Questions & Answers

Common questions about this position

Is this internship remote or does it require being in the office?

Truveta embraces a remote culture with headquarters in the greater Seattle area, but participation requires physical presence in the United States for the duration of the internship.

What are the eligibility requirements for this PhD internship?

Candidates must have finished their classes and be working only on research to complete their PhD, and must be within 1 year of their graduation date.

What skills and expertise are required for this ML PhD Intern role?

The role requires expertise in machine learning, natural language processing, large language models, and generative models, along with applied science and software development skills.

How long is the ML PhD internship at Truveta?

Internships are designed to be a minimum of 10 weeks, with the opportunity to extend based on company needs and candidate desires.

What is the company culture like at Truveta?

Truveta fosters a collaborative, mission-driven environment that values diversity, problem-solving, and teamwork across software engineering, big data, machine learning, clinical informatics, and medicine.

Truveta

Healthcare data platform for research analytics

About Truveta

Truveta provides a platform that allows researchers to access and analyze patient data to enhance patient care and study the safety and effectiveness of treatments. The platform, known as Truveta Studio, offers immediate and compliant access to patient-level data, which is sourced from over 30 health systems and includes information from more than 100 million patients across the United States. This data is updated daily and comes from over 800 hospitals and 20,000 clinics. Truveta Studio is designed to simplify the data access process, making it cost-effective for researchers by charging them only for the data and analytics they use. Unlike many competitors, Truveta focuses on providing transparent pricing and efficient access to comprehensive healthcare data. The company's goal is to empower researchers in the healthcare and life sciences sectors to gain valuable insights that can lead to improved patient outcomes.

Seattle, WashingtonHeadquarters
2020Year Founded
$189.7MTotal Funding
LATE_VCCompany Stage
Data & Analytics, Biotechnology, HealthcareIndustries
201-500Employees

Benefits

Competitive Compensation
Comprehensive Benefits
401(k)
Professional Development
Work/Life Autonomy
Flexible Time Off
Generous Parental Leave
Team Activities

Risks

Data privacy concerns may arise from expanding datasets with sensitive information.
Rapid community expansion could strain resources and affect data quality.
Non-peer-reviewed studies may expose Truveta to criticism on scientific rigor.

Differentiation

Truveta offers the most comprehensive EHR data from over 100 million patients.
Truveta Studio provides cost-effective, compliant access to patient-level data and analytics.
Truveta's AI extracts complex clinical concepts from unstructured data, enhancing research capabilities.

Upsides

Truveta's partnership with Panalgo accelerates insights through integrated regulatory-grade EHR data.
The mother-child EHR dataset positions Truveta as a leader in maternal health research.
Truveta's real-world EHR data enables valuable drug comparisons ahead of clinical trials.

Land your dream remote job 3x faster with AI