Sr Staff R&D Engineer at The Walt Disney Company

Nicasio, California, United States

The Walt Disney Company Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Entertainment, MediaIndustries

Requirements

  • MSc or PhD in Computer Science, Electrical Engineering, Applied Math, or a related field with a focus on AI/ML and multi-modal signal processing
  • 5 years of professional experience in applied ML, with a deep focus on audio-centric AI/ML research and deployment
  • Expertise in building and scaling models using PyTorch, with fluency in training, fine-tuning, and inference for deep neural networks
  • Demonstrated experience developing generative models such as VAE, GAN, diffusion models, or neural vocoders (e.g., HiFi-GAN, WaveNet)
  • Deep understanding of audio-specific ML domains, including source separation, speech enhancement, music processing, and cross-modal tasks
  • Experience with MLOps tooling (e.g., Weights & Biases, MLflow, Datachain), Docker-based containerization, and scalable infrastructure for distributed training
  • Fluency in audio signal processing fundamentals and the integration of DSP into ML pipelines
  • Proven ability to contribute to architectural planning, research strategy, and production deployment in complex, multi-stakeholder environments
  • Preferred Qualifications
  • Familiarity with audio/text/video multi-modal frameworks and cross-domain representations
  • Experience implementing real-time or near-real-time inference pipelines in cloud or edge environments (e.g., AWS, GCP, on-prem GPUs)
  • Working knowledge of latent diffusion audio models (e.g., stable-audio, AudioLDM, AudioGen)
  • Strong knowledge of industry-standard audio datasets and benchmarks (LibriSpeech, VCTK, MUSDB, etc.)

Responsibilities

  • Lead the research, design, and implementation of state-of-the-art machine learning algorithms for speech processing, voice transfer, source separation, and upmixing in media post-production environments
  • Drive the architecture and deployment of scalable model training pipelines using PyTorch and distributed computing frameworks
  • Develop novel generative audio models, including latent diffusion, flow-based models, variational autoencoders, and neural vocoders, optimized for professional soundtrack production
  • Own end-to-end model lifecycle management: pretraining, fine-tuning, validation, inference optimization, and CI/CD integration
  • Guide the development of personalized model adaptation workflows to support per-user tuning, cross-project continuity, and flexible deployment
  • Collaborate with product, platform, and engineering leads to define integration strategies within a secure, cloud-optimized SaaS environment
  • Stay at the forefront of generative audio, multi-modal modeling, and self-supervised learning—translating emerging research into applied innovation
  • Contribute to internal tooling and infrastructure that improves iteration speed, reproducibility, and explainability of deployed models
  • Mentor junior researchers and engineers, and contribute to a culture of rigorous experimentation, collaboration, and continuous improvement

Skills

Key technologies and capabilities for this role

PyTorchMachine LearningSpeech ProcessingSource SeparationUpmixingNeural VocodersLatent Diffusion ModelsVariational AutoencodersFlow-based ModelsDistributed ComputingCI/CDGenerative AudioModel Training Pipelines

Questions & Answers

Common questions about this position

Is this role remote or hybrid?

This role is hybrid, requiring work onsite in the Nicasio, CA office and occasional work from home.

What salary or compensation does this position offer?

This information is not specified in the job description.

What are the key skills required for this Sr Staff R&D Engineer role?

The role requires an MSc or PhD in Computer Science, Electrical Engineering, Applied Math, or related field with AI/ML focus, 5 years of professional experience in applied ML especially audio-centric AI/ML, and expertise in PyTorch for building and scaling models including generative models like VAE, GAN, diffusion models, or neural vocoders.

What is the team culture like for this position?

The role involves being a core member of an applied R&D team, contributing to technical direction, collaborating across product and engineering, mentoring junior researchers and engineers, and fostering a culture of rigorous experimentation, collaboration, and continuous improvement.

What makes a strong candidate for this Sr Staff R&D Engineer position?

Strong candidates will have an advanced degree in a relevant field with AI/ML focus, 5+ years in applied ML particularly audio AI, deep PyTorch expertise, and hands-on experience with generative audio models, along with the ability to lead research and mentor others.

The Walt Disney Company

Leading producers & providers of entertainment and information

About The Walt Disney Company

N/AHeadquarters
1923Year Founded
N/ACompany Stage
10,001+Employees

Land your dream remote job 3x faster with AI