Research Engineer (or Scientist), Speech and Language at DeepMind

Mountain View, California, United States

Apply Now

Not SpecifiedCompensation

Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level

Full TimeJob Type

UnknownVisa

Artificial Intelligence, Technology, ResearchIndustries

Requirements

MS or PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, Speech Processing, or equivalent practical experience
Proven experience in deep learning research and development, particularly in generative AI related to video and audio synthesis, including diffusion models and autoregressive generative models
Exceptional engineering skills in Python and deep learning frameworks (e.g., JAX, TensorFlow, PyTorch), with a track record of building high-quality research prototypes and systems; self-motivated to pick up technologies to adapt and move quickly
Strong publication record at top-tier machine learning, computer vision, and graphics conferences (e.g., NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, ICCV)
Advantages
Knowledge of probabilistic machine learning and generative modeling (e.g., Diffusion, autoregressive models, GANs, flows, hierarchical VAEs, DDPMs)
Demonstrated experience in large-scale training of multimodal generative models
Sequence processing experience with TensorFlow, PyTorch, or JAX
Knowledge of speech processing and language understanding, in particular text-to-speech synthesis and prosody modeling

Responsibilities

Design, rapidly implement in code, and rigorously evaluate cutting-edge deep learning algorithms and data curation for multimodal generative AI, with a particular emphasis on audio and video synthesis
Report and present research findings and developments clearly and efficiently both internally and externally, verbally and in writing
Thrive under uncertainty, driving both team collaborations to meet ambitious research goals, as well as significant individual contributions

Skills

Key technologies and capabilities for this role

Deep LearningMachine LearningSpeech ProcessingLanguage ModelsGenerative AIAudio SynthesisVideo SynthesisMultimodal AIDiffusion ModelsPyTorchRepresentation LearningData Curation

Questions & Answers

Common questions about this position

What education is required for this Research Engineer role?

An MS or PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, Speech Processing, or equivalent practical experience is required.

What technical skills are essential for this position?

Proven experience in deep learning research and development, particularly in generative AI related to video and audio synthesis including diffusion models and autoregressive generative models, plus exceptional engineering skills in Python and deep learning frameworks like JAX, TensorFlow, or PyTorch.

Is a publication record necessary for this role?

Yes, a strong publication record at top-tier machine learning, computer vision, and graphics conferences such as NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, and ICCV is required.

What is the work arrangement or location for this position?

This information is not specified in the job description.

What is the salary or compensation for this role?

This information is not specified in the job description.

DeepMind

Develops artificial general intelligence systems

About DeepMind

This company leads in the field of artificial general intelligence (AGI), with notable applications across healthcare, energy management, and biotechnology. Their work in early diagnostic tools for eye diseases, optimizing energy usage in major data centers, and groundbreaking contributions to protein structure prediction underlines their commitment to harnessing AI for diverse practical applications. The company's dedication to pushing the boundaries of AI technology not only propels the industry forward but also creates a dynamic and impactful working environment for its employees.

London, United KingdomHeadquarters

2010Year Founded

$4.9MTotal Funding

ACQUISITIONCompany Stage

AI & Machine Learning, BiotechnologyIndustries

1,001-5,000Employees

Research Engineer (or Scientist), Speech and Language at DeepMind

Requirements

Responsibilities

Skills

Questions & Answers

DeepMind

About DeepMind

Benefits

Performance Bonus

Risks

Emerging AI models may challenge DeepMind's current strategies.

Backlash against AI models like Gemini poses reputational risks.

Labeling AI-generated content could increase operational complexity for DeepMind.

Differentiation

DeepMind combines AI, ML, and neuroscience for general-purpose learning algorithms.

DeepMind's AlphaFold model advances protein folding research significantly.

GraphCast by DeepMind offers rapid, accurate ten-day weather forecasts.

Upsides

AI-driven drug discovery is set to grow significantly in 2024.

AlphaCode 2 showcases AI's potential in competitive programming.

DeepMind's AI tools are transforming music creation and meteorology.

Research Scientist - Voice AI Foundations

AI Researcher (Multimodal Perception Models)

AI Researcher (Voice)

Senior Software Engineer, Audio/Video

Machine Learning Research Engineer

Machine Learning Researcher

Land your dream remote job 3x faster with AI

Research Engineer (or Scientist), Speech and Language at DeepMind

Requirements

Responsibilities

Skills

Questions & Answers

DeepMind

About DeepMind

Benefits

Performance Bonus

Risks

Emerging AI models may challenge DeepMind's current strategies.

Backlash against AI models like Gemini poses reputational risks.

Labeling AI-generated content could increase operational complexity for DeepMind.

Differentiation

DeepMind combines AI, ML, and neuroscience for general-purpose learning algorithms.

DeepMind's AlphaFold model advances protein folding research significantly.

GraphCast by DeepMind offers rapid, accurate ten-day weather forecasts.

Upsides

AI-driven drug discovery is set to grow significantly in 2024.

AlphaCode 2 showcases AI's potential in competitive programming.

DeepMind's AI tools are transforming music creation and meteorology.

Related Jobs

Research Scientist - Voice AI Foundations

AI Researcher (Multimodal Perception Models)

AI Researcher (Voice)

Senior Software Engineer, Audio/Video

Machine Learning Research Engineer

Machine Learning Researcher

Land your dream remote job 3x faster with AI