Research Engineer (or Scientist), Speech and Language at DeepMind

Mountain View, California, United States

DeepMind Logo
Not SpecifiedCompensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, Technology, ResearchIndustries

Requirements

  • MS or PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, Speech Processing, or equivalent practical experience
  • Proven experience in deep learning research and development, particularly in generative AI related to video and audio synthesis, including diffusion models and autoregressive generative models
  • Exceptional engineering skills in Python and deep learning frameworks (e.g., JAX, TensorFlow, PyTorch), with a track record of building high-quality research prototypes and systems; self-motivated to pick up technologies to adapt and move quickly
  • Strong publication record at top-tier machine learning, computer vision, and graphics conferences (e.g., NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, ICCV)
  • Advantages
  • Knowledge of probabilistic machine learning and generative modeling (e.g., Diffusion, autoregressive models, GANs, flows, hierarchical VAEs, DDPMs)
  • Demonstrated experience in large-scale training of multimodal generative models
  • Sequence processing experience with TensorFlow, PyTorch, or JAX
  • Knowledge of speech processing and language understanding, in particular text-to-speech synthesis and prosody modeling

Responsibilities

  • Design, rapidly implement in code, and rigorously evaluate cutting-edge deep learning algorithms and data curation for multimodal generative AI, with a particular emphasis on audio and video synthesis
  • Report and present research findings and developments clearly and efficiently both internally and externally, verbally and in writing
  • Thrive under uncertainty, driving both team collaborations to meet ambitious research goals, as well as significant individual contributions

Skills

Key technologies and capabilities for this role

Deep LearningMachine LearningSpeech ProcessingLanguage ModelsGenerative AIAudio SynthesisVideo SynthesisMultimodal AIDiffusion ModelsPyTorchRepresentation LearningData Curation

Questions & Answers

Common questions about this position

What education is required for this Research Engineer role?

An MS or PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, Speech Processing, or equivalent practical experience is required.

What technical skills are essential for this position?

Proven experience in deep learning research and development, particularly in generative AI related to video and audio synthesis including diffusion models and autoregressive generative models, plus exceptional engineering skills in Python and deep learning frameworks like JAX, TensorFlow, or PyTorch.

Is a publication record necessary for this role?

Yes, a strong publication record at top-tier machine learning, computer vision, and graphics conferences such as NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, and ICCV is required.

What is the work arrangement or location for this position?

This information is not specified in the job description.

What is the salary or compensation for this role?

This information is not specified in the job description.

DeepMind

Develops artificial general intelligence systems

About DeepMind

This company leads in the field of artificial general intelligence (AGI), with notable applications across healthcare, energy management, and biotechnology. Their work in early diagnostic tools for eye diseases, optimizing energy usage in major data centers, and groundbreaking contributions to protein structure prediction underlines their commitment to harnessing AI for diverse practical applications. The company's dedication to pushing the boundaries of AI technology not only propels the industry forward but also creates a dynamic and impactful working environment for its employees.

London, United KingdomHeadquarters
2010Year Founded
$4.9MTotal Funding
ACQUISITIONCompany Stage
AI & Machine Learning, BiotechnologyIndustries
1,001-5,000Employees

Benefits

Performance Bonus

Risks

Emerging AI models may challenge DeepMind's current strategies.
Backlash against AI models like Gemini poses reputational risks.
Labeling AI-generated content could increase operational complexity for DeepMind.

Differentiation

DeepMind combines AI, ML, and neuroscience for general-purpose learning algorithms.
DeepMind's AlphaFold model advances protein folding research significantly.
GraphCast by DeepMind offers rapid, accurate ten-day weather forecasts.

Upsides

AI-driven drug discovery is set to grow significantly in 2024.
AlphaCode 2 showcases AI's potential in competitive programming.
DeepMind's AI tools are transforming music creation and meteorology.

Land your dream remote job 3x faster with AI