Research Scientist - Speech & Audio Understanding (Speech Generation) at Tencent

Bellevue, Washington, United States

Tencent Logo
$122,500 – $229,700Compensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Technology, AIIndustries

Requirements

  • Master’s or Ph.D. in Computer Science, Artificial Intelligence, Electronic Engineering, Signal Processing, or related fields
  • Research or development experience in one or more areas: voice foundation models, speech synthesis, speech recognition, audio generation, voice conversion, or speech codec
  • Familiarity with mainstream voice-enabled large models (e.g., GPT4o, GLM-4-Voice, Qwen2.5-Omni, Voila). Prior project experience is preferred
  • Proficient in deep learning frameworks (e.g., PyTorch). Experience with large-scale model training frameworks (Megatron/Deepspeed) is a plus
  • Solid understanding of large model architectures and principles. Experience in large-scale pretraining or post-training is preferred

Responsibilities

  • Track the latest research in speech generation algorithms, explore next-generation paradigms for speech/audio generation, and push the boundaries of speech generation capabilities
  • Investigate cutting-edge multimodal voice foundation model technologies to enhance voice interaction experiences by integrating text, speech, and vision
  • Lead the technical R&D of voice foundation models, driving model performance improvements and innovative applications

Skills

Key technologies and capabilities for this role

speech generationvoice foundation modelsspeech synthesisspeech recognitionaudio generationvoice conversionspeech codecPyTorchMegatronDeepspeedlarge model architectureslarge-scale pretrainingmultimodal modelsGPT4oGLM-4-VoiceQwen2.5-OmniVoila

Questions & Answers

Common questions about this position

What is the salary range for this Research Scientist position?

The expected base pay range is $122,500.00 to $229,700.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience.

What benefits are offered for this role?

Employees may be eligible for a sign-on payment, relocation package, restricted stock units (case-by-case), medical, dental, vision, life and disability benefits, 401(k) plan, 15-25 days vacation (tenure-based), 13 holidays, and 10 days paid sick leave.

Is this position remote or onsite, and where is it located?

The position is located in US-Washington-Bellevue. No remote work policy is specified.

What skills and experience are required for this role?

A Master’s or Ph.D. in Computer Science, AI, Electronic Engineering, Signal Processing or related fields is required, along with research/development experience in voice foundation models, speech synthesis, recognition, audio generation, voice conversion, or speech codec. Proficiency in PyTorch and familiarity with voice-enabled large models like GPT4o or Qwen2.5-Omni are also needed.

What is the company culture like at Tencent?

Tencent fosters an inclusive environment as an equal opportunity employer, believing diverse voices fuel innovation to better serve users and the community, where every employee feels supported and inspired to achieve goals.

Tencent

Internet platform for social, gaming, fintech

About Tencent

Tencent is a technology company that focuses on enhancing the daily lives of internet users and assisting businesses in their digital transformation. It operates in various sectors, including social networking, entertainment, fintech, and cloud computing. Tencent's main products include WeChat, a messaging and mobile payment app with over a billion users, and Tencent Games, which produces popular video games like Honor of Kings and PUBG Mobile. The company generates revenue through online advertising, subscription services, in-app purchases, mobile payments, and cloud services. Unlike many competitors, Tencent has a diverse business model that allows it to serve both individual users and enterprises effectively. The goal of Tencent is to enrich user experiences and support businesses in their digital journeys.

Shenzhen, ChinaHeadquarters
1998Year Founded
$31.5MTotal Funding
IPOCompany Stage
Consumer Software, Enterprise Software, Fintech, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Professional Development Budget

Risks

Tencent's addition to the US blacklist may affect its operations and partnerships.
Developing Call of Duty mobile version may lead to competitive tensions with Microsoft.
Investment in blockchain exposes Tencent to volatile regulatory environments.

Differentiation

Tencent's WeChat app integrates messaging, social media, and mobile payments seamlessly.
Tencent Games is a global leader with popular titles like Honor of Kings and PUBG Mobile.
Tencent Cloud offers scalable solutions for businesses, enhancing digital transformation efforts.

Upsides

Tencent's investment in blockchain technology could enhance its fintech and cloud services.
The Hunyuan-Large language model advances Tencent's AI capabilities in social networking and gaming.
Collaboration with DYXnet on AI solutions opens new avenues in digital transformation services.

Land your dream remote job 3x faster with AI