Hunyuan Multimodal Algorithm Researcher (Omni-Modal) at Tencent

Palo Alto, California, United States

Compensation: $182,500 – $343,200
Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Technology, Artificial Intelligence

Requirements

  • Bachelor’s degree (full-time preferred) or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields; graduate degrees are prioritized
  • Hands-on experience in large-scale multimodal data processing and high-quality data generation is highly preferred
  • Solid foundation in deep learning algorithms and practical experience in large model development; familiarity with Diffusion Models and Autoregressive Models is advantageous. Publication in top-tier conferences or experience in cross-modal (e.g., audio-visual) research is preferred
  • Proficiency in underlying implementation details of deep learning networks and operators, model tuning for training/inference, CPU/GPU acceleration, and distributed training/inference optimization; practical experience is a plus
  • Participation in ACM or NOI competitions is highly valued
  • Strong learning agility, communication skills, teamwork, and curiosity

Responsibilities

  • Conduct research and development of Omni multimodal large models, including the design and construction of training data, foundational model algorithm design, optimization related to pre-training/SFT/RL, model capability evaluation, and exploration of downstream application scenarios
  • Scientifically analyze challenges in R&D, identify bottlenecks in model performance, and devise solutions based on first principles to accelerate model development and iteration, ensuring competitiveness and leading-edge performance
  • Explore diverse paradigms for achieving Omni-modal understanding and generation capabilities, research next-generation model architectures, and push the boundaries of multimodal models

Skills

Key technologies and capabilities for this role

Multimodal Models, Deep Learning, Diffusion Models, Autoregressive Models, Large Model Training, Distributed Training, GPU Acceleration, Model Optimization, Data Processing, RLHF, SFT, Model Evaluation

Questions & Answers

Common questions about this position

What is the salary range for this position?

The expected base pay range is $182,500 to $343,200 per year, with actual pay varying based on job-related knowledge, skills, and experience.

What benefits are offered for this role?

Benefits include medical, dental, vision, life and disability coverage, 401(k) plan participation, 15-25 days of vacation, up to 13 holidays, and up to 10 days of paid sick leave per year. Employees may also be eligible for a sign-on payment, relocation package, and restricted stock units on a case-by-case basis.

Is this position remote or onsite, and where is it located?

The position is located in Palo Alto, California, with no mention of remote work options.

What skills and experience are required for this role?

Candidates need a Bachelor’s degree or higher in Computer Science, AI, Mathematics, or a related field (graduate degrees are prioritized), a solid deep learning foundation with practical large model development experience, and proficiency in deep learning implementation details, model tuning, and optimization. Hands-on experience in large-scale multimodal data processing, familiarity with Diffusion and Autoregressive Models, top-tier publications, cross-modal research, and ACM/NOI competition experience are preferred.

What does Tencent value in candidates for this researcher role?

Tencent prioritizes strong learning agility, communication skills, teamwork, and curiosity, along with technical expertise in multimodal models and competition experience.

Tencent

Internet platform for social, gaming, fintech

About Tencent

Tencent is a technology company that focuses on enhancing the daily lives of internet users and assisting businesses in their digital transformation. It operates in various sectors, including social networking, entertainment, fintech, and cloud computing. Tencent's main products include WeChat, a messaging and mobile payment app with over a billion users, and Tencent Games, which produces popular video games like Honor of Kings and PUBG Mobile. The company generates revenue through online advertising, subscription services, in-app purchases, mobile payments, and cloud services. Unlike many competitors, Tencent has a diverse business model that allows it to serve both individual users and enterprises effectively. The goal of Tencent is to enrich user experiences and support businesses in their digital journeys.

Headquarters: Shenzhen, China
Year Founded: 1998
Total Funding: $31.5M
Company Stage: IPO
Industries: Consumer Software, Enterprise Software, Fintech, AI & Machine Learning, Gaming
Employees: 10,001+

Benefits

Professional Development Budget

Risks

Tencent's addition to the US blacklist may affect its operations and partnerships.
Developing a mobile version of Call of Duty may lead to competitive tensions with Microsoft.
Investment in blockchain exposes Tencent to volatile regulatory environments.

Differentiation

Tencent's WeChat app integrates messaging, social media, and mobile payments seamlessly.
Tencent Games is a global leader with popular titles like Honor of Kings and PUBG Mobile.
Tencent Cloud offers scalable solutions for businesses, enhancing digital transformation efforts.

Upsides

Tencent's investment in blockchain technology could enhance its fintech and cloud services.
The Hunyuan-Large language model advances Tencent's AI capabilities in social networking and gaming.
Collaboration with DYXnet on AI solutions opens new avenues in digital transformation services.
