Reddit

Senior Machine Learning Engineer, Ads Training Platform

United Kingdom

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Social Media, Advertising TechnologyIndustries

Requirements

Candidates should have 5+ years of experience in infrastructure or platform engineering, or large-scale distributed systems, with at least 2 years of hands-on experience with the Ray platform. A strong understanding of distributed computing principles, including task scheduling, fault tolerance, and state management, is required. Experience with distributed storage systems, large-scale data processing, and debugging/profiling distributed jobs is also necessary. Experience with deep learning frameworks like PyTorch or TensorFlow is a significant advantage.

Responsibilities

The Senior Machine Learning Engineer will design, build, and maintain large-scale distributed training infrastructure for Ads ML models. They will develop tools and frameworks on top of the Ray platform, and create tools for debugging, profiling, and tuning distributed training jobs for performance and reliability. Responsibilities also include integrating with object storage systems, improving data access patterns, and collaborating with ML engineers to enhance model training time, efficiency, and GPU training costs. Driving improvements in scheduling, state management, and fault tolerance within the training platform is also a key duty.

Skills

Infrastructure Engineering
Platform Engineering
Distributed Systems
ML Platform Operations
Ray Platform
Debugging
Profiling
Performance Tuning
Object Storage Systems
Data Access Patterns
GPU Training
Scheduling
State Management
Fault Tolerance

Reddit

Online platform for community discussions and content

About Reddit

Reddit is an online platform that allows users to post, vote, and comment on content within various communities based on shared interests. Users can engage in discussions on a wide range of topics, from news and sports to entertainment and hobbies. The platform features a unique voting system where content can receive upvotes or downvotes, helping the most popular posts gain visibility. Reddit generates revenue primarily through advertising, premium memberships, and the sale of virtual goods like Reddit Coins. Unlike many other social media platforms, Reddit emphasizes community-driven content and discussions, creating a space for authentic interactions among users. The goal of Reddit is to foster engagement and connection among its diverse user base.

San Francisco, CaliforniaHeadquarters
2005Year Founded
$1,177.1MTotal Funding
IPOCompany Stage
Consumer Software, EntertainmentIndustries
1,001-5,000Employees

Benefits

Comprehensive health benefits
Flexible unlimited vacation days & monthly global wellness days
Family planning funds & 4+ months paid parental leave
Personal & professional development funds
Paid volunteer time off
Workspace & home office benefits

Risks

Reliance on advertising revenue faces competition from platforms like TikTok and Instagram.
AI tools like 'Reddit Answers' may raise privacy and data usage concerns.
NFT market expansion exposes Reddit to volatility and regulatory scrutiny.

Differentiation

Reddit's unique voting system promotes engaging and authentic content discovery.
The platform's diverse communities cater to a wide range of interests and discussions.
Reddit's AI-powered tools enhance user experience and content moderation.

Upsides

Reddit's confidential IPO plans signal potential growth and market expansion.
AI-driven content moderation can improve user safety and platform reliability.
Expansion into blockchain and NFTs opens new revenue streams and user engagement.

Land your dream remote job 3x faster with AI