Applied Research Lead, Reinforcement Learning
RunwayFull Time
Junior (1 to 2 years)
Candidates should possess deep ML expertise, including hands-on experience with tree models, neural nets, ranking, and specifically LLM/VLM fine-tuning & RLHF/RLPF for ads. Proven track record in building high-throughput, low-latency services and real-time data pipelines is necessary. Comfort with uplift metrics, variance, and causal inference, along with an understanding of auction dynamics and advertiser pain points, is required. A startup mindset with a bias to action, comfort with ambiguity, and scrappiness are essential, as is experience mentoring and scaling high-impact ML teams with strong coaching and performance-management skills. Excellent communication abilities to distill complex ML trade-offs for various stakeholders are also a must. Prior work on ad-creative tooling and experience navigating brand-safety, privacy, or copyright frameworks in generative AI products are considered nice-to-have.
The Machine Learning Manager will lead and grow a diverse team of MLEs and software engineers, fostering a culture of rapid iteration, ownership, and inclusion. They will set the technical vision and define long-term roadmaps that align generative AI bets with Ads OKRs, translating fuzzy ideas into concrete milestones and measurable outcomes. This role involves launching and scaling LLM/VLM-powered generators and predictive models, owning the full lifecycle from prototype to production. Responsibilities include architecting low-latency generation/serving stacks and petabyte-scale feature pipelines that meet privacy, safety, and compliance requirements. The manager will drive experiments by establishing rigorous A/B and causal-inference frameworks to quantify lift, marketplace impact, and advertiser ROI. They will also partner broadly with Product, Sales, Policy, UX, and sibling Ads ML teams to integrate ACE capabilities into the self-serve flow and ad auction, and embed guardrails like brand-safety filters, copyright checks, and human-in-the-loop escalation paths to protect users and advertisers.
Online platform for community discussions and content
Reddit is an online platform that allows users to post, vote, and comment on content within various communities based on shared interests. Users can engage in discussions on a wide range of topics, from news and sports to entertainment and hobbies. The platform features a unique voting system where content can receive upvotes or downvotes, helping the most popular posts gain visibility. Reddit generates revenue primarily through advertising, premium memberships, and the sale of virtual goods like Reddit Coins. Unlike many other social media platforms, Reddit emphasizes community-driven content and discussions, creating a space for authentic interactions among users. The goal of Reddit is to foster engagement and connection among its diverse user base.