Data Scientist II at Scribd

San Francisco, California, United States

Scribd Logo
$120,000 – $180,000Compensation
Junior (1 to 2 years)Experience Level
Full TimeJob Type
UnknownVisa
Biotechnology, MediaIndustries

Requirements

  • 3+ years of post-qualification experience developing machine learning models, working with systems at scale
  • Experience with machine learning models, including but not limited to NLP, LLMs, and generative models
  • Experience with classical Scikit-learn and NumPy models
  • Experience with custom Neural Networks in PyTorch
  • Experience with third-party LLM APIs
  • Familiarity with Python, SQL, and Spark
  • Experience with data analysis and visualization tools
  • Strong understanding of machine learning algorithms and their applications
  • Experience with data preprocessing and feature engineering
  • Strong problem-solving skills and attention to detail
  • Excellent communication and documentation skills
  • Ability to work in a fast-paced environment and meet deadlines
  • Experience with data science tools and technologies
  • Familiarity with cloud-based infrastructure and data storage solutions
  • Experience with data governance and quality control

Responsibilities

  • Investigate methods of solving complex problems at Scribd, at scale
  • Collaborate with other Data Scientists, Machine Learning Engineers, and ML Data Engineers on cross-functional projects
  • Leverage various algorithms, including but not limited to classical Scikit-learn and NumPy models, custom Neural Networks in PyTorch, and third-party LLM APIs
  • Process massive amounts of data using Python, SQL, and Spark
  • Align with stakeholders through written and verbal communications methods on approaches and results of projects
  • Write detailed, accurate, and concise project documentation
  • Develop and deploy high-impact AI and ML systems
  • Contribute innovative ideas and solutions to the Applied Research team
  • Participate in cross-functional squads to maximize business impact
  • Design and implement content enrichment, representation learning, recommendations, search, translation, and other areas of impact
  • Act as a key driver for innovation in product surface experimentation, metadata generation, and model development
  • Collaborate with Product and Engineering partners to design solutions and maximize business impact

Skills

Machine Learning
Natural Language Processing
Generative AI
Data Analysis
Problem-Solving
Communication

Scribd

Digital library and e-book subscription service

About Scribd

Scribd is a digital library and e-book subscription service that provides users with access to a wide variety of reading materials, including e-books, audiobooks, magazines, and documents. Users pay a monthly subscription fee to enjoy unlimited access to this extensive library, which features bestsellers, classic literature, and academic papers. Scribd stands out from competitors like Amazon's Kindle Unlimited and Audible by offering a more diverse range of content types and a user-friendly interface that enhances the reading experience. Additionally, Scribd fosters community engagement through features like ScribdChat, allowing interaction between authors and readers, as well as curated reading lists that highlight relevant content. The company's goal is to serve a broad audience of readers, from casual readers to professionals, by providing a rich and varied digital content library.

San Francisco, CaliforniaHeadquarters
2007Year Founded
$102.9MTotal Funding
LATE_VCCompany Stage
Consumer Software, EntertainmentIndustries
201-500Employees

Benefits

Diversity, Equity, and Inclusion - A robust Diversity, Equity and Inclusion program that includes company-wide training, equitable hiring best practices, Employee Resource Groups, and company-wide goals.
Scribd flex - We embrace flexibility as a key principle and will allow employees, in partnership with their manager, to choose the workstyle that best suits their individual needs and preferences.
Health, Dental, Vision, Life & Disability - We offer comprehensive healthcare plans and cover premiums for employees at 100%. PPO, HMO, and High Deductible Health Plans are available so you can choose whichever coverage supports your lifestyle best.
Matching 401(k) - Easily save for retirement with our 401(k) plan and take advantage of up to 3% company match with no waiting period.
Paid Time Off - A generous paid time off program that includes vacation time, personal days, sabbaticals, volunteer time, winter break, sick time, and more.
Paid parental leave - We provide paid time away from work for our new biological, adoptive, or foster parents.
Wellbeing - Resources, workshops & events to support your wellbeing journey: Mind, Body & Soul.
Curated Career Paths and Continuous Learning - The power to explore career paths & possibilities, and the tools to support your growth. From continuing education, degrees, or certifications, we support our employee’s career journeys.

Risks

Credit-based model may alienate users preferring unlimited access.
Spotify's competitive pricing could draw subscribers away from Scribd.
New CEO may disrupt Scribd's current business model and market position.

Differentiation

Scribd offers a diverse range of content types beyond just e-books.
The platform includes features like ScribdChat for author-reader interaction.
Scribd's user-friendly interface enhances the overall reading experience.

Upsides

Tony Grimminck's appointment as CEO suggests potential strategic expansions.
The credit-based model could attract users seeking flexible content access.
Growing audiobook demand presents a market opportunity for Scribd.

Land your dream remote job 3x faster with AI