Applied Scientist Intern (Audio Language Modeling)

New York, New York, United States

Apply with AI Apply

Not SpecifiedCompensation

InternshipExperience Level

Full TimeJob Type

UnknownVisa

AI & Machine Learning, CybersecurityIndustries

Position Overview

Location Type: Remote
Employment Type: Full-Time
Salary: Not Specified

Reality Defender provides accurate, multi-modal AI-generated media detection solutions to enable enterprises and governments to identify and prevent fraud, disinformation, and harmful deepfakes in real time. As a Y Combinator graduate, Comcast NBCUniversal LIFT Labs alumni, and backed by DCVC, Reality Defender is the first company to pioneer multi-modal and multi-model detection of AI-generated media. Our web app and platform-agnostic API built by our research-forward team ensures that our customers can swiftly and securely mitigate fraud and cybersecurity risks in real time with a frictionless, robust solution.

Responsibilities

Explore and conceptualize novel methods to leverage different modalities (e.g., speech, text) for deepfake detection and relevant audio understanding tasks.
Perform fundamental and applied research to advance the current state-of-the-art on audio deepfake detection.
Build models with generalizability to unseen generative methods.
Collaborate with scientists and engineers across the organization.
Summarize, publish, and present research findings.

Requirements

Currently enrolled in a PhD program with specialization in machine learning/deep learning, natural language processing, and/or speech processing.
2+ years of experience with training/fine-tuning large models, esp. audio language models, speech foundation models, multi-modal foundation models.
Experience with end-to-end model building pipeline for ML tasks: dataset curation/cleaning, model implementation, benchmarking, and result analysis.
Familiarity with distributed multi-GPU training.
Prior experience with publications in reputable ML/Audio/NLP research venues, e.g., NeurIPS, Interspeech, ICASSP, ACL, EMNLP.

Skills

Machine Learning

Deep Learning

Natural Language Processing

Speech Processing

Audio Language Models

Speech Foundation Models

Multi-modal Foundation Models

Distributed Multi-GPU Training

Model Building

Dataset Curation

Benchmarking

Result Analysis

Reality Defender

Deepfake detection for enterprises and governments

About Reality Defender

Reality Defender offers deepfake detection solutions to protect enterprises, platforms, and governments from AI-generated threats. Its detection platform scans images, videos, and audio in real time to identify fabricated content, helping to prevent misinformation. The company stands out by providing enterprise-grade services through a subscription model that allows easy integration into existing systems. The goal is to enhance fraud prevention and maintain the authenticity of digital content for clients.

New York City, New YorkHeadquarters

2018Year Founded

$46.7MTotal Funding

SERIES_ACompany Stage

Enterprise Software, Cybersecurity, AI & Machine LearningIndustries

51-200Employees

Risks

Free tools like TrueMedia's may undercut Reality Defender's subscription model.

Rapid increase in deepfakes could overwhelm current detection capabilities.

Commoditization of AI tools challenges Reality Defender to update detection algorithms.

Differentiation

Reality Defender offers real-time detection for images, videos, and audio deepfakes.

The platform is government-approved, ensuring high reliability and accuracy for clients.

Reality Defender uses a multi-model approach, enhancing detection capabilities across various media types.

Upsides

Raised $33M in Series A to expand technology and market reach.

Partnership with Respeecher enhances audio deepfake detection capabilities.

Won 'Most Innovative Startup' at RSA Conference 2024, boosting credibility.

Land your dream remote job 3x faster with AI

Try Jobo Free