Research Scientist - Voice AI Foundations
Deepgram- Full Time
- Mid-level (3 to 4 years), Senior (5 to 8 years)
Candidates must be currently enrolled in a PhD program with a specialization in machine learning, deep learning, natural language processing, or speech processing. They should have at least 2 years of experience in training and fine-tuning large models, particularly audio language models and multi-modal foundation models. Familiarity with the end-to-end model building pipeline for machine learning tasks, including dataset curation, model implementation, benchmarking, and result analysis, is required. Experience with distributed multi-GPU training and prior publications in reputable ML, audio, or NLP research venues is also necessary.
The Applied Scientist Intern will explore and conceptualize novel methods to leverage different modalities for deepfake detection and audio understanding tasks. They will perform fundamental and applied research to advance the state-of-the-art in audio deepfake detection and build models that generalize to unseen generative methods. Collaboration with scientists and engineers across the organization is expected, as well as summarizing, publishing, and presenting research findings.
Deepfake detection for enterprises and governments
Reality Defender offers deepfake detection solutions to protect enterprises, platforms, and governments from AI-generated threats. Its detection platform scans images, videos, and audio in real time to identify fabricated content, helping to prevent misinformation. The company stands out by providing enterprise-grade services through a subscription model that allows easy integration into existing systems. The goal is to enhance fraud prevention and maintain the authenticity of digital content for clients.