Applied Scientist Intern (Audio Language Modeling)
Reality Defender
- Full Time
- Internship
Candidates should possess a degree in Linguistics, Computational Linguistics, NLP, or a related field, or demonstrate proven experience in these areas. Hands-on experience creating or curating language resources such as corpora, lexicons, and annotation guidelines for applied use is essential. Working knowledge of multiple languages or the ability to quickly analyze unfamiliar linguistic systems is required, along with practical experience writing, maintaining, and debugging tools for linguistic data analysis using Python, regular expressions, and libraries like Pandas, spaCy, or NLTK.
The Applied Linguist will:
- Develop and maintain linguistic datasets to power multilingual features
- Design culturally and linguistically grounded benchmarking frameworks
- Analyze language-specific model behaviors and guide improvements
- Coordinate with native speakers and localization experts
- Create and maintain annotation guidelines and pronunciation lexicons
- Work with Data Operations and Labeling teams to ensure labeling quality
- Partner with Research and Product to test new languages and features
- Provide ongoing QA for multilingual models
- Contribute to internal documentation for model behavior
Speech recognition APIs for audio transcription
Deepgram specializes in artificial intelligence for speech recognition, offering a set of APIs that developers can use to transcribe and understand audio content. Their technology allows clients, ranging from startups to large organizations like NASA, to process millions of audio minutes daily. Deepgram's APIs are designed to be fast, accurate, scalable, and cost-effective, making them suitable for businesses needing to handle large volumes of audio data. The company operates on a pay-per-use model, where clients are charged based on the amount of audio they transcribe, allowing Deepgram to grow its revenue alongside client usage. With a focus on the high-growth market of speech recognition, Deepgram is positioned for future success.