[Remote] Senior Research Engineer at AssemblyAI

Remote

AssemblyAI Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, AI & Machine Learning, Speech Technology, Software DevelopmentIndustries

Requirements

Candidates must have strong expertise in the Python ecosystem and major ML frameworks like PyTorch and JAX, along with experience in lower-level programming languages such as C++ or Rust. A deep understanding of GPU acceleration, including CUDA, profiling, and kernel-level optimization, is essential, with TPU experience being a strong plus. Proven ability to accelerate deep learning workloads using compiler frameworks, graph optimizations, and parallelization strategies is required, as is a solid understanding of the deep learning lifecycle from model design to inference deployment. Strong debugging, profiling, and optimization skills in large-scale distributed environments are necessary, alongside excellent communication and collaboration skills.

Responsibilities

The Senior Research Engineer will investigate and mitigate performance bottlenecks in large-scale distributed training and inference systems. They will develop and implement both low-level and high-level optimization strategies, and translate research models and prototypes into highly optimized, production-ready inference systems. Responsibilities include exploring and integrating inference compilers, designing, testing, and deploying scalable solutions for parallel and distributed workloads on heterogeneous hardware, and facilitating knowledge transfer between Research and Engineering teams.

Skills

Speech AI
Deep Learning
Distributed Training
Data Processing
Inference Systems
High-Performance Computing
API Development
Python
PyTorch
TensorFlow
Machine Learning

AssemblyAI

Speech recognition and audio intelligence solutions

About AssemblyAI

AssemblyAI specializes in Speech AI technology, focusing on automatic speech recognition (ASR) and audio intelligence. Their main product is an API that allows businesses to transcribe audio and video content, detect speakers, analyze sentiment, and redact personally identifiable information (PII). This API enables clients to integrate these capabilities into their own applications, providing accurate and scalable speech-to-text solutions. Unlike many competitors, AssemblyAI emphasizes continuous improvement of their AI models, backed by a team of research leaders and engineers. Their goal is to help businesses unlock the potential of voice data, making it easier to derive insights and build innovative applications.

San Francisco, CaliforniaHeadquarters
2017Year Founded
$110MTotal Funding
SERIES_CCompany Stage
Enterprise Software, AI & Machine LearningIndustries
51-200Employees

Benefits

Competitive Salary + Bonus
Equity
401k
100% Remote team
Unlimited PTO
Premium medical, vision, & dental care
$1K budget for your home office setup
New Macbook Pro (or PC if you prefer)
2x/year company paid team retreat

Risks

Competition from Gladia, Deepgram, and Speechmatics may erode market share.
Rapid AI advancements could make current models obsolete without continuous innovation.
Over-reliance on API revenue is risky if clients shift to in-house solutions.

Differentiation

AssemblyAI offers advanced Speech AI models for transcription and sentiment analysis.
The company provides an API for seamless integration into client applications.
AssemblyAI's technology supports speaker detection and PII redaction, enhancing data security.

Upsides

Real-time audio processing is a growing market opportunity for AssemblyAI.
Customizable AI voice agents present a new avenue for product expansion.
Integration with no-code platforms could increase API adoption among non-technical users.

Land your dream remote job 3x faster with AI