Position Overview
- Location Type: Remote
- Employment Type: Full-Time
- Salary: Competitive (Details not provided)
- About Outspeed: Outspeed is solving the challenge of latency in AI systems. They are building infrastructure for real-time AI applications in gaming, AR/VR, robotics, and more. The company is led by an experienced team with backgrounds from MIT, Google, and Microsoft, and is based in San Francisco. They value empathy, deep technical knowledge, and autonomy.
Role Description
As an early Member of Technical Staff, you will contribute to every layer of Outspeed’s real-time AI platform – core inference engines, orchestration services, developer APIs, and customer-facing tools. You'll alternate between rapid prototyping and hardening production systems, shipping code that is immediately exercised by teams worldwide. Typical weeks might include:
- Owning end-to-end feature work – from RFC and design review to infrastructure-as-code, monitoring, and post-launch iteration.
- Optimizing GPU inference pipelines, extending the TypeScript/React console, and designing a new audio-streaming protocol.
- Pairing with customers to debug latency spikes, then upstreaming the fix into the orchestration layer.
- Influencing engineering culture: instituting best practices, mentoring newer hires, and shaping the technical roadmap alongside the founders.
This role is ideal for an engineer who enjoys breadth, thrives on context-switching, and wants their fingerprints on everything built.
Requirements
- Experience: 2+ years of professional software engineering experience.
- Proficiency: Deep proficiency in at least one systems language (Go/Rust/C++) and one high-level language (Python/TypeScript).
- Production Systems: Proven track record shipping production-grade distributed systems – designing, implementing, and operating services that run at scale and stay up.
- ML Workflows: Hands-on experience with end-to-end ML workflows: data pipelines, model training/tuning, packaging, and low-latency serving (PyTorch + CUDA or similar).
- Product Sense: Strong product sense and the ability to translate ambiguous problems into well-scoped engineering work.
- Communication: Excellent written and verbal communication skills; comfortable leading design docs and code reviews in a fast-moving, asynchronous environment.
Nice-to-Haves
- Containerization & Orchestration: Expertise running containerized workloads with Docker, Kubernetes, or Nomad, plus IaC with Terraform or Pulumi.
- Cloud Primitives: Knowledge of public-cloud primitives (SDN, block storage, secrets/identity, spot fleet management) and how to squeeze every dollar out of them.
- Inference Tooling: Familiarity with modern inference tooling (e.g., vLLM, Ray, Triton, llama.cpp).
Application Instructions
Company Information
- Founded by: Immigrants
- Commitment: Providing support to immigrant workers.