GPU Performance tooling engineer
RivosFull Time
Junior (1 to 2 years)
Key technologies and capabilities for this role
Common questions about this position
The salary range is $215K - $300K.
All roles require you to be in-person at the NYC HQ located in Union Square.
Expertise in CUDA, PyTorch, and generative models is essential, along with experience designing custom CUDA or Triton kernels, optimizing model training and inference pipelines on NVIDIA GPUs, and profiling with tools like Nsight and PyTorch Profiler.
The company fosters a culture of fast iteration, thought leadership, and outsized impact for early team members, with a rapidly growing team of ambitious engineers based in NYC.
Strong candidates are experts at the intersection of CUDA, PyTorch, and generative models, with hands-on experience optimizing GPU performance, designing custom kernels, and staying current with cutting-edge tech like Hopper features and FlashAttention.
Video captioning and translation services
Captions.ai enhances video content by providing captioning and translation services tailored for content creators, social media influencers, marketing agencies, and businesses. Their main offerings include automatic subtitle generation, translation into 28 languages, and video compression to improve performance. These tools simplify the video production process, allowing users to produce professional-quality videos with ease. Unlike many competitors, Captions.ai uses a freemium model, offering basic services for free while charging for advanced features, which helps attract a large user base and convert free users into paying customers. The company's goal is to make high-quality video content accessible to a wider audience, and recent funding will support their growth and product development.