Senior Research Engineer
AssemblyAICandidates must have strong expertise in the Python ecosystem and major ML frameworks like PyTorch and JAX, along with experience in lower-level programming languages such as C++ or Rust. A deep understanding of GPU acceleration, including CUDA, profiling, and kernel-level optimization, is essential, with TPU experience being a strong plus. Proven ability to accelerate deep learning workloads using compiler frameworks, graph optimizations, and parallelization strategies is required, as is a soli…
Salary not specified
Full Time
Senior (5 to 8 years), Expert & Leadership (9+ years)