GPU Performance tooling engineer
RivosFull Time
Junior (1 to 2 years)
Key technologies and capabilities for this role
Common questions about this position
This role is remote, based out of the United States.
This information is not specified in the job description.
Candidates need a strong grasp of CPU microarchitecture fundamentals, especially instruction scheduling, register files, scalar and vector execution, and optimizing instruction execution latencies. Experience with performance simulation tools like Gem5 or SimpleScalar for analyzing bottlenecks and tuning performance is required. Collaborative skills, organization, and familiarity with HDLs (Verilog, VHDL) and low-level C/C++ are pluses.
Tenstorrent values collaboration, curiosity, and a commitment to solving hard problems. The team consists of diverse technologists passionate about AI and building the best AI platform, working alongside top minds in high-performance computing and CPU micro-architecture.
Strong candidates have deep expertise in CPU microarchitecture, experience with performance simulation tools, and a collaborative approach to bridging architecture with modeling. Candidates at various experience levels are welcome, as the interview process assesses the appropriate level for offers.
Builds advanced computers for AI applications
Tenstorrent builds advanced computers specifically designed for artificial intelligence applications. Their products include high-performance computing systems that utilize specialized hardware and software solutions, leveraging technologies like ASIC design and RISC-V architecture. Unlike many competitors, Tenstorrent focuses on integrating neural network compilers into their systems, enhancing the efficiency of AI computations. The company's goal is to advance the capabilities of AI computing, serving clients in the AI and computing sectors while generating revenue through the sale of their specialized systems and services.