Senior Solutions Architect, HPC Systems Engineer
NVIDIAFull Time
Senior (5 to 8 years)
Candidates should have over 5 years of experience with HPC Systems and/or HPC Applications, along with experience in Linux/Unix operating system environments and scripting languages like Bash and Python. Familiarity with cloud computing platforms (AWS, Azure, Google Cloud), high-level knowledge of HPC schedulers and MPI, and familiarity with scientific computing libraries like OpenMP and CUDA are required. A general knowledge in at least one CAE discipline, 2+ years of full-stack experience with Python in cloud environments, and experience in statistical analysis/AI/ML are also necessary. Expertise in IaC tools, serverless architecture, and devops/gitops for CI/CD pipelines is strongly preferred. The role requires a self-starter with technical aptitude, excellent communication skills, a BS/BA degree in Engineering, Computer Science, or a related field, and must be a US Person.
The HPC Engineer will be responsible for the end-to-end delivery of core functionality and new product ideas, building and maintaining automation for software catalog building, testing, and maintenance. They will develop and maintain critical product areas including multi-cloud compute environments, job routing, and performance analytics. Responsibilities also include delivering technological solutions for Rescale's features, publishing, managing, and optimizing complex HPC software for demanding customers, and developing/implementing processes and best practices for HPC services and support. The role involves identifying and implementing automation opportunities for increased productivity, designing and improving processes to drive customer satisfaction, and solving complex problems related to customer software on Rescale, requiring in-depth knowledge of the software and its operating environment.
Cloud platform for high-performance computing
Rescale offers a cloud-based platform for high-performance computing (HPC) solutions aimed at computational engineering and research & development (R&D). The platform helps clients overcome limitations of outdated hardware by providing scalable computing resources, enabling faster product development and improved collaboration. Rescale differentiates itself with a pay-as-you-go model, allowing clients to access computing power on demand. The company's goal is to enhance innovation and competitiveness in the digital engineering sector.