AI Applications Architect
Turing - Full Time
- Senior (5 to 8 years)
Candidates must possess 10+ years of experience in the compute/server/datacenter industry, including at least 3 years working with accelerated AI solutions that use GPUs or custom AI accelerators. A successful track record of specifying, building, and deploying rack-scale hardware and software infrastructure is required; experience at AIaaS cloud service providers, on-prem datacenters, or high-performance computing labs is highly desirable. Strong interpersonal skills are also necessary, including the ability to work efficiently in a group environment and to influence cross-functional teams without direct managerial authority. Excellent problem-solving skills and the ability to resolve complex, ambiguous issues are crucial.
The AI Infrastructure Solutions Architect will design Untether AI’s rack-scale solutions, encompassing accelerator cards, servers, networking, and rack topologies, and will select, integrate, and test best-of-breed hardware and full-stack software. They will lead and collaborate with cross-functional teams to drive a shared vision and deliver a best-in-class solution, working with business development teams to recommend and deliver rack-level solutions to partners and customers. The Architect will also dogfood the systems they create, testing and bulletproofing them before recommending them to partners and customers, and will work closely with hardware, software, and product teams.
Enhances AI inference with at-memory computing
Untether AI improves the speed and efficiency of AI inference workloads using at-memory computing. This approach places the compute elements next to the memory cells, which increases compute density and accelerates inference for a range of neural networks, such as those used in vision, natural language processing, and recommendation systems. The company targets businesses that rely on AI technologies and need high-performance computing for inference tasks. Its products include the runAI200® devices and the tsunAImi® accelerator cards, with the tsunAImi® card offering over 2 PetaOps in a compact PCI-Express form factor. Untether AI's goal is to provide efficient and cost-effective solutions for companies looking to enhance their AI applications.