Machine Learning Engineer
SweedFull Time
Mid-level (3 to 4 years)
Key technologies and capabilities for this role
Common questions about this position
Quadric provides a comprehensive benefits package including Health Care Plan (Medical, Dental & Vision), Retirement Plan (401k, IRA), Life Insurance, Paid Time Off, Family Leave, Short Term & Long Term Disability, Training & Development, Work From Home, Free Food & Snacks, and Stock Option Plan.
The position is hybrid, requiring some office presence as remote is not fully supported.
Candidates need 5+ years in AI/LLM model inference and deployment, experience with model quantization (PTQ, QAT), accuracy measurement, performance profiling, frameworks like onnxruntime, Pytorch, vLLM, huggingface-transformer, neural-compressor, or llamacpp, plus proficiency in C/C++ and Python.
This information is not specified in the job description.
A strong candidate holds a Bachelor’s or Master’s in Computer Science or Electrical Engineering, has 5+ years of AI/LLM inference experience, expertise in model optimization techniques like quantization, proficiency in C/C++ and Python, and strong problem-solving, debugging, and communication skills.
Simplifies SoC design for machine learning
Quadric focuses on simplifying the design and programming of System on Chips (SoCs) specifically for machine learning applications. Their main product is the Chimera, a General-Purpose Neural Processing Unit (GPNPU) that combines matrix and vector operations with scalar control code in a single execution pipeline. This design allows developers to avoid splitting application code across different processors, making the development process more efficient. Quadric serves clients in the semiconductor industry, including SoC developers and manufacturers, who need to improve their machine learning capabilities. Unlike competitors, Quadric offers a comprehensive solution that includes both hardware and software tools, such as the Chimera LLVM C Compiler and the Chimera Instruction Set Simulator, enabling developers to design, simulate, and deploy their applications effectively. The goal of Quadric is to enhance the performance and ease of development for machine learning applications on SoCs.