Staff AI Research Scientist - Evaluation, Handshake AI
HandshakeFull Time
Expert & Leadership (9+ years)
Key technologies and capabilities for this role
Common questions about this position
The position is hybrid.
This information is not specified in the job description.
Required skills include experience designing and running evaluations such as benchmarks and test suites, statistical and analytical rigor for reproducible experiments, expertise in building with models like compound AI systems and agentic techniques, and a proven track record of research results.
Distyl AI features a fast-growing, innovative environment pushing AI utilization in enterprises, with researchers who creatively redefine software use, operate in AI-native ways, and come from strong academic backgrounds with research track records; the company is led by proven leaders from Palantir, Apple, and top labs, partnering with OpenAI.
Strong candidates have experience designing evaluations and benchmarks, statistical rigor, expertise in building intelligent systems with models like agentic collaboration and techniques such as ReAct, and a proven research track record; they thrive in creative, AI-native environments beyond traditional research.
Provides customized AI solutions for enterprises
Distyl.ai provides artificial intelligence solutions tailored for enterprises, focusing on enhancing productivity and streamlining operations. Their products utilize generative AI and large language models, which can be customized to fit a client's specific data, workflows, and systems. This customization allows for smooth integration with existing technologies. Distyl.ai serves a wide range of clients, including Fortune 500 companies in sectors like Consumer Packaged Goods, Retail, Healthcare, Finance, and Manufacturing, as well as federal agencies. The company’s experienced team, which includes professionals from major tech firms and top AI research institutions, enables them to create scalable solutions that meet high standards. Distyl.ai aims to unlock value for clients quickly, typically within a quarter, by providing tailored AI applications that address unique business needs.