Staff Engineer - Raw Data
ParsableFull Time
Senior (5 to 8 years), Expert & Leadership (9+ years)
The ideal candidate possesses 4-7 years of experience in machine learning or AI engineering, with a proven track record in RAG, vector search, and LLM fine-tuning. Deep expertise with vector databases such as Pinecone, FAISS, Weaviate, or Azure AI Search is required, along with familiarity with HuggingFace and OpenAI fine-tuning APIs, and strong understanding of chunking strategies for optimizing retrieval. Proficiency in Python and experience with ML frameworks (PyTorch, TensorFlow) and cloud platforms (AWS, Azure) are essential. A solid understanding of token management, evaluation tuning, and cost optimization for large-scale AI deployments is also necessary, coupled with strong problem-solving skills, a collaborative mindset, and the ability to communicate complex technical concepts.
This role involves handling embeddings and chunking strategies to optimize document and data retrieval for GenAI-powered features. The Machine Learning Engineer will manage vector stores and retrieval workflows using leading vector databases, fine-tune small and large language models using frameworks such as HuggingFace and OpenAI APIs, and optimize cost and reduce latency by implementing best practices for token management, model evaluation, and cloud resource utilization. The engineer will also collaborate with engineering, product, and data teams to integrate RAG pipelines into production systems, ensuring reliability, scalability, and security, and stay up-to-date with the latest advancements in retrieval-augmented generation, vector search, and LLM fine-tuning.
Process serving solutions for law firms
ABC Legal Services specializes in process serving for law firms, delivering legal documents reliably and promptly. They offer services like point-to-point special delivery and subscription routes, with features such as GPS tracking and real-time updates. What sets them apart is their extensive network of over 2,000 process servers and their status as the only acting Central Authority for the U.S. Department of Justice for international service requests. The company's goal is to provide fast and efficient process serving solutions, enhancing the delivery experience for their clients.