Role:
We are seeking a highly skilled Machine Learning Engineer to join our team.
Your primary focus will be building and optimizing Retrieval-Augmented Generation (RAG) models for search across large-scale medical and scientific documents, including pre-processing and embedding over half a billion documents to ensure they are searchable and contextually accurate.
You'll work on semantic chunking strategies, improving the automated evaluation pipeline, and fine-tuning Large Language Models (LLMs) for textual RAG use cases. The role involves hands-on experimentation, model development, and backend engineering, with deployments to non-prod environments and collaboration with DevOps for production rollout.
About the Role Requirements: