About this Role
* We are seeking an experienced AI/LLM Engineer to join our team in building and optimizing Retrieval-Augmented Generation (RAG) models for search across large-scale medical and scientific documents.
Key Responsibilities
* Developing and fine-tuning RAG models for high-performance search results
* Pre-processing and embedding large-scale documents for accurate search and retrieval
* Implementing semantic chunking strategies for improved document analysis
* Evaluating and refining automated evaluation pipelines for model performance
* Fine-tuning Large Language Models (LLMs) for textual RAG use cases
Requirements
* At least 1 year of experience as a Search Engineer or AI Engineer
* Experience with search technologies, including OpenSearch and scalable search systems
* Strong Python development skills, including API creation, model training, testing, and backend programming
* Familiarity with LangChain for building LLM workflows using tools, memory, and retrieval
* Excellent communication skills for collaborating with the team and providing regular updates
Preferred Skills
* Familiarity with AWS infrastructure, including IAM, VPC, S3, and more
* Knowledge of RAG architectures, specifically textual RAG use cases
About This Opportunity