AI Model Deployment Specialist
Job Title: AI Model Deployment Specialist for Retrieval-Augmented Generation (RAG) Service
About the Role:
* We are seeking an experienced AI model deployment specialist to build and optimize RAG models for search across large-scale medical and scientific documents.
* The ideal candidate will pre-process and embed over half a billion documents to ensure they are searchable and contextually accurate.
Key Responsibilities:
* Hands-on experimentation and model development using cutting-edge technologies.
* Backend engineering with deployments to non-prod environments and collaboration with DevOps for production rollout.
Requirements:
* 1+ years of experience as a Search Engineer or AI Engineer in a similar role.
* Search Technology Experience – OpenSearch, building scalable search systems.
* 2+ years of Python development experience, including API creation, model training, testing, and general backend programming.
* Familiarity with LangChain for building LLM workflows using tools, memory, and retrieval.
Preferred Skills:
* Familiarity with AWS infrastructure (IAM, VPC, S3, etc.).
* Exposure to RAG architectures, specifically textual RAG use cases.