Job Description:
The primary objective of the project is to revamp and optimize an e-commerce platform for a large retail company serving millions of customers weekly.
Solutions are delivered by multiple Product Teams focused on various domains including Customer, Loyalty, Search, Data Integration, and Cart.
Current top priorities include new brand onboarding, re-architecture, database migrations, and migrating microservices to a unified cloud-native solution without disrupting business operations.
Responsibilities:
* We are looking for an experienced Data Scientist with Machine Learning expertise and good understanding of search engines to work on the following:
* - Design, develop, and optimize semantic and vector-based search solutions using Lucene/Solr and modern embeddings.
* - Apply machine learning, deep learning, and natural language processing techniques to enhance search relevance and ranking.
* - Develop scalable data pipelines and APIs for indexing, retrieval, and model inference.
* - Integrate ML models and search capabilities into production systems.
* - Evaluate, fine-tune, and monitor search performance metrics.
* - Collaborate with software engineers, data engineers, and product teams to translate business needs into technical implementations.
* - Stay current with advancements in search technologies, LLMs, and semantic retrieval frameworks.
Mandatory Skills Description:
* 5+ years of experience in Data Science or Machine Learning Engineering with a focus on Information Retrieval or Semantic Search.
* Strong programming experience in both Java and Python (production-level code, not just prototyping).
* Deep knowledge of Lucene, Apache Solr, or Elasticsearch (indexing, query tuning, analyzers, scoring models).
* Experience with Vector Databases, Embeddings, and Semantic Search techniques.
* Strong understanding of NLP techniques (tokenization, embeddings, transformers, etc.).
* Experience deploying and maintaining ML/search systems in production.
* Solid understanding of software engineering best practices (CI/CD, testing, version control, code review).
Languages:
* English: B2 Upper Intermediate