E-commerce Platform Modernization and Development
As a senior data scientist, you will play a crucial role in the modernization, maintenance, and development of an e-commerce platform for a big retail company. Serving millions of customers each week, this project requires expertise in search engines, machine learning, and data integration.
Solutions are delivered by several product teams focused on different domains: customer, loyalty, search and browse, data integration, and cart. Current priorities include new brand onboarding, re-architecture, database migrations, and migration of microservices to a unified cloud-native solution without disruption to business.
Responsibilities:
* Design, develop, and optimize semantic and vector-based search solutions using Lucene/Solr and modern embeddings.
* Apply machine learning, deep learning, and natural language processing techniques to improve search relevance and ranking.
* Develop scalable data pipelines and APIs for indexing, retrieval, and model inference.
* Integrate ML models and search capabilities into production systems.
* Evaluate, fine-tune, and monitor search performance metrics.
* Collaborate with software engineers, data engineers, and product teams to translate business needs into technical implementations.
* Stay current with advancements in search technologies, LLMs, and semantic retrieval frameworks.
Mandatory Skills Description:
* 5+ years of experience in data science or machine learning engineering, with a focus on information retrieval or semantic search.
* Strong programming experience in both Java and Python (production-level code, not just prototyping).
* Deep knowledge of Lucene, Apache Solr, or Elasticsearch (indexing, query tuning, analyzers, scoring models).
* Experience with vector databases, embeddings, and semantic search techniques.
* Strong understanding of NLP techniques (tokenization, embeddings, transformers, etc.).
* Experience deploying and maintaining ML/search systems in production.
* Solid understanding of software engineering best practices (CI/CD, testing, version control, code review).