About the Role
We are seeking an experienced Data Engineer with Machine Learning expertise to work on designing, developing, and optimizing semantic and vector-based search solutions leveraging Lucene/Solr and modern embeddings.
This role involves applying machine learning, deep learning, and natural language processing techniques to improve search relevance and ranking. You will develop scalable data pipelines and APIs for indexing, retrieval, and model inference, and integrate ML models and search capabilities into production systems.
Additionally, you will evaluate, fine-tune, and monitor search performance metrics, and collaborate with software engineers, data engineers, and product teams to translate business needs into technical implementations.
As a member of our team, you will stay current with advancements in search technologies, LLMs, and semantic retrieval frameworks.
Mandatory Skills
* 5+ years of experience in Data Science or Machine Learning Engineering, with a focus on Information Retrieval or Semantic Search.
* Strong programming experience in both Java and Python (production-level code, not just prototyping).
* Deep knowledge of Lucene, Apache Solr, or Elasticsearch (indexing, query tuning, analyzers, scoring models).
* Experience with Vector Databases, Embeddings, and Semantic Search techniques.
* Strong understanding of NLP techniques (tokenization, embeddings, transformers, etc.).
* Experience deploying and maintaining ML/search systems in production.
* Solid understanding of software engineering best practices (CI/CD, testing, version control, code review).
Languages
* English: B2 Upper Intermediate