Job Summary
We are seeking a Data Engineer to support the full lifecycle of real-world clinical and imaging data, enabling data-driven insights for high-impact research programs.
Duties and Responsibilities
* Data Ingestion and Quality Control: Collect and ingest large volumes of real-world data, including medical imaging and related annotations, and perform detailed quality checks to ensure accuracy, completeness, and consistency.
* Data Pipelines and Processing: Build and maintain data pipelines to structure and prepare complex datasets, clean, transform, and normalize raw data for downstream research analysis.
* Collaboration with Data Scientists: Work closely with Data Scientists to prepare specific curated subsets for focused analysis.
* Data Documentation: Document data sources, processing steps, and quality control procedures.
Ideal Profile
* A minimum of 3–7 years of experience as a Data Engineer or similar role.
* Strong proficiency in Python and SQL is required.
* Experience working with large-scale data pipelines.
* Clear and confident communication skills in English (written and verbal).
* Ability to work independently and manage multiple data workflows in parallel.
Benefits and Opportunities
This is a chance to be part of meaningful work with long-term potential. The ideal candidate will have a strong foundation in data engineering principles, excellent problem-solving skills, and the ability to collaborate effectively with cross-functional teams.