This is a fantastic opportunity to support the full lifecycle of real-world clinical and imaging data. As a Data Engineer, you will be responsible for collecting and ingesting large volumes of data, performing quality checks, building and maintaining data pipelines, cleaning and transforming raw data, and working closely with Data Scientists to prepare curated subsets for analysis.
Key Responsibilities:
* Collect and ingest large volumes of real-world data, including medical imaging and related annotations
* Perform detailed quality checks to ensure accuracy, completeness, and consistency
* Build and maintain data pipelines to structure and prepare complex datasets
* Clean, transform, and normalize raw data for downstream research analysis
* Work closely with Data Scientists to prepare specific curated subsets for focused analysis
Ideal Profile:
* 3–7 years of experience as a Data Engineer or in a similar role
* Strong proficiency in Python and SQL (required)
* Experience working with large-scale data pipelines
* Clear and confident communication skills in English (written and verbal)
* Ability to work independently and manage multiple data workflows in parallel
Additional Requirements:
* Self-motivated and eager to work in a fast-paced, research-oriented environment
Nice to Have:
* Experience with medical data, imaging (DICOM), clinical research, or pharmaceutical environments
Remote Opportunity | English Fluency Required