Join Sii Poland as a Senior Data Engineer. In this role, you will design, build, and maintain robust data pipelines and scalable architectures to deliver trusted data from QA/QC systems into analytics and AI/ML platforms that drive critical decision-making across the organization.
Your tasks
* Designing and optimizing data pipelines that ingest, process, and delivering data in a regulated (GxP) environment
* Building and maintaining scalable data models and storage architectures for analytics and AI/ML workloads
* Developing ETL/ELT workflows and APIs with traceability and validation in mind
* Implementing data quality checks, automated testing, and monitoring to ensure data integrity and compliance
* Collaborating with data scientists and AI/ML engineers to deliver production-ready datasets
* Partnering with cross-functional teams to translate business needs into technical solutions
* Contributing to CI/CD, infrastructure as code, and automation workflows for data platform operations
* Championing engineering best practices, code reviews, and continuous improvement across the team
Requirements
* At least 6 years of professional experience in data or software engineering, with 3+ years in lead roles overseeing data engineering
* Proven experience building production-grade data pipelines and cloud-native architectures.
* Deep expertise in the AWS data stack (S3, Glue, Lambda, Athena, Redshift)
* Proficient in Python and SQL (PostgreSQL, Redshift, MySQL), with deep experience in scripting and automation
* Hands-on experience with ETL/ELT workflows using Airflow (custom DAGs), DBT, and YAML-based frameworks
* Strong understanding of data modelling, schema design, normalization, and analytics query optimization
* Practical experience with Spark and Kafka or batch and streaming data processing
* Experience delivering AI/ML-ready datasets, including feature engineering and real-time inference support
* Understanding of GxP, computer system validation (CSV), and documentation/testing requirements for validated systems