Job Title
Data Engineer - Cloud Native Expert
About the Role
We are seeking a highly skilled cloud native data engineer to design, implement and optimize enterprise-grade data pipelines on Azure. The ideal candidate will leverage Azure Databricks, Azure Data Factory, SQL Server and Python to enable scalable, governed and performant data solutions.
The role requires expertise in cloud-native technologies such as Azure Databricks, PySpark, Delta Lake, and Unity Catalog for governance. The successful candidate will play a key role in modernizing our data platform on the Azure Cloud, ensuring reliability, efficiency, and compliance across the full data lifecycle.
Key Responsibilities
1. Data Pipeline Development: Design, build and optimize ETL/ELT pipelines using Azure Databricks (PySpark, Delta Lake) and Azure Data Factory (ADF).
2. Data Flows & Transformations: Develop pipelines, data flows, and complex transformations with ADF, PySpark, and T-SQL for seamless data extraction, transformation, and loading.
3. Data Processing: Develop Databricks Python notebooks for tasks such as joining, filtering, and pre-aggregation.
4. Database & Query Optimization: Optimize database performance through SQL query tuning, index optimization, and code improvements to ensure efficient data retrieval and manipulation.
5. SSIS & Migration Support: Maintain and enhance SSIS package design and deployment for legacy workloads; contribute to migration and modernization into cloud-native pipelines.
6. Collaboration & DevOps: Work with cross-functional teams using Git (Azure Repos) for version control and Azure DevOps pipelines (CI/CD) for deployment.
7. Data Governance & Security: Partner with governance teams to integrate Microsoft Purview and Unity Catalog for cataloging, lineage tracking, and role-based security.
8. API & External Integration: Implement REST APIs to retrieve analytics data from diverse external data feeds, enhancing accessibility and interoperability.
9. Automation: Automate ETL processes and database maintenance tasks using SQL Agent Jobs, ensuring data integrity and operational reliability.
Required Skills and Qualifications
* 5+ years of hands-on expertise with Azure Databricks, Python, PySpark, and Delta Lake.
* 5+ years of proven experience with Azure Data Factory for orchestrating and monitoring pipelines.
* Strong SQL Server / T-SQL experience with a focus on query optimization, indexing strategies, and coding best practices.
* Demonstrated experience in SSIS package design, deployment, and performance tuning.
* Hands-on knowledge of Unity Catalog for governance.
* Experience with Git (Azure DevOps Repos) and CI/CD practices in data engineering projects.
Benefits
This is an exciting opportunity to join a dynamic team and work on cutting-edge cloud-native data engineering projects. As a cloud native data engineer, you will have the chance to develop your skills in Azure Databricks, Azure Data Factory, and Unity Catalog, and contribute to the modernization of our data platform on the Azure Cloud.
You will work closely with cross-functional teams to ensure that data solutions are scalable, governed, and performant. This role offers opportunities for professional growth and development, and we encourage collaboration and innovation.
Others
PREFERRED QUALIFICATIONS:
* Microsoft Certified: Azure Data Engineer Associate (DP-203)
* Microsoft Certified: Azure Solutions Architect Expert or equivalent advanced Azure certification
* Databricks Certified Data Engineer Associate or Professional
* Additional Microsoft SQL Server or Azure certifications demonstrating advanced database and cloud expertise
Nice to Have:
* Exposure to Change Data Capture (CDC), Change Data Feed (CDF), and Temporal Tables.
* Experience with Microsoft Purview, Power BI, and Azure-native integrations.
* Familiarity with Profisee Master Data Management (MDM).
* Working in Agile/Scrum environments.