Data Engineer
We're seeking a skilled Data Engineer to join our team. In this role, you'll design and implement enterprise-grade data pipelines using Azure Databricks.
Key Responsibilities:
* Data Pipeline Development: Design, build, and optimize ETL/ELT pipelines using Azure Databricks (PySpark, Delta Lake) and Azure Data Factory (ADF).
* Data Flows & Transformations: Build ADF data flows and complex transformations in PySpark and T-SQL to extract, transform, and load data.
* Data Processing: Develop Databricks Python notebooks for tasks such as joining, filtering, and pre-aggregation.
* Database & Query Optimization: Optimize database performance through SQL query tuning, index optimization, and code improvements to ensure efficient data retrieval and manipulation.
* SSIS & Migration Support: Maintain and enhance SSIS packages for legacy workloads, and contribute to migrating and modernizing them into cloud-native pipelines.
* Collaboration & DevOps: Work with cross-functional teams using Git (Azure Repos) for version control and Azure DevOps pipelines (CI/CD) for deployment.
* Data Governance & Security: Partner with governance teams to integrate Microsoft Purview and Unity Catalog for cataloging, lineage tracking, and role-based security.
* API & External Integration: Implement REST API integrations to retrieve analytics data from external data feeds.
* Automation: Automate ETL processes and database maintenance tasks using SQL Server Agent jobs, ensuring data integrity and operational reliability.
* Advanced SQL Expertise: Craft and optimize complex T-SQL queries to support efficient data processing and analytical workloads.
Requirements:
* Azure Databricks: 5+ years of hands-on expertise.
* Azure Data Factory: 5+ years of proven experience.
* SQL Server: Strong experience with query optimization, indexing strategies, and coding best practices.
* SSIS: Demonstrated experience in package design, deployment, and performance tuning.
* Unity Catalog: Hands-on knowledge.
* Git & CI/CD: Experience with Git (Azure Repos) and Azure DevOps CI/CD pipelines in data engineering projects.
Preferred Certifications:
* Microsoft Certified: Azure Data Engineer Associate (DP-203)
* Microsoft Certified: Azure Solutions Architect Expert or equivalent advanced Azure certification
* Databricks Certified Data Engineer Associate or Professional
* Additional Microsoft SQL Server or Azure certifications demonstrating advanced database and cloud expertise
Nice to Have:
* Change Data Capture (CDC), Change Data Feed (CDF), and Temporal Tables
* Microsoft Purview, Power BI, and Azure-native integrations
* Profisee Master Data Management (MDM)
* Experience working in Agile/Scrum environments