Enterprise Data Engineer
* Job Summary:
As an Enterprise Data Engineer, you will design, implement, and optimize complex data pipelines using Azure Databricks, Azure Data Factory, and Python. Your primary goal will be to deliver scalable, governed, and performant data solutions that ensure reliability, efficiency, and compliance across the full data lifecycle.
* Key Responsibilities:
o Design, build, and optimize ETL/ELT pipelines using Azure Databricks (PySpark, Delta Lake) and Azure Data Factory (ADF).
o Develop pipelines, data flows, and complex transformations with ADF, PySpark, and T-SQL for seamless data extraction, transformation, and loading.
o Develop Databricks Python notebooks for tasks such as joining, filtering, and pre-aggregation.
o Optimize database performance through SQL query tuning, index optimization, and code improvements to ensure efficient data retrieval and manipulation.
o Maintain and enhance SSIS package design and deployment for legacy workloads; contribute to migration and modernization into cloud-native pipelines.
o Work with cross-functional teams using Git (Azure Repos) for version control and Azure DevOps pipelines (CI/CD) for deployment.
o Partner with governance teams to integrate Microsoft Purview and Unity Catalog for cataloging, lineage tracking, and role-based security.
o Build REST API integrations to retrieve analytics data from diverse external data feeds, improving accessibility and interoperability.
o Automate ETL processes and database maintenance tasks using SQL Server Agent jobs, ensuring data integrity and operational reliability.
o Craft and optimize complex T-SQL queries to support efficient data processing and analytical workloads.
* Required Qualifications:
o 5+ years of hands-on experience with Azure Databricks, Python, PySpark, and Delta Lake.
o 5+ years of proven experience with Azure Data Factory for orchestrating and monitoring pipelines.
o Strong SQL Server / T-SQL experience with a focus on query optimization, indexing strategies, and coding best practices.
o Demonstrated experience in SSIS package design, deployment, and performance tuning.
o Hands-on knowledge of Unity Catalog for governance.
o Experience with Git (Azure Repos) and CI/CD practices in data engineering projects.
* Nice to Have:
o Exposure to SQL Server Change Data Capture (CDC), Delta Lake Change Data Feed (CDF), and temporal tables.
o Experience with Microsoft Purview, Power BI, and Azure-native integrations.
o Familiarity with Profisee Master Data Management (MDM).
o Experience working in Agile/Scrum environments.
* Preferred Qualifications:
o Microsoft Certified: Azure Data Engineer Associate (DP-203)
o Microsoft Certified: Azure Solutions Architect Expert or equivalent advanced Azure certification
o Databricks Certified Data Engineer Associate or Professional
o Additional Microsoft SQL Server or Azure certifications demonstrating advanced database and cloud expertise