Leading enterprise data engineer

Carapicuíba

beBeeEngineering

Anunciada dia 19 dezembro

Descrição

Data Engineering Position

We are seeking an experienced Senior Data Engineer to join our team and play a key role in designing, implementing, and optimizing enterprise-grade data pipelines.

About the Role

This is an excellent opportunity for a skilled professional to leverage their expertise in Azure Databricks, Python, PySpark, and Delta Lake to enable scalable, governed, and performant data solutions.

Key Responsibilities

1. Data Pipeline Development: Design, build, and optimize ETL/ELT pipelines using Azure Databricks (PySpark, Delta Lake) and Azure Data Factory (ADF).

2. Data Flows & Transformations: Develop pipelines, data flows, and complex transformations with ADF, PySpark, and T-SQL for seamless data extraction, transformation, and loading.

3. Data Processing: Develop Databricks Python notebooks for tasks such as joining, filtering, and pre-aggregation.

4. Database & Query Optimization: Optimize database performance through SQL query tuning, index optimization, and code improvements to ensure efficient data retrieval and manipulation.

5. SSIS & Migration Support: Maintain and enhance SSIS package design and deployment for legacy workloads; contribute to migration and modernization into cloud-native pipelines.

6. Collaboration & DevOps: Work with cross-functional teams using Git (Azure Repos) for version control and Azure DevOps pipelines (CI/CD) for deployment.

7. Data Governance & Security: Partner with governance teams to integrate Microsoft Purview and Unity Catalog for cataloging, lineage tracking, and role-based security.

8. API & External Integration: Implement REST APIs to retrieve analytics data from diverse external data feeds, enhancing accessibility and interoperability.

9. Automation: Automate ETL processes and database maintenance tasks using SQL Agent Jobs, ensuring data integrity and operational reliability.

10. Advanced SQL Expertise: Craft and optimize complex T-SQL queries to support efficient data processing and analytical workloads.

Requirements

11. 5+ years of hands-on experience with Azure Databricks, Python, PySpark, and Delta Lake.

12. 5+ years of proven experience with Azure Data Factory for orchestrating and monitoring pipelines.

13. Strong SQL Server / T-SQL skills with a focus on query optimization, indexing strategies, and coding best practices.

14. Demonstrated experience in SSIS package design, deployment, and performance tuning.

15. Hands-on knowledge of Unity Catalog for governance.

16. Experience with Git (Azure DevOps Repos) and CI/CD practices in data engineering projects.

Nice to Have

17. Exposure to Change Data Capture (CDC), Change Data Feed (CDF), and Temporal Tables.

18. Experience with Microsoft Purview, Power BI, and Azure-native integrations.

19. Familiarity with Profisee Master Data Management (MDM).

20. Working in Agile/Scrum environments.

Preferred Qualifications

21. Microsoft Certified: Azure Data Engineer Associate (DP-203)

22. Microsoft Certified: Azure Solutions Architect Expert or equivalent advanced Azure certification

23. Databricks Certified Data Engineer Associate or Professional

24. Additional Microsoft SQL Server or Azure certifications demonstrating advanced database and cloud expertise

Benefits

We offer a comprehensive benefits package, including competitive salary, opportunities for growth and development, and a dynamic work environment.

Se candidatar

Criar um alerta

Salvar