Enterprise Data Solutions Architect
We seek an accomplished Data Architect with DataWarehouse expertise to spearhead the design and implementation of a new data warehouse instance for a major product line. This role involves architecting scalable pipelines, optimizing lakehouse performance, and integrating seamlessly with diverse real-time and batch data sources across the cloud.
The ideal candidate is passionate about data architecture, thrives in fast-paced environments, and has a proven track record of setting up high-performance lakehouse platforms on Databricks with a strong foundation in data warehousing principles.
* Design and deploy a tailored Databricks Lakehouse instance to meet the client's product-level data needs.
* Implement robust data ingestion pipelines using Spark (PySpark/Scala) and Delta Lake.
* Integrate AWS-native services (S3, Glue, Athena, Redshift, Lambda) with Databricks for optimized performance and scalability.
* Define data models, optimize query performance, and establish warehouse governance best practices.
* Collaborate cross-functionally with product teams, data scientists, and DevOps to streamline data workflows.
* Maintain CI/CD, preferably DBX, for data pipelines using GitOps and Infrastructure-as-Code.
* Monitor data jobs and resolve performance bottlenecks or failures across environments.