Job Title: Data Warehouse Architect
We are seeking a skilled Data Architect to spearhead the design and implementation of a new data warehouse instance.
This role will involve building scalable pipelines, optimizing lakehouse performance, and integrating with diverse real-time and batch data sources across AWS.
* Design and deploy a new Databricks Lakehouse instance tailored to product-level data needs.
* Architect and implement robust data ingestion pipelines using Spark (PySpark/Scala) and Delta Lake.
* Integrate AWS-native services (S3, Glue, Athena, Redshift, Lambda) with Databricks for optimized performance and scalability.
* Define data models, optimize query performance, and establish warehouse governance best practices.
* Collaborate cross-functionally with product teams, data scientists, and DevOps to streamline data workflows.
* Maintain CI/CD, preferably DBX for data pipelines using GitOps and Infrastructure-as-Code.
* Monitor data jobs and resolve performance bottlenecks or failures across environments.