Job Title:
Data Scientist
-----------------------------------
Description:
As a data professional, you will lead the design and implementation of an AWS Serverless DataLake architecture to efficiently manage large volumes of data and support various data processing workflows.
You will develop data ingestion pipelines and integration processes to ensure smooth and reliable transfer of data from various sources into the DataLake. You will implement data transformation and enrichment processes using AWS Lambda, Glue, or similar serverless technologies to ensure data quality and consistency.
You will collaborate with data scientists and analysts to understand their data requirements and design appropriate data models and schemas in the DataLake. You will optimize storage and retrieval mechanisms, leveraging AWS services such as S3, Athena, Redshift, or DynamoDB, to provide high-performance access to the data.
You will continuously evaluate new AWS services and technologies to enhance the DataLake architecture, improve data processing efficiency, and drive innovation. You will mentor and provide technical guidance to junior team members, fostering their growth and ensuring adherence to best practices. You will work with cross-functional teams to understand business requirements, prioritize tasks, and deliver high-quality solutions within defined timelines.
Key Responsibilities:
* Design and implement an efficient AWS Serverless DataLake architecture
* Develop data ingestion pipelines and integration processes
* Implement data transformation and enrichment processes
* Collaborate with data scientists and analysts to understand data requirements
* Optimize storage and retrieval mechanisms
* Evaluate new AWS services and technologies
* Mentor junior team members