We are seeking a skilled Data Scientist to develop and maintain data pipelines for threat intelligence ingestion, validation, and export automation flows.
Responsibilities
* Design and develop data pipelines for ingesting threat intelligence data from various sources into our data ecosystem.
* Implement data validation processes to ensure data accuracy, completeness, and consistency.
* Collaborate with threat analysts to understand data requirements and design appropriate solutions.
* Develop automation scripts and workflows for data export processes to external systems or partners.
* Optimize and enhance existing data pipelines for improved performance and scalability.
* Monitor data pipelines and troubleshoot issues as they arise, ensuring continuous data availability and integrity.
* Document technical specifications, data flows, and procedures for data pipeline maintenance and support.
* Stay updated on emerging technologies and best practices in data engineering and incorporate them into our data ecosystem.
* Provide technical guidance and support to other team members on data engineering best practices and methodologies.
Requirements
* Proven experience as a Data Engineer or similar role, with a focus on data ingest, validation, and export automation.
* Strong proficiency in Python.
* Experience with data pipeline orchestration tools such as Apache Airflow, Apache NiFi, or similar.
* Familiarity with cloud platforms such as Snowflake, AWS, Azure, or Google Cloud Platform.
* Experience with data validation techniques and tools for ensuring data quality.
* Experience building and deploying images using containerization technologies such as Docker and Kubernetes.
* Excellent problem-solving skills and attention to detail.
* Strong communication and collaboration skills, with the ability to work effectively in a team environment.
A successful candidate will have a strong background in data engineering, with experience in designing, developing, and maintaining data pipelines. They will be proficient in Python and have experience with data pipeline orchestration tools. Additionally, they will have familiarity with cloud platforms and experience with data validation techniques and tools.
The ideal candidate will be able to optimize and enhance existing data pipelines, monitor data pipelines, and troubleshoot issues as they arise. They will also be able to document technical specifications, data flows, and procedures for data pipeline maintenance and support.
This is an excellent opportunity for a motivated and experienced Data Engineer to join our team and contribute to the development of our data ecosystem.