Job Title: Data Engineer - Machine Learning Infrastructure Specialist
About the Role:
We are seeking a highly skilled Data Engineer with strong MLOps expertise and machine learning modeling experience in the financial domain. In this role, you will be responsible for building robust data pipelines and ML infrastructure to support our payment processing systems, fraud detection algorithms, and financial analytics solutions.
Key Responsibilities:
* Data Pipeline Development: Design, develop, and maintain scalable data pipelines using Python, Airflow, and PySpark to process large volumes of financial transaction data.
* MLOps Infrastructure: Implement and optimize MLOps infrastructure on AWS to automate the full machine learning lifecycle from development to production.
* Deployment Pipelines: Build and maintain deployment pipelines for ML models using SageMaker and other AWS services.
* Collaboration: Collaborate with data scientists and business stakeholders to implement machine learning solutions for fraud detection, risk assessment, and financial forecasting.
* Data Quality and Security: Ensure data quality, reliability, and security across all data engineering workloads.
* Optimization: Optimize data architecture to improve performance, scalability, and cost-efficiency.
* Monitoring and Alerting: Implement monitoring and alerting systems to ensure production ML models perform as expected.
Requirements and Qualifications:
* Experience: 3-5 years of experience in Data Engineering with a focus on MLOps in production environments.
* Proficiency: Strong proficiency in Python programming and data processing frameworks (PySpark).
* Airflow Experience: Experience with workflow orchestration tools, particularly Airflow.
* AWS Expertise: Hands-on experience with the AWS stack, especially SageMaker, Lambda, S3, and other relevant services.
* Machine Learning Knowledge: Working knowledge of machine learning model deployment and monitoring in production.
* Data Modeling: Experience with data modeling and database systems (SQL and NoSQL).
* Financial Services Domain: Knowledge of the financial services or payment processing domain is highly desirable.
* Containerization and CI/CD: Familiarity with containerization (Docker) and CI/CD pipelines.
* Problem-Solving Skills: Excellent problem-solving skills and ability to work in a fast-paced fintech environment.