Job Title: Cloud Infrastructure Specialist
We're seeking a highly skilled individual to design, build, and automate the core infrastructure that powers high-scale financial transactions.
This role sits at the heart of our cloud platform team, focusing on designing scalable, resilient cloud platform components in AWS.
You'll work in a highly technical environment, collaborating with architects, backend engineers, and security teams to deliver secure, cloud-native foundations.
Key Responsibilities:
* Design and implement modular, production-grade IaC using Terraform.
* Own multi-account AWS environments following Well-Architected principles.
* Develop internal platform tooling, automation, templates, and golden paths.
* Support EKS/Kubernetes environments and microservice deployment foundations.
Event-Driven Architecture:
* Build and support Kafka-based event streaming and messaging capabilities.
* Ensure high availability, reliability, and idempotent service patterns.
* Work with engineers to define infrastructure for distributed, real-time systems.
Observability, Reliability & Performance:
* Implement monitoring, alerting, logging, and distributed tracing (Datadog/Grafana/ELK).
* Ensure SLAs/SLOs for platform components are met.
* Contribute to disaster recovery design, multi-region readiness, and resilience testing.
Security & Compliance:
* Enforce best practices for IAM, secrets management, encryption, and network security.
* Build guardrails, policies, and automated compliance into CI/CD and IaC workflows.
* Contribute to platform risk assessments and mitigation strategies.
Collaboration & Technical Leadership:
* Partner with architects, platform leads, and product teams to translate requirements into technical designs.
* Work with developers to guide best practices for cloud-native builds.
* Mentor engineers in IaC, automation, and platform engineering patterns.
Requirements:
* 5+ years in Platform Engineering, Cloud Engineering, or Infrastructure Engineering.
* Strong experience with AWS (multi-account, networking, security, core services).
* Excellent Terraform skills, building modular and production-grade IaC.
* Solid understanding of Kubernetes/EKS or similar orchestration platforms.
* Experience with Kafka or other event-streaming technologies.
* Hands-on with CI/CD tooling (GitHub Actions, GitLab CI, ArgoCD).
* Strong understanding of monitoring, distributed tracing, and observability.
* Ability to work autonomously in fast-paced, cloud-native engineering teams.
Key Skills: Cloud Infrastructure, DevOps, Security, Compliance, Automation, Observability, Event Streaming