Cloud Infrastructure Specialist
As a Cloud Infrastructure Specialist, you will design and implement scalable cloud platform components in AWS using Terraform for Infrastructure as Code.
* Build secure multi-account AWS environments following Well-Architected principles to ensure high availability and reliability of our cloud infrastructure.
* Develop internal platform tooling, automation, templates, and golden paths to streamline our cloud operations.
* Support EKS/Kubernetes environments and microservice deployment foundations to enable efficient application development and deployment.
Event-Driven & Distributed Architecture:
* Design and build Kafka-based event streaming and messaging capabilities to support real-time data processing and communication.
* Ensure idempotent service patterns and high availability through robust monitoring and alerting systems.
Observability, Reliability & Performance:
* Implement comprehensive monitoring, logging, and distributed tracing using Datadog/Grafana/ELK to gain insights into our cloud infrastructure performance.
* Meet SLAs/SLOs for platform components by ensuring timely identification and resolution of issues.
Security, Governance & Compliance:
* Enforce best practices for IAM, secrets management, encryption, and network security to protect our cloud assets from threats.
* Develop guardrails, policies, and automated compliance into CI/CD and IaC workflows to ensure adherence to regulatory requirements.
Collaboration & Technical Leadership:
* Partner with architects, platform leads, and product teams to translate business requirements into technical designs that meet our needs.
* Guide developers on best practices for cloud-native builds and provide mentorship on IaC, automation, and platform engineering patterns.