Our client is seeking for a SRE/DevOps Engineer to design, build, and maintain the scalable, reliable, and high-performance infrastructure that powers our client's core services. You'll be working for an US-based Internet Service Provider.
Key Responsibilites:
* Architect, deploy, and manage robust cloud infrastructure on AWS and/or GCP, ensuring high availability and fault tolerance.
* Develop and maintain CI/CD pipelines using tools like GitLab CI, Jenkins, or similar, to streamline the software delivery lifecycle.
* Manage and scale containerized applications using Kubernetes and Docker, including orchestration, service mesh, and cluster security.
* Leverage your expertise in Linux administration to manage servers and troubleshoot complex issues. Configure and manage DNS, load balancers, and other core networking components.
* Utilize strong Bash scripting and development skills (e.G., Python, Go) to automate operational tasks and build essential tooling.
* Implement and manage Application Performance Monitoring (APM) and observability stacks (e.G., Prometheus, Grafana, ELK Stack, Datadog) to ensure proactive issue detection and resolution.
* Participate in an on-call rotation to respond to and resolve critical production incidents.
Required Qualifications
* Minimum of 5 years in a professional SRE, DevOps, or Systems Engineering role.
* Hands-on experience with AWS or GCP, including core services (EC2, S3, VPC, IAM, GKE, GCE, etc.).
* Proven mastery of Kubernetes and Docker in production environments.
* Knowledge of Linux administration and a solid understanding of networking principles (DNS, TCP/IP, HTTP, Load Balancing).
* CI/CD & Automation
* Advanced or fluent English