Reliability Infrastructure Specialist
We are seeking a highly skilled professional to develop and maintain high-performing, scalable, and secure OpenShift/Kubernetes clusters.
* Create and manage automated workflows for cluster maintenance.
* Design and implement tools to enhance operational efficiency.
* Configure and maintain supporting infrastructure applications.
* Monitor and resolve cluster and infrastructure service issues promptly.
* Manage on-premises and cloud-based infrastructure and services.
* Detect and resolve problems in OpenShift and/or Kubernetes clusters efficiently.
* Establish metrics to evaluate service performance and health.
The ideal candidate will possess 5+ years of experience in reliability engineering, strong Linux administration skills, automation experience with Python or equivalent, and expertise in installing, managing, maintaining, and troubleshooting OpenShift/Kubernetes clusters.