Senior DevOps Engineer - Ecosystem
We are seeking a Senior DevOps Engineer to join the Ecosystem team, responsible for the deployment, operation, and reliability of the NaaS platform infrastructure. You will manage the Kubernetes clusters, GitOps pipelines, and infrastructure-as-code, monitoring/alarming frameworks that underpin a multi-environment telecommunications automation platform.
Responsibilities:
* Deploy, operate, monitor, and maintain Kubernetes clusters hosting NaaS components (NetBox, Itential, Paragon, CI/CD workers, operators, ingest, kafka, APIGWs, collectors, databases etc...)
* Design, operate, maintain, monitor GitOps workflows using Flux for environment consistency and automated deployments
* Manage Vault administration for secrets management, and centralised credential lifecycle across all NaaS components
* Develop and maintain Helm charts and (deprecated) Ansible playbooks for component deployment and upgrades
* Implement and operate CI/CD pipelines (GitLab) for build, test, and deployment automation
* Design, implement and maintain homogenous telemetry and observability solutions which provide crucial insights to platform health
Key Skills (Required):
* Linux systems expertise (internals, debugging, performance tuning)
* Kubernetes production operations (multi-namespace, networking, RBAC, resource management, troubleshooting)
* GitOps tooling (Flux or ArgoCD) for declarative infrastructure management
* Infrastructure as Code (Terraform, Ansible, Helm)
* CI/CD pipeline design and operations (GitLab CI)
* Vault (HashiCorp) or similar secrets management platform
* Strong scripting and automation (Python, Bash)
* Multi-cluster Kubernetes management and hybrid-cloud operations
* Elasticsearch/Kafka/InfluxDB cluster management and operations
* Prometheus stack, DynaTrace, APM, Ingest and enrichment of structured logs, tracing, OpenTelemetry
Desirable Skills (Not a must):
* Experience with Juniper Paragon / Routing Director, Itential, or similar network automation platform deployments
* Container image build optimisation and registry management
* PostgreSQL administration and high-availability configuration
* Integration with OpenStack APIs, Cinder, Nova
* Experience with chaos engineering frameworks (LitmusChaos, Chaos Mesh)
* Experience with other container orchestration engines such as Openstack