Cloud Infrastructure SRE / Tier 4 Engineer
As a Cloud Infrastructure SRE / Tier 4 Engineer, you will join a specialized engineering task force dedicated to preventing and resolving the most critical and strategic customer issues encountered in the field.
Our global team of experienced engineers has a deep understanding of the lower layers of private cloud infrastructure. Together, we work on practical solutions, continuously learning and adapting. We collaborate closely, share knowledge, and tackle challenges head-on.
If you're an engineer eager to understand the core of cloud technologies and looking for a team that values hands-on expertise, you'll fit right in with us.
What you will learn and contribute to
* Maintain and enhance Red Hat private cloud: You’ll work extensively with Nokia Container Services (NCS) and CloudBand Infrastructure Software (CBIS), private cloud solutions based on Kubernetes and OpenStack .
* Deep Dive Troubleshooting: Starting from high-level Kubernetes error messages, you’ll navigate through multiple layers until pinpointing issues—even down to kernel-level bugs.
* Automate and Develop: Spend 30–50% of your time developing automation to prevent recurring issues. Solve it once, automate for the future. Use your Python skills to create health checks and improve reliability.
* Continuous Learning: In our rapidly evolving tech landscape, ongoing learning is a cornerstone.
* Collaborate with Developers and Engineers: Work closely with developers and product engineers to bridge infrastructure and software, ensuring seamless product delivery.
* Mode of Operation: This role requires participation in a follow-the-sun scheme with 12-hour shifts (within regulatory limits) or participation in on-call rotations to ensure business continuity during emergencies.
Your skills and experience
* Linux Expertise: Strong familiarity with Linux distributions; we primarily use Red Hat and CentOS .
* Networking Foundations: Solid understanding of networking fundamentals (VLANs, IP routing). Experience with Calico, Multus, and Open vSwitch is a plus.
* Problem-Solving Mindset: Excellent troubleshooting skills and analytical thinking to address complex challenges.
* Scripting and Automation: Proficiency in Bash and Python, or willingness to learn, plus familiarity with Ansible .
* Containerization & Virtualization: Knowledge of Podman, Kubernetes, Helm, and/or OpenStack ; experience with KVM/QEMU is advantageous.
* Storage Systems: Experience with Ceph and Rook is highly valued.
* Database Expertise: Understanding of relational databases (MySQL, MariaDB ) and experience with etcd .
* Monitoring and Logging: Familiarity with tools like ELK (Elasticsearch, Logstash, Kibana), Prometheus, and Grafana .
* Advanced written and spoken English.
It would be nice if you also had
* Proactive thinking and ownership mindset.
* Strong focus on quality and reliability.
* Passion for delivering training or knowledge-sharing sessions to operations teams.
Igualdade & Oportunidade para Todos
Representando 165 nacionalidades em todo o mundo, nos orgulhamos de ser um empregador que oferece igualdade de oportunidades, comprometido em fornecer oportunidades iguais de emprego a todos os candidatos e funcionários, independentemente de raça, religião, sexo, cor, idade, nacionalidade, gravidez, orientação sexual, deficiência física ou informações genéticas, ou qualquer outra classificação protegida, de acordo com as leis federais, estaduais e/ou locais.