As a key player in our organization, you will be responsible for ensuring the reliability and efficiency of our cloud-based systems.
Job Description:
The Site Reliability Engineer plays a critical role in guaranteeing the stability, scalability, and security of our infrastructure. This involves designing and implementing robust automation solutions using Terraform across multi-cloud environments (Azure, GCP, OCI).
Additionally, this position requires developing and maintaining Continuous Integration/Continuous Deployment (CI/CD) pipelines using Jenkins, Groovy, and Git. Automation of infrastructure provisioning, configuration management, and deployments using Ansible, Python, and Bash is also essential.
Collaboration with cloud architects, developers, and security teams to ensure infrastructure meets performance, scalability, and security requirements is vital. The engineer must maintain and optimize infrastructure codebases, ensuring modularity, reusability, and compliance with best practices.
Key Responsibilities:
• Design and implement IaC solutions using Terraform across Azure, GCP, and OCI.
• Develop and maintain CI/CD pipelines using Jenkins, Groovy, and Git.
• Automate infrastructure provisioning, configuration management, and deployments using Ansible, Python, and Bash.
• Collaborate with cloud architects, developers, and security teams to ensure infrastructure meets performance, scalability, and security requirements.
• Maintain and optimize infrastructure codebases.
Required Skills & Qualifications:
This position requires:
• A minimum of 8 years of experience in infrastructure automation and DevOps.
• Strong hands-on experience with Terraform for Infrastructure as Code (IaC) across Azure, GCP, and OCI.
• Proficiency in Jenkins pipeline development using Groovy.
• Solid experience with Ansible for configuration management and orchestration.
• Strong scripting skills in Python and Bash.
• Experience with Artifactory for artifact management.
• Proficient in Git for version control and collaboration.
• Good understanding of container orchestration platforms such as AKS (Azure Kubernetes Service), GKE (Google Kubernetes Engine), and OKE (Oracle Kubernetes Engine).
• Good understanding of networking concepts across cloud platforms (VPCs, subnets, firewalls, load balancers, DNS).
• Basic knowledge of database technologies (e.g., PostgreSQL, MySQL, Oracle DB) and their deployment in cloud environments.
• Familiarity with cloud-native services and infrastructure patterns.
• Excellent problem-solving, communication, and collaboration skills.