At Decskill, we're seeking an experienced Site Reliability Engineer (SRE) to join our team on a remote project in Brazil. As an SRE, you will play a vital role in ensuring the reliability and performance of our systems.
Key Responsibilities:
* Collaborate closely with internal and external stakeholders, developers, and managers to ensure timely deliverables.
* Maximize the capabilities of our Azure platform for business applications.
* Continuously evolve and enhance our identity management systems.
* Prioritize the observability of our Azure environments.
* Maintain cost efficiency and FinOps control of the platform.
* Implement SRE and DevOps practices with a focus on Infrastructure as Code, automation, and scalability.
* Design, develop, and implement new features to foster a CI/CD mindset, optimizing the software development lifecycle from development to production.
* Analyze, troubleshoot, and resolve complex incidents, ensuring they do not recur.
* Implement code review and testing mechanisms to continuously improve quality.
* Ensure robust security practices are integrated into all aspects of the infrastructure and identity management systems.
* Build and maintain secure, scalable, and resilient infrastructure on Azure.
* Apply best practices for security, compliance, and operational efficiency across all systems.
Requirements:
* Proven experience with Azure and its ecosystem in a production environment.
* Familiarity with identity management technologies, particularly Azure Active Directory and EntraID.
* Solid understanding of observability best practices, particularly in Azure environments.
* Proficiency in using code management tools and repositories.
* Experience with public cloud platforms.
* Expertise in Infrastructure as Code, particularly with Terraform.
* Strong autonomy and critical thinking skills, with a focus on collaboration.
* Proven ability to work effectively in cross-functional teams.
* Excellent communication skills (written, listening, and speaking), a collaborative mindset, and a proactive attitude.
* Fluency in English (mandatory).
Additional Preferred Skills:
* Proficiency in using GitLab and GitLab CI/CD.
* Strong knowledge of observability stacks, including monitoring, logging, and tracing.
* Experience working with Agile methodologies.
* Experience with security frameworks and best practices in cloud environments.
* Certified Azure Administrator (AZ-104) certification.