Site Reliability Engineer at Guidewire Software

Summary

The Opportunity

We are searching for a Site Reliability Engineer eager for a rare chance to transform insurance with the industry's leading cloud platform. As a member of the SRE-Application team, you'll be responsible for building and evolving our SRE practice for the applications running on our Guidewire Cloud Platform. This is an opportunity to apply your expertise in automation, software engineering, and operational discipline to ensure the reliability, performance, and scalability of our cloud-based solutions.

Job Description

What You'll Do

- Collaborate with development teams to troubleshoot and solve problems, reducing customer impact.
- Develop automated runbooks and implement measures to handle issues proactively.
- Apply sound engineering principles and mature automation to our operating environments.
- Monitor, maintain, and enhance the reliability and performance of applications on our Guidewire Cloud Platform.
- Leverage your automation and software engineering expertise to optimize systems and eliminate toil.
- Document and examine incidents to improve processes and continuously prevent future occurrences.
- Stay up-to-date with the latest industry trends, tools, and best practices in site reliability engineering.
- Contribute to a culture of innovation, learning, and continuous improvement.

What You'll Bring

- Proven experience as an SRE or similar role, with a track record of improving system reliability
- Strong problem-solving skills and the ability to analyze complex systems and devise effective solutions
- Excellent collaboration and communication abilities to work cross-functionally and clearly document processes
- Experience with automation, monitoring, and performance optimization tools and techniques
- Dedication to maximizing uptime, scalability, and delivering an exceptional end-user experience
- A passion for technology and a strong desire to continuously learn and grow your skills
- Alignment with Guidewire's mission to leverage technology to help protect and support others

Required Skills & Experience

- Proven experience leveraging application performance monitoring (APM) and telemetry tools to troubleshoot and diagnose problems
- Proven experience triaging and debugging distributed systems on cloud infrastructure
- Proven experience in designing and engineering CI/CD pipelines within Kubernetes (K8S) and legacy ecosystems
- Proven experience in designing and engineering monitors, dashboards, and synthetic transactions in Datadog
- Proven experience in building, deploying, and running scalable infrastructure within AWS and Kubernetes ecosystems using Terraform and other cloud-native approaches
- Proven experience in managing infrastructure configuration at scale using multiple approaches and/or tools such as GitOps, Puppet, or Ansible
- Good understanding of AWS cloud networking and security with hands-on experience remediating infrastructure vulnerabilities at scale
- Good understanding of SLIs, SLOs, and Error Budgets
- Comfortable with Linux system administration, with the ability to program/script using Python, Go, Java, shell, or equivalent
- Participate in mandatory on-call rotations to ensure service availability and reliability, responding to incidents and alerts outside regular hours, including weekends and holidays.
  Candidates must be willing and able to fulfill this critical responsibility.

Preferred Skills

- SRE certified in multiple categories
- AWS certified in multiple categories
- Proficiency with SQL, database administration, data pipelines, performance tuning, and schema design
- Proficiency with multiple pipelining tools such as TeamCity, Bitbucket Pipelines, Jenkins, and GitHub Actions
- Familiarity with open-source distributed data processing frameworks such as Hadoop, Apache Spark, AWS Redshift, etc.

About Guidewire

Guidewire is the platform P&C insurers trust to engage, innovate, and grow efficiently. We combine digital, core, analytics, and AI to deliver our platform as a cloud service. More than 540 insurers in 40 countries, from new ventures to the largest and most complex in the world, run on Guidewire.

As a partner to our customers, we continually evolve to enable their success. We are proud of our unparalleled implementation track record with 1600+ successful projects, supported by the largest R&D team and partner ecosystem in the industry. Our Marketplace provides hundreds of applications that accelerate integration, localization, and innovation.

For more information, please visit www.guidewire.com and follow us on Twitter: @Guidewire_PandC.

Guidewire Software, Inc. is proud to be an equal opportunity and affirmative action employer. We are committed to an inclusive workplace, and believe that a diversity of perspectives, abilities, and cultures is a key to our success. Qualified applicants will receive consideration without regard to race, color, ancestry, religion, sex, national origin, citizenship, marital status, age, sexual orientation, gender identity, gender expression, veteran status, or disability.
All offers are contingent upon passing a criminal history check and other background checks where applicable to the position.

Seniority level: Not Applicable
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Software Development