AgileEngine is an Inc. 5000 company that creates award‑winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in application development, AI/ML, and our people‑first culture has earned us multiple Best Place to Work awards.
WHY JOIN US
If you’re looking for a place to grow, make an impact, and work with people who care, we’d love to meet you!
ABOUT THE ROLE
As a Site Reliability Engineer (SRE), you will drive the reliability, scalability, and performance of cloud‑native systems, enabling engineering teams to deliver with confidence. Working across AWS, Kubernetes, and modern DevOps practices, you’ll automate infrastructure, enhance observability, and support seamless deployments. This role offers strong ownership and cross‑team collaboration, with the opportunity to shape SRE and DevSecOps practices while improving system resilience at scale.
WHAT YOU WILL DO
Design, build, and deploy solutions to improve system reliability, scalability, and operational efficiency.
Build and maintain CI/CD pipelines and deployment automation.
Work with product teams to support application deployments and infrastructure requirements.
Improve system reliability through root cause analysis, post‑mortems, and automation.
Implement and maintain monitoring, logging, and alerting systems.
Support security scanning and DevSecOps practices.
Automate operational and support tasks to reduce manual work.
Assist support teams in troubleshooting infrastructure and deployment issues.
Promote SRE and DevSecOps best practices across engineering teams.
Provide after‑hours support when necessary for critical incidents.
MUST HAVES
8+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
Strong experience with AWS cloud infrastructure and architecture.
Strong experience with Infrastructure as Code using Terraform or AWS CloudFormation.
Experience building and maintaining CI/CD pipelines.
Experience with Git and GitLab in multi‑team environments.
Experience with containers and Kubernetes (EKS or similar).
Experience with monitoring, logging, and observability tools such as Grafana.
Strong scripting skills in Linux or Windows environments.
Strong communication skills and ability to work across engineering teams.
Upper‑intermediate English level.
NICE TO HAVES
AWS certifications.
Experience with Artifactory or artifact repositories.
Experience with DevSecOps and security scanning tools.
Experience with APM and infrastructure monitoring tools.
Experience improving automation and operational workflows.
Experience mentoring engineers or promoting best practices across teams.
PERKS AND BENEFITS
Professional growth: Mentorship, TechTalks, and personalized growth roadmaps.
Competitive compensation: USD‑based pay with education, fitness, and team activity budgets.
Exciting projects: Modern solutions with Fortune 500 and top product companies.
Flextime: Flexible schedule with remote and office options.
#J-18808-Ljbffr