We are seeking a skilled professional to take on the role of Senior System Engineer.
Responsibilities:
* Monitor and maintain the daily health, performance, and availability of enterprise monitoring tools (Splunk, DynaTrace, NewRelic).
* Perform routine maintenance, upgrades, and configuration tuning to ensure optimal system performance.
* Triage and resolve monitoring-related incidents and service tickets in a timely and efficient manner.
* Collaborate with application, infrastructure, and DevOps teams to integrate monitoring solutions and improve visibility.
* Develop and maintain dashboards, alerts, and reports to support operational and business needs.
* Participate in on-call rotations and support incident response efforts.
* Document operational procedures, runbooks, and knowledge base articles.
* Identify and implement automation opportunities to reduce manual effort and improve reliability.
Required Skills and Qualifications
* 5+ years of experience in systems engineering or enterprise monitoring roles.
* Hands-on experience with Splunk, DynaTrace, and NewRelic in production environments.
* Strong understanding of IT operations, incident management, and ticketing systems (e.g., ServiceNow, Jira).
* Proficiency in scripting languages (e.g., Python, PowerShell, Bash) for automation and tool integration.
* Familiarity with cloud platforms (AWS, Azure, or GCP) and containerized environments (Kubernetes, Docker).
* Excellent troubleshooting skills and a bias for action in high-pressure situations.
* Strong written and verbal communication skills in English; Portuguese is a plus.