Job Opportunity
We are seeking a proactive problem solver with a passion for automation and continuous improvement.
* Manage the daily health, performance, and availability of monitoring tools (Splunk, DynaTrace, NewRelic).
* Perform routine maintenance, upgrades, and configuration tuning to ensure optimal system performance.
* Triage and resolve monitoring-related incidents and service tickets in a timely manner.
* Collaborate with teams to integrate monitoring solutions and improve visibility.
The ideal candidate will have:
* Hands-on experience with Splunk, DynaTrace, and NewRelic in production environments.
* A strong understanding of IT operations, incident management, and ticketing systems.
* Proficiency in scripting languages (e.g., Python, PowerShell, Bash) for automation and tool integration.
* Familiarity with cloud platforms (AWS, Azure, or GCP) and containerized environments (Kubernetes, Docker).
* Excellent troubleshooting skills.
* Strong written and verbal communication skills.