Job Overview
We are seeking a skilled Infrastructure Monitoring Specialist to join our team.
The ideal candidate will have experience with enterprise monitoring tools, including Splunk, Dynatrace, and New Relic. They will be responsible for keeping these tools performing optimally and for resolving related incidents promptly.
In this role, you will work closely with application, infrastructure, and DevOps teams to integrate monitoring solutions and improve visibility across the organization.
Key Responsibilities:
* Maintenance and Upgrades: Perform routine maintenance, upgrades, and configuration tuning to keep the monitoring platforms performing optimally.
* Incident Resolution: Triage and resolve monitoring-related incidents and service tickets promptly and efficiently.
* Solution Integration: Collaborate with cross-functional teams to integrate monitoring solutions and improve visibility.
* Dashboards and Reports: Develop and maintain dashboards, alerts, and reports to support operational and business needs.
Requirements:
* Experience: 5+ years in systems engineering or enterprise monitoring roles.
* Technical Skills: Hands-on experience with Splunk, Dynatrace, and New Relic in production environments.
* Operational Knowledge: Strong understanding of IT operations, incident management, and ticketing systems.
* Scripting: Proficiency in scripting languages (e.g., Python, PowerShell, Bash) for automation and tool integration.
* Cloud and Containerization: Familiarity with cloud platforms (AWS, Azure, or GCP) and containerized environments (Kubernetes, Docker).