System Architect
We are seeking a proactive problem solver with a passion for automation, operational excellence, and continuous improvement.
Key Responsibilities:
* Ensure the optimal performance of enterprise monitoring tools (Splunk, DynaTrace, NewRelic).
* Perform routine maintenance, upgrades, and configuration tuning to achieve system stability.
* Triage and resolve monitoring-related incidents and service tickets in a timely and efficient manner.
* Collaborate with application, infrastructure, and DevOps teams to integrate monitoring solutions and improve visibility.
* Develop and maintain dashboards, alerts, and reports to support operational needs.
* Participate in on-call rotations and incident response efforts.
* Document operational procedures, runbooks, and knowledge base articles.
* Identify and implement automation opportunities to reduce manual effort and enhance reliability.