Role Summary
This critical role is the backbone of the technology infrastructure, ensuring 24/7/365 operational excellence. You will be responsible for detecting and resolving incidents, executing data center isolations, and supporting large-scale application migrations while minimizing business disruptions.
Key Responsibilities
- Incident Management: Monitor systems using Splunk and Dynatrace to detect anomalies and resolve infrastructure incidents before they escalate.
- Batch Processing: Oversee and execute production batch processing using CA7 and Autosys platforms
- Infrastructure Strategy: Support data center isolation activities and application migrations between on-prem environments and AWS.
- Release & Deployment: Manage release processes across applications, infrastructure, and networks using ServiceNow and CloudBees.
- Documentation: Maintain accurate operational runbooks, incident records, and post-incident reports.
Required Qualifications & Skills
- Experience: 3+ years of relevant technical experience in large-scale enterprise infrastructure operations
- Technical Core: Proven expertise in Autosys job execution, ServiceNow, and CloudBees pipeline deployment.
- Monitoring & Cloud: Proficiency with Splunk, Dynatrace, and OpenShift container maintenance. - Legacy Systems: Solid foundation in IBM Mainframe z/OS and CA7 batch execution.
- Education: University degree in Computer Science, IT, or a related field preferred. - Soft Skills: Strong analytical problem-solving skills and the ability to work independently under pressure. Bonus Skills - Experience with Microsoft Power Platform and SharePoint. - Familiarity with ITIL frameworks and change management processes.