We are looking for a Linux Subject Matter Expert (SME) to join our team and take ownership of Incident, Change, Problem, Knowledge, Availability, and Capacity Management for Linux-based environments. This role involves managing L2/L3 support activities, performing installations, patching, and migrations, as well as scripting and mentoring team members. The ideal candidate will bring solid technical expertise, excellent communication skills, and a proactive approach to operational excellence.
Key Responsibilities:
* Manage L2/L3 production and project support for Linux systems.
* Perform system installations, updates, patching, and migrations.
* Develop and maintain scripts using Shell and Perl.
* Lead and support ITIL-based processes including Incident, Change, and Problem Management.
* Provide routine operational support and troubleshooting.
* Coach and mentor team members to enhance technical capabilities.
* Collaborate with stakeholders and participate in technical meetings.
* Work flexible hours, including weekends, as needed.
Required Skills and Qualifications:
* Proven expertise in Linux systems administration and related components.
* Solid experience with ITIL-based tools and service management practices.
* Strong knowledge of file system and volume management, RAID, and storage integration.
* Familiarity with hardware troubleshooting and vendor support processes.
* Deep understanding of system and network configuration, including TCP/IP, VLANs, SSH/SSL, and system hardening.
* Experience with clustering (PCS), DNS, NIS, Proxy, and LDAP.
* Proficiency in shell scripting and automation.
* Understanding of SAN/NAS storage environments.
* Experience with performance tuning and monitoring tools (e.g., top, Grafana).
* Knowledge of user and system management, cron jobs, system dumps, and package management.
* Familiarity with OS installation tools (e.g., Kickstart) and backup/restore procedures.
* Strong analytical and troubleshooting skills.
* Excellent communication and customer interaction abilities.
* Ability to optimize systems for availability, performance, and reliability.
* Exposure to other UNIX-based operating systems (HP-UX, Solaris, AIX) is a plus.
* Knowledge of cloud platforms (AWS, Azure, GCP) is an advantage.
* Linux certification is a plus.
* Languages: advanced English and Portuguese.
Technologies & Tools:
* Operating Systems: Linux, HP-UX, Solaris, AIX
* Tools: Remedy, Grafana, Word, PowerPoint, Visio, Excel
* Hardware: Dell, HP, IBM, Oracle, Lenovo, VMware
* Storage: EMC, NetApp, Hitachi
* Backup: TSM, Veeam
* Services: Apache, DNS, Proxy, DHCP, LDAP
* Clustering: Veritas Cluster