What are the responsibilities and job description for the Linux Systems Engineer position at ClearSky Technologies, a CTS Company?
Job Description
This role has 'full-stack' authority and responsibility across all aspects of the company's data center and customer-premises computer equipment, including hardware, host OS, hypervisor, guest OS, orchestration, applications, and monitoring.
Key Responsibilities
● Support physical and virtual computing environments in a Linux-based data center
● Red Hat/CentOS/Rocky – Administration, troubleshooting, and security of CentOS 7 and 9
● Linux iSCSI – Attaching, disconnecting, and manipulating iSCSI block devices using iscsiadm, LVM, and multipath
● Linux KVM virtualization – Creating, copying, managing, and monitoring virtual guests
● MySQL database – Administration, replication, clustering, backups, and SQL queries
● IP networking – IP addressing and subnetting, network address translation, routing protocols, Linux iptables, Linux bridging
● Data munging – Extracting useful information from various sources, including log files and databases, then presenting the data in a machine- or human-readable format
● Participate in a 24x7 on-call rotation
● Develop Knowledge Base articles and Standard Operating Procedures
● Investigate, diagnose and resolve production issues as logged by monitoring tools or as reported by customers
Qualifications
● The ideal candidate has at least 10 years of experience supporting physical and virtual network and computing environments in a Linux-based data center
● Relevant experience would include physical and virtual server implementation, operation, and maintenance; SAN, NFS, LVM, iSCSI, Linux-HA, DevOps, 24x7x365 operations; HP ProLiant, HPE Synergy, Dell, and Supermicro hardware
● Proficient at troubleshooting production issues, solving problems, and implementing updates to production software
● Understanding of internet protocols including HTTP, DNS, RADIUS, SSH, and SMTP
● Ability to analyze packet captures to distinguish protocol-level problems from network connectivity problems
● Experience with writing and troubleshooting Perl, Python, or Linux shell scripts required
● Experience with OpenNebula virtualization management helpful
● Experience with Cacti, Nagios, Prometheus, Grafana monitoring systems strongly desired
● Experience with virtualization technologies (Linux KVM preferred) strongly desired
● Experience with MySQL administration, clustering, replication strongly desired
● Experience with container technologies and supporting tools (e.g., Docker, Kubernetes, Rancher)
● Experience with clustering technologies (e.g., Pacemaker)
● Working knowledge of SAN systems, especially in a ZFS environment
● Working knowledge of Java application servers including Tomcat, JBoss, and message brokers
● Good organizational and vendor management skills, inclusive of issue tracking, metric reporting, etc.
● Demonstrated experience in effective decision-making within a high-pressure 24x7 environment
● Strong research, investigation and evaluation skills
● Ability to identify and clearly communicate problems/support requests in order to influence the prioritization of production defects and/or enhancements
● Excellent verbal and written communication skills
● Results-oriented and highly flexible, able to work with minimal supervision
● Inquisitive self-starter with strong attention to detail
Bonus Skills/Knowledge
● ActiveMQ
● Ansible
● Mobile carriers/Messaging
● Deep Packet Inspection products
● Pacemaker, DRBD
● ProxySQL
● Cisco/FortiGate/MikroTik hardware and software, routing, switching, load balancing, firewalls, IPv4, IPv6
● Public Cloud migration/administration
Education
● Four-year technical degree (e.g., BSCS, BSEE) or equivalent workplace experience
Requirements
● Ability to work outside of normal business hours when needed
● Ability to travel as needed, perhaps quarterly