What are the responsibilities and job description for the Window Server Engineer (Locals to NY or NJ only) position at SANS?
The principle objective of this position is to architect and provide expert level support and strategy for the production Windows server environment. Maintain proactive performance monitoring of the environment utilizing specific monitoring tools. Play an active role in the day-to-day support activities, administration and projects associated with windows infrastructure/applications and maintain all the Windows servers.
Position requirements:
- 10 plus years of experience preferred.
- Expert knowledge of Windows OS and Windows Enterprise components and technologies such as Domains, DNS, Registries, DHCP/WINS.
- Strong experience with automation tools like Ansible and powershell.
- Expert knowledge of Microsoft cluster technologies and strategies: server cluster, network and component load balancing
- Strong knowledge of Active Directory
- Good knowledge of VMWare vSphere and associated products (eg vCenter, SRM);
- Hand- on experience in Azure Cloud Infrastructure in hybrid environments.
- Strong knowledge of performance management and capacity planning.
- Strong knowledge of the SAN / EMC storage including the different SRDF types
- Strong knowledge of installing Windows OS on industry standard hardware (e.g. Client/IBM/Dell -including blade servers)
- Knowledge and experience of IIS, .NET, Knowledge of F5 are a big plus
- Working knowledge of Dynatrace monitoring is a plus.
- Good working knowledge of networking functions (i.e. function of TCP/IP, SNMP) subnetting and routing.
- Good communication and interpersonal skills
- Project Management and Documentation skills
- Service oriented, positive, committed and enthusiastic team player.
- Will be required to work weekends & on an on-call rotation 24*7
- Proficient in English.
Position Responsibilities:
- Responsible for supporting a trading environment consisting of Windows servers & to be actively involved in the planning & implementation of new projects.
- Develop and build automation scripts using Ansible to provision, deploy, perform patching and reduce manual operational work.
- Integrate AI / ML driven operational tools to improve monitoring, performance tuning and capacity planning.
- Build & support Microsoft Failover clusters & also work with stretch clusters.
- Interface with Vendors and Internal Global Teams to ensure the environment conforms to the current standards.
- Manage Business Continuance in relation to Disaster Recovery.
- Maintain System high reliability, availability and resiliency levels.
- Provide 1st, 2nd and 3rd Level Support for the production Environments
- Proactive System Performance Monitoring (using Geneos, Dynatrace Client/DELL/IBM) and capacity management
- Perform system upgrades and mandatory security/OS patching.
- Liaise with application support teams to gather requirements, design, and deploy standard systems
- Project implementation for all business lines.
- Provide 24*7 On call support based on rotation within the team
- P erform thorough morning checks on all critical applications using existing tools or scripts.
- Strictly abide and follow the company policy and procedures pertaining to change and incident management.
- Responsible for ensuring that all systems are deployed consistent with group standards and best practices.
- Incident and Problem management: take ownership and work with team to resolve production related issues.
- Collaborate with security teams to ensure compliance, hardening and vulnerability remediations across Windows platforms.
- Maintain an accurate and current inventory.