What are the responsibilities and job description for the Information Technology Operations Engineer position at Gotham Technology Group?
IT Operations / Systems Engineer
Location: New York, NY (Onsite – 3–4 days/week)
Employment Type: Full-Time (Permanent)
Position Overview
The IT Operations / Systems Engineer plays a critical, hands-on role in maintaining and evolving a high-availability hybrid infrastructure environment (Azure on-prem). This individual will be responsible for ensuring system performance, reliability, and scalability across a fast-paced, mission-critical environment.
This role sits at the intersection of systems, cloud, networking, and automation, requiring a strong technical foundation and the ability to troubleshoot complex issues in real time. The ideal candidate thrives under pressure, takes ownership, and proactively drives improvements across infrastructure and operations.
Key Responsibilities
- Support day-to-day operations of IT infrastructure, including servers, networks, storage, cloud platforms, and enterprise applications
- Monitor system performance, availability, and capacity; proactively identify and resolve issues
- Troubleshoot infrastructure incidents across servers, networks, storage, and applications
- Participate in on-call rotation and provide after-hours support as needed
- Assist with deployment, configuration, and maintenance of Azure/AWS and on-prem systems
- Automate operational tasks using scripting and infrastructure-as-code tools
- Perform system patching, upgrades, and vulnerability remediation
- Collaborate with internal teams on deployments, upgrades, and infrastructure changes
- Maintain documentation for systems, processes, and incident response procedures
- Work with third-party vendors to support and resolve infrastructure issues
- Support backup, disaster recovery, and business continuity planning/testing
- Analyze logs and metrics to identify root causes and recommend improvements
- Enforce operational standards, policies, and best practices
- Support identity and access management (IAM), including provisioning and security controls
Required Technical Skills
- 3–7 years of hands-on experience in IT operations, systems engineering, or infrastructure support
- Strong experience in hybrid environments (Azure preferred; AWS/GCP acceptable)
- Proficiency with:
- Windows Server and/or Linux
- Virtualization technologies
- Networking fundamentals (TCP/IP, DNS, routing, firewalls, VPNs)
- Experience with identity and access management:
- Active Directory, Azure AD, MFA, SSO
- Scripting/automation experience (PowerShell, Python, Bash, Terraform, etc.)
- Monitoring/observability tools (Splunk, Datadog, Grafana, ELK, LogicMonitor)
- Familiarity with zero trust and modern network security:
- Zscaler (ZIA/ZPA), SASE
- Experience with enterprise storage (Dell EMC / Dell APEX preferred)
- Understanding of backup, replication, and disaster recovery strategies
- Experience with ITSM tools and ITIL processes
Key Attributes
- Strong troubleshooting skills across multiple layers of the stack
- Ability to work in a fast-paced, high-pressure environment
- Strong communication and collaboration skills
- Self-starter with ownership mentality
- Ability to manage multiple priorities simultaneously
- Willingness to participate in on-call rotation
Education
- Bachelor’s degree in Computer Science, Information Technology, or related field (preferred)