What are the responsibilities and job description for the Workload Automation Engineer (IBM Tivoli/TWS/IWS) position at Themesoft Inc.?
Job Title: Workload Automation Engineer (IBM Tivoli-TWS/IWS)
Location: Warsaw, IN (Hybrid)
Role Overview
We are seeking a Workload Automation Engineer to define and drive the enterprise architecture, strategy, and operational model for IBM Tivoli/IBM Workload Scheduler (TWS/IWS) across distributed environments (on-prem and cloud). This role sets platform standards and reference designs, leads modernization and major upgrades/migrations, governs reliability and security practices, and serves as the senior technical partner for application, database, and infrastructure organizations to deliver resilient, scalable scheduling services for mission-critical workloads. The role also assists with and supervises two job scheduling teams.
Must Haves
- Hands-on experience with the end-to-end architecture for the TWS/IWS platform (components, topology, environments, integrations), including standards, patterns, APIs (REST/SOAP), event-based scheduling, and real-time/on-demand workload patterns.
- Experience with Tivoli Dynamic Workload Console (TDWC/TDWB), critical path monitoring, and integrating file transfer solutions (e.g., SFTP/PGP/GPG, managed file transfer platforms) into batch workflows.
- Experience with SAP and other enterprise application integrations via TWS extended agents.
- Experience building dashboards/metrics and integrating with observability platforms (e.g., Grafana/Graphite).
- Databases: DB2 (HADR), Oracle/Postgres (familiarity).
- Experience defining platform standards, leading upgrades/migrations, and coordinating cross-team delivery (e.g., change windows, cutovers, rollback planning).
- Artificial Intelligence – Navigating AI applications, understanding when to use them, and recognizing when they do not meet business expectations.
Key Responsibilities
- Own the end-to-end architecture for the TWS/IWS platform (components, topology, environments, integrations), including standards, patterns, and reference implementations.
- Provide technical oversight for additional (third-party) job scheduling platforms where used; establish operating standards, integration patterns, and support processes to ensure consistent controls and reliability.
- Lead enterprise-scale installations, upgrades, and migrations; define cutover/rollback strategies, coordinate change windows, and ensure readiness across dependent teams.
- Lead assessments of legacy scheduler instances and batch frameworks to identify candidates for retirement, consolidation, or migration; produce target-state recommendations, sequencing/roadmaps, and risk-based migration plans.
- Define reliability engineering practices for workload automation: availability targets, capacity planning, performance tuning, monitoring/alerting, and continuous improvement.
- Design and validate high-availability and disaster recovery solutions (including DB2 HADR where applicable); plan and execute regular DR tests and remediate gaps.
- Establish governance for workload onboarding and job design: scheduling standards, dependency modeling, naming conventions, calendars, critical path optimization, and SLA/SLO management.
- Architect and productionize automation for platform operations and self-service (e.g., provisioning, reporting, batch controls) using shell/Python/Perl and enterprise tooling.
- Own security and compliance posture: access model (LDAP/SSO), least-privilege controls, audit evidence, vulnerability remediation, and secure configuration baselines.
- Manage and develop two teams (e.g., platform engineering and operations): set priorities and operating rhythms, oversee delivery and support outcomes, coach/mentor team members, and drive performance management in partnership with leadership.
- Be available for major outages and critical events related to job scheduling, including QEND activities up to four (4) times per fiscal year.
- Participate in an on-call rotation and provide after-hours/weekend support as needed.
- Support a global operating model by working flexibly across EMEA and US business hours.
- Serve as the escalation point for complex incidents; lead root-cause analysis and drive problem management.
- Mentor and guide engineers; lead technical design reviews, documentation/runbook standards, and knowledge sharing across the organization.
- Develop an in-depth understanding of the other job scheduling teams in IT Operations, such as Automate, AS400, and Robot, and assist in supervising them.
Required Qualifications
- High School Diploma or equivalent.
- 10 years of experience in enterprise workload automation, including 7 years of hands-on IBM TWS/IWS/IWA administration in distributed environments.
- Bachelor’s degree or 10 years of equivalent IT industry service experience.
- Proven experience in a lead/architect capacity.
- Strong Linux/UNIX engineering and production troubleshooting experience.
- Advanced automation/scripting skills (Shell, Python, and/or Perl).
- Strong incident response, root-cause analysis, and problem management experience.
- Strong change leadership aligned with ITIL processes.
- Excellent stakeholder communication and collaboration skills.
Preferred Qualifications
- DB2 administration experience, including HADR.
- Oracle/Postgres and SQL familiarity.
- REST/SOAP APIs and event-based scheduling.
- Tivoli Dynamic Workload Console (TDWC/TDWB).
- SFTP/PGP/GPG integrations.
- SAP integrations via TWS extended agents.
- Grafana/Graphite observability experience.
- Cloud automation and workload modernization experience.
- ServiceNow and ITSM process experience.
- Working knowledge of ITIL concepts and best practices.
- Strong analytical, problem-solving, and communication skills.
Skills & Tools
Workload Automation: IBM TWS/IWS/IWA, TDWC/TDWB, Dynamic Scheduling, JSDL
Operating Systems: Linux, UNIX (AIX/SunOS), Windows (Agent Support)
Databases: DB2 (HADR), Oracle/Postgres
Scripting: Shell, Python, Perl
ITSM/Monitoring: ServiceNow, AppDynamics, OBM, Grafana, Graphite, ITIL Processes
Security: LDAP/SSO, Role-Based Access Control, Audit & Patch Compliance