What are the responsibilities and job description for the AB Initio Developer position at The Judge Group?
- Ab Initio Developer — detailed skillset expectations (senior, 7 yrs)
Ab Initio core development (expert)
- GDE proficiency: designs and builds complex graphs end-to-end; strong command of component configuration, metadata propagation, partitioning, and checkpointing.
- Reusable framework mindset: parameterized graphs, shared subgraphs, reusable transform components, standardized logging/error handling patterns.
- Programming in transforms: strong use of DML, XFR, and embedded scripting where used; can write efficient expressions and avoid costly per-record logic.
- Metadata discipline: robust record formats, strict typing, handling of optional/nullable fields, encoding/locale concerns, and schema evolution strategy.
2) Performance engineering & scalability (expert)
- Parallelism & partitioning ; chooses partition keys intentionally; understands hash/range/round-robin/broadcast; avoids skew and hot partitions.
- Sort/join/aggregate optimization: picks correct join strategies, minimizes re-sorts, uses flow tuning and component alternatives (e.g., lookup vs join) appropriately.
- Memory/I/O awareness: understands spill behavior, buffer sizing concepts, file system throughput, and impacts of compression and record sizes.
- Benchmarking: can baseline runtime, identify bottlenecks, and improve throughput while preserving correctness.
Data engineering fundamentals (expert)
- ETL design: incremental vs full loads, CDC patterns (where applicable), late-arriving data handling, idempotent re-runs.
- Data quality: profiling, validation rules, reconciliations, thresholding, and actionable rejects (not silent drops).
- Lineage & audit: column-level mapping clarity, control totals, and traceable transformations for regulated reporting.
Platform & operational mastery (strong senior)
- Scheduling/orchestration integrates with enterprise schedulers; understands dependencies, calendars, rerun semantics, and backfills.
- Error handling: standardized reject paths, quarantines, retries, alerting hooks, and operational dashboards/log patterns.
- Restartability: checkpointing, safe partial reruns, and consistent handling of intermediate datasets.
Salary : $60 - $75