Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Data Migrators have created a tool called S2PX (what?) that assists in the migration of legacy DataStage Server jobs to modern DataStage jobs capable of executing on the DataStage Parallel (PX) engine.

S2PX Design Principles

It may surprise some users to discover that often S2PX does not generate the same optimal Parallel Job design that an experienced Developer would create given the the original Server Job as a model.

S2PX is a tool for taking the designed behaviour of your Server jobs and making them available in a Parallel canvas while simultaneously replicating the complexity and (often unexpected) foibles, quirks and inconsistencies of the Server Job operating environment. It’s this need to replicate the often unintuitive behaviour of the Server environment that can lead those with limited Server Job design experience to question some of S2PX’s conversion decisions. Rest assured that all of those seemingly unusual decisions are all founded on sound reasoning and a substantial body of evidence of how Server Jobs are used in the real world.

The core principles driving S2PX’s design are…

  • S2PX prioritises the generation of a functionally-accurate Parallel Jobs above all else.

  • S2PX aims to provide as much coverage of Server Job functionality as possible.

  • S2PX design decisions have been informed by a detailed analysis of >150K real-world Server jobs submitted by a broad range of IBM IBM’s DataStage customers from around the world.

  • S2PX does not automatically convert DataStage BASIC routines into Parallel engine compatible routines, although it does provide facilities to make that processes easier.

  • S2PX will likely not generate the same Parallel Job design that an experienced Developer would create given the the original Server Job as a modeluses your solution’s design-time information to inform its conversion. Runtime logs are not used to inform conversion decision.

  • S2PX generates Parallel Jobs with every stage set to run in sequential mode (for good reasons)

  • S2PX converts Hashed files to DRS stages (for good reasons)

  • S2PX runs in situ, meaning it exports Server jobs from your existing DataStage environment and delivers automatically-generated Parallel jobs into that same environment. Migrating those newly-generated Parallel jobs to an upgraded environment (e.g. DataStage v11.5 to v11.7, or even to Cloud Pak for Data) is achieved using other tools.

...