What is advanced use of ALTSEQ?

Advanced use includes building full 256-byte translation tables for code page conversion (e.g. EBCDIC to ASCII), using the same ALTSEQ for both sort collation and TRAN=ALTSEQ in INREC/OUTREC, translating only selected fields, and reusing the table across jobs via shared members or procedures.

How do I build a full EBCDIC-to-ASCII ALTSEQ table?

Define ALTSEQ CODE=(pair1,pair2,...) with 256 pairs, one for each input byte value (00–FF) and its corresponding ASCII (or target) value. Use IBM or other authoritative EBCDIC-to-ASCII mapping tables. Place the definition in a member and include it in SYSIN; then use TRAN=ALTSEQ on the field(s) to convert.

Can ALTSEQ affect both sort order and output content?

Yes. The same ALTSEQ table can be used as the alternate collating sequence for the sort (so key comparison follows the mapping) and for TRAN=ALTSEQ in INREC or OUTREC (so output bytes are translated). One definition serves both purposes.

What if my ALTSEQ table does not define all 256 bytes?

Bytes not defined in the table may be left unchanged or may have product-specific behavior. For reliable code page conversion, define all 256 bytes. For partial translation (e.g. only uppercase to lowercase), defining just the needed pairs is fine.

ALTSEQ Advanced Use - Full Translation Tables and Custom Collation

Q: How do I translate only one field in a record with ALTSEQ?

In INREC or OUTREC, specify the record layout with BUILD or FIELDS and attach TRAN=ALTSEQ only to the segment(s) that represent the field(s) you want to translate. Other segments are copied without translation. Exact syntax (e.g. position,length,TRAN=ALTSEQ) is in your DFSORT manual.

ALTSEQ Advanced Use

Advanced use of ALTSEQ in DFSORT includes building full 256-byte translation tables for complete code page conversion (such as EBCDIC to ASCII), using the same ALTSEQ definition for both sort collation and character translation in INREC or OUTREC, translating only selected fields while leaving the rest of the record unchanged, and reusing a long ALTSEQ table across jobs by storing it in a shared member or procedure. This page builds on the basic ALTSEQ page: it covers full translation tables, common pitfalls, combining collation and translation, and how to maintain and include large ALTSEQ definitions.

Full 256-Byte Translation Tables

For code page or character set conversion (e.g. EBCDIC to ASCII), every possible byte value (0–255) can appear in the data. If your ALTSEQ CODE= table omits some values, those bytes may be left untranslated or may be handled in a product-specific way. To get predictable results, define a mapping for all 256 bytes. The table is a list of (input_hex, output_hex) pairs: for each input byte value, the output byte value is given. You can generate the list from an authoritative EBCDIC-to-ASCII (or other) mapping table published by IBM or your platform vendor. Keep the definition in a separate dataset member so you can include it in multiple jobs and update it in one place.

EBCDIC to ASCII Conversion

A common advanced use is converting EBCDIC data to ASCII for transfer to non-mainframe systems. You define ALTSEQ CODE= with 256 pairs mapping each EBCDIC byte to its ASCII equivalent. Then in INREC or OUTREC you specify TRAN=ALTSEQ for the field(s) or the whole record that should be converted. Control characters, spaces, digits, letters, and national characters all need correct mappings; missing or wrong pairs can cause downstream systems to misinterpret the data. Use a standard mapping (e.g. IBM code page documentation) and test with sample data that includes spaces, punctuation, and any special characters your application uses.

Using the Same Table for Sort and Translation

The same ALTSEQ definition can serve two roles in one job: as the alternate collating sequence for the sort (so key comparison uses your mapping) and as the table for TRAN=ALTSEQ when building records. For example, you might want output in ASCII but also want the sort order to follow ASCII character order. Define the ALTSEQ table once; reference it for the sort (per your product’s syntax) and use TRAN=ALTSEQ on the appropriate OUTREC (or INREC) fields. That way you do not maintain two separate tables and the behavior is consistent.

Translating Selected Fields Only

In INREC or OUTREC you specify the record layout as a series of segments (position, length, and optional modifiers). Attach TRAN=ALTSEQ only to the segment(s) that represent the field(s) you want to translate. Other segments are copied as-is. For example, you might translate only bytes 41–80 (a name field) and leave bytes 1–40 (binary or already-ASCII data) unchanged. The exact syntax for attaching TRAN=ALTSEQ to a segment (e.g. whether it applies to the previous segment or a following one) depends on your DFSORT version; see the INREC/OUTREC section of the manual.

Maintaining and Including Long ALTSEQ Definitions

A full 256-byte table makes the ALTSEQ statement long. To avoid repeating it in every job, put the ALTSEQ statement (and optionally other control statements) in a separate dataset member. In the JCL, point SYSIN to a concatenation that includes that member, or use a procedure that contains the member. When the mapping changes (e.g. a new code page), you update the member once and all jobs that include it get the new table. Document the member name and the fact that it defines ALTSEQ so that maintainers know where to look.

Pitfalls: Missing or Wrong Mappings

If the ALTSEQ table does not define a byte value, that byte may pass through unchanged or may be treated in an undefined way. In EBCDIC-to-ASCII conversion, control characters (e.g. X'00'–X'3F') and high bytes (e.g. X'80'–X'FF') often need explicit mappings; otherwise binary data or national characters can be wrong. Another pitfall is reversing the pairs (output_hex, input_hex instead of input_hex, output_hex), which produces incorrect translation. Verify a few known characters (e.g. space, letter A, digit 0) in the output and spot-check the full range if the data is critical.

Performance

Applying TRAN=ALTSEQ to large fields or entire records adds work per record. For very large files, the cost is usually acceptable, but if you only need to convert a few columns, specify TRAN=ALTSEQ only on those segments to reduce the amount of data translated. Using ALTSEQ as the collating sequence can also affect sort performance slightly compared to the default sequence; for most jobs the difference is small.

Explain It Like I'm Five

Imagine a big secret decoder sheet: for every possible squiggle (all 256 of them), you write down what it should become in the new alphabet. When the sorter builds a new card, it uses your sheet to change only the parts you mark. If you forget to write down what one squiggle means, that squiggle might not change and could look wrong. So for a full new alphabet, you need to write down every squiggle. And you can use the same sheet so the sorter both sorts by the new alphabet and writes the new alphabet on the card.

Exercises

Why is it important to define all 256 bytes when doing EBCDIC-to-ASCII conversion with ALTSEQ?
How can you reuse one ALTSEQ table in many jobs without copying the whole statement each time?
How do you translate only bytes 50–80 of an 80-byte record and leave 1–49 unchanged?
What can go wrong if your ALTSEQ table omits mappings for control characters (e.g. X'0A')?

Quiz

Test Your Knowledge

1. Why would you define all 256 byte values in an ALTSEQ CODE= table?

DFSORT requires it
For full code-page or EBCDIC-to-ASCII conversion, every possible input byte must map to an output byte so no character is left untranslated or undefined
Only for sort
It is faster

2. Can you use ALTSEQ for both sort order and translation in the same job?

No, only one use per job
Yes; the same ALTSEQ table can be used as the collating sequence for SORT and for TRAN=ALTSEQ in INREC/OUTREC so sort order and output content both follow the mapping
Only in MERGE
Only with MODS

3. What is a common pitfall when building an EBCDIC-to-ASCII ALTSEQ table?

Using CH format
Missing mappings for control characters or rarely used bytes, which can leave some bytes untranslated or produce wrong results in downstream systems
Using INCLUDE
Sort key length

4. How can you avoid duplicating a long ALTSEQ table in multiple jobs?

You cannot
Define the table once in a member (e.g. ALTSEQ member) and use SYSIN or a copy library to include it in each job; some sites use a shared INCLUDE or procedure
Use MODS
Use OPTION COPY

5. When translating only part of a record with TRAN=ALTSEQ, what must you specify correctly?

Only the sort key
The position and length of the field(s) that get TRAN=ALTSEQ so that only the intended bytes are translated and the rest of the record is unchanged
Only the record length
Only OPTION