
DFSORT Parallel Processing

Parallel processing in DFSORT means the product uses more than one thread or engine to perform sort or merge work at the same time. On a system with multiple CPUs or logical processors, this can reduce elapsed time by spreading work across them. Not every DFSORT environment supports parallelism; when it does, it is often controlled by an option such as THREADS in the OPTION statement. This page explains what parallel processing is, when it helps, what limits its benefit, and how it relates to memory, sortwork, and other tuning parameters.


What Is Parallel Processing in DFSORT?

In a single-threaded sort, one logical stream of work runs: input is read, records are sorted (in memory and/or sortwork), and output is written. In parallel processing, DFSORT can use multiple threads or engines so that several streams of work run concurrently. For example, different portions of the sort or merge can be handled by different threads, each potentially using a different CPU. The goal is to shorten elapsed time by using more of the system’s processing capacity. The exact implementation (how work is split, how many threads, and how they coordinate) depends on your DFSORT or z/OS Beyond Sort version and installation.

When Does Parallelism Help?

Parallel processing tends to help when:

  • The sort or merge is large enough that the work can be meaningfully split across threads. Very small sorts may not benefit because the overhead of coordinating threads can outweigh the gain.
  • The system has multiple CPUs or logical processors available so that threads can actually run concurrently.
  • There is enough memory for the product to use multiple threads without excessive paging or contention.
  • The job is not already limited by something else (e.g. I/O). If the bottleneck is disk, adding more CPU threads may not reduce elapsed time.

So parallelism is one lever among many: you still need sufficient memory (SIZE/MOSIZE), sortwork (DYNALLOC or SORTWK), and a realistic FILSZ. Enabling THREADS (or the equivalent) can then reduce elapsed time when the workload and system are suitable.
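As a concrete sketch of how these levers fit together, the following SYSIN is illustrative only: the FILSZ estimate and DYNALLOC values are placeholders, and THREADS stands in for whatever parallelism keyword (if any) your product accepts.

//* Capacity first (FILSZ estimate, dynamic sortwork), then parallelism
//SYSIN    DD  *
  OPTION FILSZ=E5000000,DYNALLOC=(SYSDA,4),THREADS=2
  SORT FIELDS=(1,20,CH,A)
/*

Here the record-count estimate (FILSZ) and dynamically allocated sortwork (DYNALLOC) establish the capacity baseline; the thread count is layered on top and only matters once that baseline is adequate.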

Factors That Affect Benefit

The main factors are:

  • Workload size: larger sorts can benefit more; small sorts may have overhead that outweighs gains.
  • CPU availability: multiple processors must be available for threads to run in parallel.
  • Memory: each thread may use memory; insufficient memory can limit parallelism or cause paging.
  • I/O: if the job is I/O-bound, more threads may not reduce elapsed time.
  • Product/version: support and option names (e.g. THREADS) vary by DFSORT/Beyond Sort release.

How Parallelism Is Controlled

When your DFSORT or z/OS Beyond Sort installation supports parallel processing, it is typically controlled by an option in the OPTION statement or by a PARM (or DFSPARM) value on the EXEC. A common option name is THREADS; the syntax might be THREADS=n to specify the number of threads, or a keyword like THREADS=YES. The exact option name, allowed values, and default behavior are product- and installation-dependent. For details on the THREADS option and syntax, see the THREADS and parallelism tutorial. Here we focus on the concepts: what parallelism is and when to consider it.

Example: OPTION THREADS (Illustrative)

The following is illustrative; your product may use a different keyword or syntax.

OPTION THREADS=4
SORT FIELDS=(1,20,CH,A)

This might request that DFSORT use up to four threads for sort work. Whether four is optimal depends on your system (number of CPUs, memory, and the size of the sort). Some installations leave the default (e.g. OPTIMAL or a product-chosen value) and only override when tuning for a specific workload.
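If the option is supplied outside SYSIN, the same request might be made on the EXEC statement or through an override data set. The sketch below is equally illustrative: DFSPARM is a DFSORT override convention, and whether a THREADS keyword is honored in either place depends on your product and release.

//* Illustrative PARM form on the EXEC statement
//SORTSTEP EXEC PGM=SORT,PARM='THREADS=4'
//*
//* Or as an OPTION override in a DFSPARM data set
//DFSPARM  DD  *
  OPTION THREADS=4
/*

Either way, the requested value is still subject to installation defaults and limits.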

Interaction with Memory and Sortwork

Parallel processing does not replace the need for adequate memory and sortwork. The product still needs enough central storage (SIZE/MOSIZE) and enough sortwork (DYNALLOC or SORTWK) to complete the sort. Parallelism spreads the work across threads; each thread may use memory and may generate I/O to sortwork. So tuning typically involves setting FILSZ, SIZE, and sortwork appropriately, and then enabling or tuning THREADS (or equivalent) if supported. If the job fails with capacity or resource errors, address memory and sortwork first; parallelism is an optimization on top of that.
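For example, when a sort is failing on work space, a first step is often to give it more explicit sortwork before experimenting with any parallelism keyword. The DD statements below are a hedged sketch; the unit, space amounts, and number of work data sets are placeholders to size for your own data.

//* Provide generous sortwork explicitly before tuning THREADS
//SORTWK01 DD UNIT=SYSDA,SPACE=(CYL,(500,100))
//SORTWK02 DD UNIT=SYSDA,SPACE=(CYL,(500,100))
//SORTWK03 DD UNIT=SYSDA,SPACE=(CYL,(500,100))

Once the job completes reliably with adequate memory and sortwork, THREADS (or its equivalent) becomes a meaningful optimization to test.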

I/O-Bound vs CPU-Bound

If your sort is I/O-bound—meaning the CPUs are often idle waiting for data from disk—adding more threads may not reduce elapsed time much, because the bottleneck is the disk, not the CPU. In that case, improving I/O (e.g. more or faster sortwork volumes, better BLKSIZE, or more memory to reduce sortwork I/O) may help more than increasing THREADS. If the job is CPU-bound and has enough memory and sortwork, then parallelism can help. Understanding your job’s profile (e.g. from SMF or performance tools) helps decide where to tune.
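To illustrate the difference in approach, an I/O-bound sort might get more work data sets to spread sortwork I/O rather than more threads. As before, this is a sketch: the DYNALLOC count and FILSZ estimate are placeholders, and the right values come from measuring your own job.

//* Spread sortwork I/O across more dynamically allocated work data sets
//SYSIN    DD  *
  OPTION FILSZ=E5000000,DYNALLOC=(SYSDA,8)
  SORT FIELDS=(1,20,CH,A)
/*

If elapsed time improves with more work data sets but not with more threads, that is a strong sign the job is I/O-bound.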

Product and Version Differences

Not all DFSORT environments support parallel processing. Support and option names (e.g. THREADS, PARALLEL) vary by product (e.g. DFSORT vs z/OS Beyond Sort) and release. Some installations may have the feature disabled or may use different PARM conventions. Always check your DFSORT Application Programming Guide or installation documentation to see whether parallelism is available and how to enable or tune it.

Explain It Like I'm Five

Imagine sorting a huge pile of cards. If one person does it, it takes a long time. If four friends help and each sorts a part of the pile, then you combine the parts, the job can finish sooner—but only if you have four friends (CPUs) and enough table space (memory) for everyone to work. If the slow part is running to get more cards (I/O), then having more friends doesn’t help until the running is faster. Parallel processing is like having more friends help with the sort; it works when the job is big enough and when you have enough helpers and space.

Exercises

  1. Your large sort is I/O-bound. Would increasing THREADS alone be likely to help? What might you tune instead?
  2. Why might a very small sort not benefit from parallel processing?
  3. Where would you look to find the exact option name and syntax for parallel processing in your DFSORT installation?

Quiz

Test Your Knowledge

1. What is parallel processing in DFSORT?

  • Using multiple input files
  • Using multiple threads or engines to perform sort/merge work so that more than one CPU can work on the job at once, potentially reducing elapsed time
  • Using multiple SORTWK only
  • Copying records in parallel

2. When does parallel processing help most?

  • Only for very small files
  • When the sort is large enough that the work can be split across threads and the system has sufficient CPU and memory; small sorts may not benefit due to overhead
  • Only when using INCLUDE
  • Only for MERGE

3. What can limit the benefit of parallel sort?

  • Nothing limits it
  • I/O bandwidth, memory, CPU availability, and workload size—if the job is I/O-bound or too small, or if memory is tight, adding threads may not improve elapsed time
  • Only record length
  • Only number of sort keys

4. How is parallel processing typically enabled in DFSORT?

  • It is always on
  • Via an option such as OPTION THREADS (or equivalent) in SYSIN or via PARM/DFSPARM; the exact keyword and values are product- and installation-dependent
  • Only in JCL EXEC
  • Only for JOINKEYS

5. Is parallel processing available in all DFSORT environments?

  • Yes, in every installation
  • No; support depends on the product (e.g. DFSORT vs z/OS Beyond Sort), release, and installation. Some environments may not offer it or may use different option names
  • Only on tape
  • Only for fixed-length records