How do I find records that contain a specific string in DFSORT?

Use INCLUDE with COND= and the SS (substring search) format: INCLUDE COND=(start,length,SS,EQ,C'your_string'). DFSORT searches for the string anywhere within the byte range [start, start+length-1]. Any record containing that substring is kept. Use the correct start and length for your search area.

What is SS in DFSORT INCLUDE COND?

SS stands for substring search. When you use (position, length, SS, EQ, constant), DFSORT looks for the constant anywhere within the specified length bytes starting at position. The field can be longer than the constant. EQ means "contains"; NE would mean "does not contain."

How do I omit records that contain a character or string in DFSORT?

Use OMIT with SS: OMIT COND=(start,length,SS,EQ,C'string'). Records that contain that substring in the range are omitted. Example: OMIT COND=(1,80,SS,EQ,C'DELETE') omits every record that has "DELETE" anywhere in the first 80 bytes.

Can I search for multiple patterns with INCLUDE?

Yes. Use OR to combine substring conditions: INCLUDE COND=(1,80,SS,EQ,C'ABC',OR,1,80,SS,EQ,C'XYZ'). Records containing either "ABC" or "XYZ" in the first 80 bytes are kept. Use AND to require two (or more) substrings in the same or different ranges.

Does SS work with NE (not equal)?

Yes. INCLUDE COND=(1,80,SS,NE,C'SKIP') would keep records that do not contain "SKIP" in the first 80 bytes. NE with SS means "does not contain." Check your DFSORT manual for full operator support with SS.

Pattern Matching - DFSORT INCLUDE OMIT Substring Search SS

Pattern Matching with INCLUDE and OMIT

Pattern matching in DFSORT often means finding or excluding records that contain a given string anywhere within a field—not just when the whole field equals the string. For example, you might want to keep records where the word "ERROR" appears anywhere in the first 80 bytes, or omit records that contain a comma in a certain range. This is done with the SS (substring search) format in INCLUDE or OMIT COND=. With SS, you specify a search range (start position and length) and a constant; DFSORT looks for that constant anywhere within the range. If it finds it, the condition is true (for EQ) or false (for NE). So "contains" is expressed as (start, length, SS, EQ, C'string'). This is different from CH (character) with EQ, which requires the entire field to exactly match the constant (same length, byte for byte). This page explains SS substring search, how to combine multiple patterns with AND/OR, and pitfalls such as commas inside constants.

INCLUDE / OMIT Advanced Filtering

SS (Substring Search) Syntax

The condition for substring search has the form (start, length, SS, operator, constant). Start is the starting byte position (1-based). Length is the number of bytes in the search range—DFSORT looks for the constant anywhere within those bytes. SS is the format code for substring search. Operator is usually EQ (contains) or NE (does not contain); GT, GE, LT, LE may also be supported for collating-order comparison—check your manual. Constant is the string to search for, typically C'…' for character. The constant can be shorter than the length; the length defines the window in which to search. Example: keep records that contain "ERROR" anywhere in the first 80 bytes of the record:

text

1
INCLUDE COND=(1,80,SS,EQ,C'ERROR')

So if "ERROR" appears at position 10, 50, or anywhere in 1–80, the record is kept. To omit records that contain that string:

text

1
OMIT COND=(1,80,SS,EQ,C'ERROR')

Then only records that do not contain "ERROR" in the first 80 bytes are written to the output.

SS vs CH: Exact Match vs "Contains"

With CH (character format), EQ means the entire field must exactly match the constant. The field length and the constant length must match. So (1,5,CH,EQ,C'HELLO') keeps only records where bytes 1–5 are exactly H-E-L-L-O. With SS, EQ means the constant appears somewhere within the search range. So (1,80,SS,EQ,C'HELLO') keeps any record that has "HELLO" as a substring in the first 80 bytes. Use CH when you need a fixed-position exact match; use SS when you need "contains" or "find anywhere."

CH vs SS for pattern matching
Format	Comparison	Meaning
CH	Exact match	Entire field must equal the constant; same length required
SS	Substring search	Constant can appear anywhere within the field; field can be longer

Searching in a Specific Region

You can limit the substring search to a specific part of the record by setting start and length accordingly. For example, to find "WARN" only in bytes 41–80 (e.g. a second half of a fixed block):

text

1
INCLUDE COND=(41,40,SS,EQ,C'WARN')

So only if "WARN" appears in that 40-byte region is the record kept. This avoids false hits in the first 40 bytes. Similarly, to omit records that have a comma anywhere in positions 1–50 (e.g. to ensure a region has no delimiter):

text

1
OMIT COND=(1,50,SS,EQ,C',')

Any record with a comma in 1–50 is dropped.

Multiple Patterns with OR

To keep records that contain any of several strings, use OR between SS conditions. Each condition specifies the same or different search range and a different constant. Example: keep records that contain "ERROR" or "WARN" or "FAIL" in the first 80 bytes:

text

1
INCLUDE COND=(1,80,SS,EQ,C'ERROR',OR,1,80,SS,EQ,C'WARN',OR,1,80,SS,EQ,C'FAIL')

If any one of the three substrings is found in 1–80, the record is kept. The same (start, length, SS, EQ, constant) is repeated for each pattern; only the constant changes. Note: if your constant itself contains a comma, it can be confused with the comma that separates elements in COND=. Use a different delimiter in the constant (e.g. a period or slash) or escape as required by your product; see your DFSORT manual.

Requiring Multiple Patterns with AND

To keep records that contain all of several strings, use AND. You can search in the same range or in different ranges. Example: keep records that contain "ID=" in 1–40 and "OK" in 41–80:

text

1
INCLUDE COND=(1,40,SS,EQ,C'ID=',AND,41,40,SS,EQ,C'OK')

Both substrings must be present in their respective regions. So AND narrows the set (all conditions true); OR widens it (at least one true).

Constants That Contain Commas

In COND=, commas separate the elements of each condition and the AND/OR keywords. If your search string contains a comma (e.g. C'A,B'), the parser may treat it as a delimiter. Different DFSORT versions handle this differently—some allow quoted or escaped commas inside C'…'. To avoid ambiguity, you can (1) use a separator that is not a comma in your constant (e.g. C'A.B' if that still matches your data), or (2) check your Application Programming Guide for how to include a comma in a constant. Document any such cases in your shop standards.

Explain It Like I'm Five

Imagine you have a long line of letters and you want to find the word "CAT" somewhere in that line. You don't care if it's at the start, the middle, or the end—you just want to know if "CAT" appears anywhere. That's what SS does: it looks for the little word (pattern) anywhere inside the big line (the search range). If it finds it, we keep that line. If we use "omit," we throw away any line that has that word. We can also say "keep lines that have CAT or DOG" (OR) or "keep lines that have both CAT and DOG" (AND). The computer just scans the bytes and checks.

Exercises

Write INCLUDE COND= to keep records that contain "PENDING" anywhere in bytes 1–100.
Write OMIT COND= to drop records that contain a space (C' ') in positions 20–30.
Keep records that contain either "ACTIVE" or "HOLD" in the first 60 bytes. Use OR.
Keep records that contain "START" in 1–40 and "END" in 41–80. Use AND.

Quiz

Test Your Knowledge

1. How do you keep records that contain the string "ERROR" anywhere in the first 80 bytes?

INCLUDE COND=(1,80,CH,EQ,C'ERROR')
INCLUDE COND=(1,80,SS,EQ,C'ERROR')
Use OUTFIL only
INCLUDE does not support substring

2. What is the difference between CH,EQ and SS,EQ for the same position and length?

No difference
CH,EQ requires the entire field to exactly match the constant; SS,EQ requires the constant to appear anywhere within the field
SS is for numeric only
CH is for substring

3. How do you omit records that contain a comma in positions 1–50?

OMIT COND=(1,50,SS,EQ,C',')
OMIT COND=(1,50,CH,NE,C',')
Two OMIT statements
Comma cannot be used in COND

4. To match multiple different substrings (e.g. keep if "ABC" OR "XYZ" appears in 1–30), how do you code it?

INCLUDE COND=(1,30,SS,EQ,C'ABC,XYZ')
INCLUDE COND=(1,30,SS,EQ,C'ABC',OR,1,30,SS,EQ,C'XYZ')
Two INCLUDE statements
SS does not support OR

5. Can you use AND to require two different substrings in the same or different areas?

No
Yes—e.g. INCLUDE COND=(1,40,SS,EQ,C'ABC',AND,41,40,SS,EQ,C'XYZ') keeps records that contain ABC in 1–40 and XYZ in 41–80
Only in same field
AND is only for numeric

Pattern Matching with INCLUDE and OMIT

SS (Substring Search) Syntax

SS vs CH: Exact Match vs "Contains"

Searching in a Specific Region

Multiple Patterns with OR

Requiring Multiple Patterns with AND

Constants That Contain Commas

Explain It Like I'm Five

Exercises

Quiz

Test Your Knowledge

Related Concepts

Comparison operators

AND/OR conditions

Complex conditional expressions

Related Pages