Are duplicate keys allowed in VSAM KSDS?

No. The primary key in a KSDS must be unique. Duplicate primary keys are not allowed. If you need multiple records with the same “key” value, use that as an alternate key (alternate index with NONUNIQUEKEY) and keep a unique primary key (e.g. record ID).

What is NONUNIQUEKEY in VSAM?

NONUNIQUEKEY is a parameter used when defining an alternate index. It allows multiple base records to have the same alternate key value. So you can access by that key and get multiple records (e.g. all orders for one customer). The primary key of the base cluster remains unique.

What file status is returned for duplicate key in VSAM?

When you try to insert a record with a duplicate primary key, VSAM typically returns file status 22 (duplicate key) or an equivalent condition code. The exact code can vary by interface (COBOL, CICS, etc.). The WRITE does not succeed.

How do I handle duplicate keys when loading a VSAM file?

Ensure keys are unique before loading: sort and deduplicate input, or use a key that includes a sequence number. If duplicates are expected, check file status after each WRITE and skip, log, or update the existing record instead of inserting.

MainframeMaster

VSAM Duplicate Keys

In VSAM, “duplicate keys” means more than one record sharing the same key value. For the primary key of a Key-Sequenced Data Set (KSDS), duplicates are not allowed—each record must have a unique primary key, and an insert with a duplicate key fails. For alternate indexes, you can allow duplicates by defining the alternate index with NONUNIQUEKEY; then multiple base records can have the same alternate key value (e.g. many orders with the same customer ID). Understanding when duplicates are allowed and how to handle duplicate-key errors is important for designing files and writing robust programs. This page explains primary key uniqueness, alternate index UNIQUEKEY vs NONUNIQUEKEY, what happens when a duplicate key is written, and how to handle or avoid duplicates.

Primary Key: No Duplicates Allowed

The primary key of a KSDS uniquely identifies each record. No two records can have the same primary key value. If your program issues a WRITE (or equivalent) to add a record whose key already exists in the file, VSAM does not add the record. Instead it returns a condition indicating duplicate key (often file status 22 in COBOL or an equivalent return code). The new record is not written; the existing record with that key is unchanged. There is no “replace if exists” or “ignore duplicate” option for the primary key at the VSAM level. The application must ensure that every key it writes is unique, or it must handle the duplicate-key return (e.g. skip the record, log an error, or read the existing record and update it instead).

Why Duplicates Are Not Allowed for the Primary Key

The KSDS index structure maps each key value to one record. The index entries (sequence set and index set) assume a one-to-one relationship: given a key, there is exactly one record. If two records had the same primary key, the index could not point to both; the search and insert logic would not know which record to return or where to put a new record. So the design of KSDS requires unique primary keys. If your data naturally has a non-unique field (e.g. customer ID that appears on many order records), that field should not be the primary key. Use a unique key (e.g. order ID) as the primary key and create an alternate index on customer ID if you need to access by customer; the alternate index can allow duplicates (NONUNIQUEKEY).

Duplicate keys: primary vs alternate index
Key type	Duplicates allowed?
Primary key (KSDS)	Not allowed. Every record must have a unique primary key.
Alternate index (UNIQUEKEY)	Not allowed. Each alternate key value can appear at most once.
Alternate index (NONUNIQUEKEY)	Allowed. Multiple records can share the same alternate key value.

Alternate Index: UNIQUEKEY vs NONUNIQUEKEY

An alternate index is a secondary access path. It is built over a base cluster (KSDS or ESDS) and allows you to access records by a different key (the alternate key). When you define the alternate index, you specify whether the alternate key must be unique or can have duplicates. UNIQUEKEY means each alternate key value can appear at most once—similar to the primary key. NONUNIQUEKEY means multiple base records can have the same alternate key value. For example, if the base cluster is a KSDS of orders with primary key order-ID, you can define an alternate index on customer-ID with NONUNIQUEKEY so that one customer ID maps to many orders. A path defined over that alternate index lets you read all orders for a given customer. So “duplicate keys” in the sense of multiple records with the same key value are allowed only for alternate keys when you use NONUNIQUEKEY.

What Happens When You Get a Duplicate Key

When you WRITE a record to a KSDS and the primary key already exists, the WRITE fails. VSAM returns a condition code or file status (e.g. 22 for duplicate key in COBOL). The record is not written. Your program should check the file status after each WRITE and branch on duplicate key: for example, display an error, write the key to a report, skip the record, or read the existing record and perform an update instead of an insert. In batch jobs that load from a sequential file, a common approach is to sort the input by key and remove duplicates before writing to the KSDS, or to build the key so it is unique (e.g. add a sequence number or timestamp to the key). That way you avoid duplicate-key errors during the load.

Handling Duplicates Before Load

If the source data can contain duplicate keys, you have several options. (1) Deduplicate: sort the input by key and keep only the first (or last) record per key before writing to the KSDS. (2) Make the key unique: add a field to the key so that each record has a unique key (e.g. line number, sequence number, or timestamp). (3) Update instead of insert: when you get a duplicate-key status, read the existing record by key and REWRITE with the new data (only if that matches your business logic). (4) Log and skip: write duplicate keys to an error file or report and continue. The right choice depends on whether duplicates are errors or expected and how you want to resolve them.

Logical Delete and “Duplicate” Keys

Some applications use a “logical delete” pattern: instead of physically deleting a record, they mark it as deleted (e.g. a flag byte) and may later insert a “new” record with the same key. In a KSDS you cannot have two records with the same primary key. So you cannot insert a new record with the same key as a logically deleted one without first physically deleting the old record (or using a key that is unique, e.g. version number in the key). If you need to support “reuse” of a key, you must delete the old record and then insert the new one; there is no in-place “replace key” for the primary key.

File Status and Return Codes

The exact file status or return code for duplicate key depends on the interface. In COBOL with VSAM, status 22 often means “duplicate key” or “record already exists.” In CICS or other environments the code may be different. Check your compiler or runtime documentation. After every WRITE to a KSDS you should test for this condition and handle it. Example pattern:

cobol

1
2
3
4
5
6
7
8
9
       *> After WRITE to KSDS
           EVALUATE FILE-STATUS
             WHEN '00'
               *> Success
             WHEN '22'
               *> Duplicate key - handle (skip, log, or update)
             WHEN OTHER
               *> Other error
           END-EVALUATE

Key Takeaways

The primary key of a KSDS must be unique. Duplicate primary keys are not allowed; a WRITE with a duplicate key fails (e.g. file status 22).
Alternate indexes can allow duplicate alternate key values when defined with NONUNIQUEKEY; UNIQUEKEY means the alternate key is unique.
Handle duplicate-key returns in the program: skip, log, deduplicate input, or make the key unique (e.g. add sequence number).
To “replace” a record by key you must delete the old record and insert the new one; you cannot change the primary key on REWRITE.

Explain Like I'm Five

The main key (primary key) is like a unique ID for each drawer—no two drawers can have the same ID. If you try to add a second drawer with the same ID, the system says “that ID is already used” and doesn’t add it. But you can have a second kind of label (alternate key) where many drawers can share the same label—like “all drawers for Customer A.” So “duplicate” main IDs are not allowed; “duplicate” second labels are allowed when you set up the file that way.

Test Your Knowledge

1. Can the primary key of a KSDS have duplicate values?

Yes
No, primary key must be unique
Only in alternate index
Only for variable-length records

2. What does NONUNIQUEKEY allow?

Duplicate primary keys
Duplicate alternate key values in an alternate index
Variable key length
Multiple indexes

3. What typically happens when you WRITE a record with a duplicate primary key?

VSAM overwrites the old record
The WRITE fails (e.g. file status 22)
VSAM appends the record
VSAM adds it to an alternate index

VSAM Duplicate Keys

Primary Key: No Duplicates Allowed

Why Duplicates Are Not Allowed for the Primary Key

Alternate Index: UNIQUEKEY vs NONUNIQUEKEY

What Happens When You Get a Duplicate Key

Handling Duplicates Before Load

Logical Delete and “Duplicate” Keys

File Status and Return Codes

Key Takeaways

Explain Like I'm Five

Test Your Knowledge

Test Your Knowledge

Unique keys

Key definition

KSDS structure

Index component