MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media

Summary List of MARC 21 Characters Assigned to the Private Use Area (PUA)

Revised September 2, 2004

Introduction

This document brings together the remaining mappings for 61 MARC-8 East Asian Character Code (EACC) values that were mapped to Unicode character values in the Private Use Area (PUA). The other former PUA characters have been identified in Unicode and the mappings changed to regular Unicode encodings. (See Changes to Unicode Mappings for MARC 21 Characters formerly assigned to the Private Use Area.) Alternative non-PUA character mappings to the geta character are indicated in this document to facilitate implementation of the MARC character repertoire in systems where it is not possible or desirable to include characters from the Private Use Area.

The characters are divided into two groups: ancient Korean hangul characters (26) and component characters used in RLIN's CJK input method (35).

These characters are all listed in the MARC 21 character set tables. They are also separately listed here for ease of identification of current PUA characters valid in MARC records.

Issues Related to PUA Characters

MARC-8 mappings to PUA values are provided for users who need a high level of roundtrip compatibility when data is converted to UCS/Unicode and back to MARC-8, or who need to keep any of the 61 characters in this group distinct from other Unicode characters.

Implementers who choose to retain PUA characters in their UCS/Unicode data should note the following. Characters assigned to PUA values are, by definition, locally defined and not natively supported by off-the-shelf Unicode-compliant software and hardware. This can be problematic for display and printing of data, since fonts may not include a meaningful glyph for any PUA character value. Character shapes and meanings are necessarily ambiguous for PUA character values, so they usually display and print as some sort of fill character, if they are supported at all. No special normalization, folding, indexing, or retrieval capability can be assumed for PUA characters. Implementers should also recognize that once data has been converted from MARC-8 to UCS/Unicode using the alternative geta, it will not be possible to return to the original EACC values.
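The roundtrip trade-off described above can be illustrated with a minimal Python sketch. It uses only three rows taken from the tables in this document; the names EACC_TO_PUA, eacc_to_unicode, and unicode_to_eacc are hypothetical and not part of any MARC library. The sketch assumes a converter that offers a choice between the PUA mapping and the geta alternative.

```python
# Illustrative subset of the mapping tables in this document (not the full list).
EACC_TO_PUA = {
    0x6F7624: 0xE8B0,  # Korean hangul (ancient)
    0x6F7640: 0xE8B2,  # Korean hangul (ancient)
    0x212A21: 0xE8D0,  # EACC component character
}
GETA = "\u3013"  # alternative non-PUA mapping listed for every row

# Reverse table: only PUA values identify a single EACC value.
PUA_TO_EACC = {pua: eacc for eacc, pua in EACC_TO_PUA.items()}


def eacc_to_unicode(eacc: int, use_pua: bool = True) -> str:
    """Map one EACC value to a Unicode character (hypothetical helper)."""
    if eacc not in EACC_TO_PUA:
        raise KeyError(f"{eacc:06X} is not in this illustrative table")
    return chr(EACC_TO_PUA[eacc]) if use_pua else GETA


def unicode_to_eacc(ch: str):
    """Return the original EACC value, or None when it cannot be recovered."""
    return PUA_TO_EACC.get(ord(ch))


# PUA mapping: the original EACC value survives the roundtrip.
assert unicode_to_eacc(eacc_to_unicode(0x6F7640)) == 0x6F7640
# Geta mapping: every value collapses to U+3013, so the origin is lost.
assert unicode_to_eacc(eacc_to_unicode(0x6F7640, use_pua=False)) is None
```

In this sketch the geta path is lossy by construction: all entries share one target character, so no reverse lookup among these values is possible.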

In the future, some MARC-8 characters currently mapped to the PUA may become mappable to non-PUA characters due to the addition of new East Asian characters. This list will be evaluated occasionally by experts to determine whether any of these characters have been added to Unicode.

Ancient Korean hangul characters

Mapped to geta character: 26

MARC8   PUA   Character Name or Description  Alternate
6F7624  E8B0  Korean hangul (ancient)        〓 (3013)
6F7640  E8B2  Korean hangul (ancient)        〓 (3013)
6F7641  E8B3  Korean hangul (ancient)        〓 (3013)
6F7642  E8B4  Korean hangul (ancient)        〓 (3013)
6F7643  E8B5  Korean hangul (ancient)        〓 (3013)
6F7644  E8B6  Korean hangul (ancient)        〓 (3013)
6F7645  E8B7  Korean hangul (ancient)        〓 (3013)
6F7646  E8B8  Korean hangul (ancient)        〓 (3013)
6F7647  E8B9  Korean hangul (ancient)        〓 (3013)
6F7648  E8BA  Korean hangul (ancient)        〓 (3013)
6F7649  E8BB  Korean hangul (ancient)        〓 (3013)
6F764A  E8BC  Korean hangul (ancient)        〓 (3013)
6F764B  E8BD  Korean hangul (ancient)        〓 (3013)
6F764C  E8BE  Korean hangul (ancient)        〓 (3013)
6F764D  E8BF  Korean hangul (ancient)        〓 (3013)
6F764E  E8C0  Korean hangul (ancient)        〓 (3013)
6F764F  E8C1  Korean hangul (ancient)        〓 (3013)
6F7650  E8C2  Korean hangul (ancient)        〓 (3013)
6F7651  E8C3  Korean hangul (ancient)        〓 (3013)
6F7652  E8C4  Korean hangul (ancient)        〓 (3013)
6F7653  E8C5  Korean hangul (ancient)        〓 (3013)
6F7654  E8C6  Korean hangul (ancient)        〓 (3013)
6F7655  E8C7  Korean hangul (ancient)        〓 (3013)
6F7656  E8C8  Korean hangul (ancient)        〓 (3013)
6F7657  E8C9  Korean hangul (ancient)        〓 (3013)
6F7723  E8CA  Korean hangul                  〓 (3013)
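
Because most rows in the table above run in parallel sequences, the mapping can be built compactly rather than typed row by row. The following Python sketch is one way to do that; build_range and ANCIENT_HANGUL are hypothetical names chosen for illustration, and the range boundaries are read directly from the table.

```python
def build_range(eacc_first: int, eacc_last: int, pua_first: int) -> dict:
    """Pair consecutive EACC values with consecutive PUA values."""
    count = eacc_last - eacc_first + 1
    return {eacc_first + i: pua_first + i for i in range(count)}


# One isolated row, one contiguous run, and one final isolated row.
ANCIENT_HANGUL = {0x6F7624: 0xE8B0}
ANCIENT_HANGUL.update(build_range(0x6F7640, 0x6F7657, 0xE8B2))
ANCIENT_HANGUL[0x6F7723] = 0xE8CA

# Sanity checks against the table: 26 rows, with matching endpoints.
assert len(ANCIENT_HANGUL) == 26
assert ANCIENT_HANGUL[0x6F7657] == 0xE8C9
```

The count check guards against an off-by-one error in the range boundaries, which is the most likely mistake when transcribing a table this way.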

Component characters used in RLIN's CJK input method

These characters should not appear in communications data.

Count: 35

MARC8   PUA   Character Name or Description  Alternate
212A21  E8D0  EACC component character       〓 (3013)
212A22  E8D1  EACC component character       〓 (3013)
212A23  E8D2  EACC component character       〓 (3013)
212A24  E8D3  EACC component character       〓 (3013)
212A25  E8D4  EACC component character       〓 (3013)
212A26  E8D5  EACC component character       〓 (3013)
212A28  E8D6  EACC component character       〓 (3013)
212A29  E8D7  EACC component character       〓 (3013)
212A2A  E8D8  EACC component character       〓 (3013)
212A2B  E8D9  EACC component character       〓 (3013)
212A2C  E8DA  EACC component character       〓 (3013)
212A2D  E8DB  EACC component character       〓 (3013)
212A2E  E8DC  EACC component character       〓 (3013)
212A2F  E8DD  EACC component character       〓 (3013)
212A30  E8DE  EACC component character       〓 (3013)
212A32  E8DF  EACC component character       〓 (3013)
212A33  E8E0  EACC component character       〓 (3013)
212A34  E8E1  EACC component character       〓 (3013)
212A35  E8E2  EACC component character       〓 (3013)
212A36  E8E3  EACC component character       〓 (3013)
212A37  E8E4  EACC component character       〓 (3013)
212A38  E8E5  EACC component character       〓 (3013)
212A39  E8E6  EACC component character       〓 (3013)
212A3A  E8E7  EACC component character       〓 (3013)
212A3B  E8E8  EACC component character       〓 (3013)
212A3C  E8E9  EACC component character       〓 (3013)
212A3D  E8EA  EACC component character       〓 (3013)
212A3E  E8EB  EACC component character       〓 (3013)
212A3F  E8EC  EACC component character       〓 (3013)
212A40  E8ED  EACC component character       〓 (3013)
212A41  E8EE  EACC component character       〓 (3013)
212A42  E8EF  EACC component character       〓 (3013)
212A43  E8F0  EACC component character       〓 (3013)
212A44  E8F1  EACC component character       〓 (3013)
212A45  E8F2  EACC component character       〓 (3013)
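
Since these component characters should not appear in communications data, a system that retains PUA values may want to flag them in converted records. The Python sketch below is one possible check, relying on the fact that the PUA values in the table above form a single contiguous block from E8D0 to E8F2. The function name find_component_characters is hypothetical, and the input is assumed to be an already decoded Unicode string.

```python
# PUA block assigned to the 35 EACC component characters in the table above.
COMPONENT_PUA_FIRST = 0xE8D0
COMPONENT_PUA_LAST = 0xE8F2


def find_component_characters(text: str) -> list:
    """Return (position, hex code point) for each component character found."""
    return [
        (index, f"{ord(ch):04X}")
        for index, ch in enumerate(text)
        if COMPONENT_PUA_FIRST <= ord(ch) <= COMPONENT_PUA_LAST
    ]


# The block covers exactly 35 code points, matching the count given above.
assert COMPONENT_PUA_LAST - COMPONENT_PUA_FIRST + 1 == 35
# A record field containing U+E8D0 is flagged at its position.
assert find_component_characters("a\ue8d0b") == [(1, "E8D0")]
```

The check deliberately ignores the ancient hangul PUA values (E8B0 to E8CA), which are valid in records and are therefore not reported.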

( 12/07/2007 )