MARC Electronic File Transfer Specifications

Network Development and MARC Standards Office
Library of Congress


The specifications for transfering files of MARC records electronically assume the use of a standard protocol. The File Transfer Protocol (FTP) is the most generally used. FTP can be used in place of a physical exchange medium to move MARC authority, bibliographic, classification, community information, and holdings records from a storage device in an originating system to a receiving system. These electronic file transfer specifications were approved by the Library of Congress and the MARC Advisory Group in June 1993.

The labels used with electronic file transfer, like tape and diskette volume and file labels, communicate information from the originating system to the receiving system. The labels identify and characterize the transaction and the information contained in the data that is being transferred electronically. The labels will make possible an eye-readable display of the identifying information and also enable machine processing of the information.

Electronic Transfer File Organization

Each file of MARC records should be accompanied by a companion label file containing bibliographic file and volume information. The label and record file pair can be transferred separately or they can be compressed (archived) into a single file for transfer by the originating system. It is then up to the receiving system to decompress (restore) the label and record file(s). The concept of physical volumes does not apply to electronic transfers as it does to exchange using dismountable physical media, but the identification of volumes may be particularly useful with groups of files compressed for transfer into a single file. Record blocking is not used and individual records do not span files or transfers. Large logical files of records can be segmented into multiple physical files during separate transfers. The following configurations may be used.

Uncompressed transfer           Compressed transfer

Label File                         Compressed File
Bibliographic File                      Label File 1
                                        Bibliographic File 1
                                        Label File n
                                        Bibliographic File n

Electronic Transfer Label Files

A file of MARC records is transferred electronically following a label file that identifies and characterizes the file of MARC records, and optionally, the volume to which it belongs. NOTE: Volume information is not transmitted in a separate file as is done with tape and diskette. The label file is used to transfer non-MARC information between an originating system and a receiving system. The label file is written using only ASCII Latin characters. A label file is required for each file of MARC records transmitted.

Electronic Transfer File Names

These MARC specifications do not mandate the length or style of file names due to differences between operating systems. Exchange partners should agree upon file naming conventions prior to transferring MARC records electronically.

Structure

A label file consists of a sequence of fixed-length and variable fields. The order of fields should be the same as the list below. The length of fixed-length fields is specified in the descriptions of those fields. Variable length fields have no maximum length. Each field in a label file begins with a three-character tag followed by two blanks (ASCII character 2016). The field's data content follows the two blanks. Each field ends with a carriage return (ASCII control character hexadecimal OD) or a carriage return/line feed pair (ASCII control characters hexadecimal OD OA) immediately following the data in the field. The end-of-field character (ASCII control character hexadecimal 1E) is no longer used.

Certain fields in the label files are specified as mandatory and must be present in each label file. In these fields, data content or one fill character the vertical bar ("|" - ASCII character 7C16), must be supplied. Mandatory or optional characteristics of the fields are also noted below. Several of the fields are repeatable.

Electronic Transfer File (and Volume) Labels Tags/Fields

The following order of the fields in the file label is mandatory whether or not all fields are present

                                           Mandatory    Fixed/
Tag   Element Name        Description       /Optional   Variable   Repeatable
DAT   Date compiled       YYYYMMDDHHMMSS.F      M          F           NR
RBF   Number of records   Numeric               M          V           NR
DSN   Data Set Name       Alphanumeric          M          V           NR
ORS   Originating sys ID  Alphanumeric          M          V           NR
CID   Country ID          Alphanumeric          O          F           NR
DTS   Date sent           YYYYMMDDHHMMSS.F      O          F           NR
DTR   Dates of records    YYYYMMDDYYYYMMDD      O          F           NR
FOR   Format              Alphanumeric          M          F           NR
FQF   Format qualifier    Alphanumeric          O          V           NR
DES   Description         Alphanumeric          O          V            R
CS0-n Character set 0-n   Alphanumeric          O          V           NR
CV0-n Set variations 0-n  Alphanumeric          O          V           NR
VOL   Volume              Alphanumeric          O          V            R
ISS   Issue               Alphanumeric          O          V            R
FDI   Final Dest. ID      Alphanumeric          O          V           NR
REP   Reply to            Alphanumeric          O          V            R
NOT   Note                Alphanumeric          O          V            R

DAT (Date compiled):
Mandatory; Fixed length; Not repeatable. This is the date the originating system completed the compilation of the file of records. This is not the date of the creation of the records contained in the bibliographic file. The field is recorded according to Representation for Calendar Date and Ordinal Date for Information Interchange (ANSI X3.30) and Representations of Local Time of the Day for Information Interchange (ANSI X3.43). The date requires 8 numeric characters in the pattern yyyymmdd (4 for the year, 2 for the month, and 2 for the day; right justified and zero filled). The time requires 8 numeric characters in the pattern hhmmss.f (2 for the hour, 2 for the minute, 2 for the second, and 2 for a decimal fraction of the second, including the decimal point). The 24-hour clock is used.
RBF (Number of records in file):
Mandatory; Variable length; Non-repeatable. This element includes the number of logical records contained in the file of MARC records.
DSN (Data Set Name):
Mandatory; Variable length; Not repeatable. The filename of the file of MARC records (which is sent separately) for which this is a file label.
ORS (Originating system ID):
Mandatory; Variable length; Not repeatable. The name of the system that compiled the files of records. This could be a symbol (e.g., OCLC or NUC) or text.
CID (Country identifier):
Optional; Fixed length; Not repeatable. The country identifier of the system that compiled the files of records. The identifier is taken from Codes for representation of names of countries (ISO 3166).
DTS (Date sent):
Optional; Fixed length; Not repeatable. This is the date of transmission of the file of MARC records. The field is recorded according to Representation for Calendar Date and Ordinal Date for Information Interchange (ANSI X3.30) and Representations of Local Time of the Day for Information Interchange (ANSI X3.43). The date requires 8 numeric characters in the pattern yyyymmdd (4 for the year, 2 for the month, and 2 for the day; right justified and zero filled). The time requires 8 numeric characters in the pattern hhmmss.f (2 for the hour, 2 for the minute, 2 for the second, and 2 for a decimal fraction of the second, including the decimal point). The 24-hour clock is used.
DTR (Dates of records):
Optional; Fixed length; Not repeatable. This includes inclusive dates of last transaction of the records in the file, i.e. the first and last date recorded in the 005 fields of the file of records. The field is recorded according to Representation for Calendar Date and Ordinal Date for Information Interchange (ANSI X3.30). The date requires 16 numeric characters in the pattern yyyymmddyyymmdd (4 for the year, 2 for the month, and 2 for the day for each date; right justified and zero filled).
FOR (Format):
Mandatory; Fixed length; Not repeatable. This element designates the format of the records, generally M for MARC. Other codes may be defined as needed (e.g., Z for Z39.2 for copyright records).
FQF (Format qualifier):
Optional; Variable length; Not repeatable. This element provides additional description of the format of the record file. For example, it may identify a particular tag set/specification for MARC records or a particular DTD for SGML records. For MARC formats, the content of the FQF field may be text of a code from the list Z39.50 registered record syntaxes. For DTDs, the content is the identifier in the DTD DOCTYPE element.
DES (Description of records):
Optional; Variable length; Repeatable. This element describes the records. The data could be coded or describe a product name. (For example, OCLC uses B for Bibliographic describing a data type; CDS may use a product name, such as MDS-Books All.)
CS0-n (Character set 0-n):
Optional; Variable length; Not repeatable. These fields specify the character sets (control and/or graphic) needed for processing the record data file. The field content is text indicating a particular set (e.g., ISO 646-IRV, ISO Registration #37, MARC, or a reference to a private character set). CS0 indicates at least the G0 set and CS1-n indicate other sets in the file.
CV0-n (Character variation 0-n):
Optional; Variable length; Repeatable. These fields are used in conjunction with the CS fields and contain a textual description of the variations from the set specified in the corresponding CSn field. Variations noted in this field may be that the set: 1) was not used strictly according to the standard; 2) has options for some positions that need to be specified; or 3) has additional characters in positions that are undefined in the standard.
VOL (Volume):
Optional; Variable length; Repeatable. This may be used if it is desirable to assign a volume number when distribution of records is by subscription. Each file within a subscription year may be given a volume and issue number.
ISS (Issue):
Optional; Variable length; Repeatable. This may be used if it is desirable to assign a volume and issue number when distribution of records is by subscription. Each file within a subscription year may be given a volume and issue number. It may be combined with Volume (e.g., V1402).
FDI (Final destination ID):
Optional; Variable length; Not repeatable. This field would contain the name or identifier of the final-destination database.
REP (Reply to):
Optional; Variable length; Repeatable. This field contains an address given as a contact for problems/questions in transmission. It may include an Internet or postal address.
NOT (Note):
Optional; Variable length; Repeatable. This field contains textual information or messages about the file.

Electronic Transfer of MARC Label and Record Files

Each file is transferred electronically as a separate operation. If no compression is involved, a label file should be transferred preceding the MARC record file to which it relates. If label and record files have been compressed, the transfer of all data can occur as a single operation to the receiving system. In this case, the receiving system must process the compressed file to restore the label and record file(s) as separate files.

Example of an electronic file transfer label file: ("<EOF>" = end-of-field character)


[Example of an electronic file
label]



Go to:


Library of Congress
Library of Congress Help Desk ( 12/31/2002 )