MARC to CT Crosswalk

MARC to CT Crosswalk is based on Library of Congress Crosswalks with metadata of Harvard (MARC) and UIUC (MARCXML), and MARC tag usages of Harvard, WorldCat, and UIUC (Harvard MARC records were downloaded on May 1, 2013 from this link: http://openmetadata.lib.harvard.edu/bibdata/data; UIUC MARCXML records are May, 2010 version). The following is a little part of the MARC to CT crosswalk. The full version is in MARC to CT crosswalk.

*highly used by search engines *highly used tags over 50% Consider tags more used than 1%

MARC tag

--Leader and Directory--

LEADER

=LDR 00709nam a2200217I 4500

!!(Leader/06 and Leader/07)

--Control Fields (001-006)--

=001 001000001-1

!!

002: {dollar}{dollar}a1003-6962

003: OCLC

004: 004880179

=005 20020606090541.3

Usage of Marc tags in Harvard Records

100.0

100.0

0.000101582673684

0.0572223014914

3.90702591093e-05

100.0

Usage of Marc tags in WorldCat, by

2010 Oct.1, 2013

0

100% of WorldCat in 2010 report,

100.9457:10/1/2013

x: not in WorldCat.

x

2010,

5.94E-06

Oct.1, 2013

x.

x: 2013,

and 2010 report.

Usage of Marc tags

UIUC

MarcXml records

100.0

99.9999955452

x

x

x

99.9999955452

Common Terminology

(CT)

description

type=” recordinfo”

changed (031814) from

object=”record” type=”info”

(12/19/13)(08/28/13).

Ignored (07/2013).

typeGenre no type

authority=”LC MARC type”

-typeGenre values will be translated values based on the tables of leader 06 and leader 07.

Leader 07 - Bibliographic level

a - Monographic component part

b - Serial component part

c - Collection

d – Subunit

i - Integrating resource

m - Monograph/Item

s – Serial

description

type=”issuance”

(10/21/13)

identifier type=”control number” source=”harvard” (9/11/13). It is changed from type=”record” of identifier (8/28/13, July,2013).

Ignored

identifier

type=”control number” source=”harvard”

merged (021314) from

ignored, (09/04/13)and

identifier type=”control number identifier” (8/28/13, July,2013)

ignored(aug. 2013).

Ignored (aug. 2013).

identifier type=”control number” (July,2013)

description

type=” recordinfo”

changed (031814) from

type=”info” object=”record” Adding object=”record” will be better to clarify the info. is about a record not about a resource, and to preserve record info.(12/19/13). type=record info” (11/27/13).

ignored (10/11/13)

*date type=” latest record transaction”

(10/10/13)

Explanation.

The research team, have changed the concept of the Common Terminology not only for locally LC, Harvard, MIT, and UIUC of the U.S. but for globally including WorldCat, as an international Common Terminology. The CT may be used for world libraries systems and search engines (08/28/13). It is based on all 12 million Harvard Records and 9 million UIUC MarcXml records.

It is added, because it contains important information about the record, and it is used in all Harvard and UIUC records. And all values will be transferred into description not separating the info for detail (012314).

Ignored, because there is no mapping to MODS, and it is leader about the record, not about object/item(07/2013).

Leader 06, Leader 07

Leader 06 - Type of record

001 - Control Number

Control number assigned by the organization creating, using, or distributing the record.

An organization using a record of another organization may move the incoming control number from field 001 (and the control number identifier from field 003) to field 035 (System Control Number), 010 (Library of Congress Control Number), or 016 (National Bibliographic Agency Control Number), as appropriate, and place its own system control number in field 001 (and its control number identifier in field 003).

Unassigned, ISSN? It is not in LC.

Because it is only 13 times used. Harvard only.

Because, the MARC code identifying whose system control number is present in field 001 is contained in field 003 (Control Number Identifier)(LC) (021314).

because It is not used in WorldCat in 2010 report (09/04/13) and UIUC records (10/10/13).

It is added, because we need to preserve information whose system control number is present in field 001 (8/28/13, July,2013)

because it is used very few. (aug. 2013).

003 - Control Number Identifier

004 - Control Number for Related Bibliographic Record

because it is used very few in Harvard, and not in LC and WorldCat nor UIUC.

It is grouped with record info in description (11/27/13).

Because it is about date for record not the resource (10/11/13).

Although it is not used for search, Harvard and UIUC use it almost 100%. Thus, the information is preserved (10/10/13).

Because the information is not used for any searches although it serves as a version (9/11/13).

It is added, because the date and time of latest transaction serve as a version identifier for the record (08/28/13).

ignored (9/11/13) from date type=”record change”

Ignored??, although it is about record not object, it should be preserved?, because it is mostly used in Harvard records? (Aug. 19, 2013).Let’s ignore, because it is about the lastest record transaction (Aug. 26, 2013).

005- Date and Time of Latest Transaction (NR), but there is no mapping to MODS.