-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What's going on in the Hathi-specifc MARC field 974? #1
Comments
Hi Ben, The 974 field is not a standard MARC 21 field (i.e., not something supported by OCLC). This seems like it might be metadata defined by HathiTrust or possibly by the digital object's owning library. I'll be examining some HT MARC later this week and early next week. I should be able to provide some more definitive answers to you then. Regarding organization codes: There are many sources. The following website lists many of them: https://www.loc.gov/marc/holdings/echdorg.html |
An update on this issue. 974 has been scrutinized in the past. Our metadata librarian here at Illinois provided the following information: "The 974 field is added by Zephir, the CDL that processes HathiTrust metadata. Subfield u works as the HathiTrust item id which is composed of the namespace assigned to it and the identifier provided by the institution. With regards to subfield t, I wouldn't mash the titles together -- they are different accounts of what the title is. I haven't verified it yet but the relationship between 975y and 008 may be the same. The important factor is that everything in the 9XX fields is from a different person than the rest of the record. So you'll need to note it as a source of data bias if you use them. You might be able to do some great comparative analyses on the assertions being made in 974 and the rest of the record but, I wouldn't mix them together. |
OK, that's very helpful. It would be helpful to know what Zephir intends by the
If that is the universal practice, it will be extremely useful for Bookworm; serial misdating is a major issue right now, so any solution to that is (I think--perhaps this needs to be discussed) worth the risks of using hybrid data sources for the records. The online Hathi catalog uses that field for display, so someone in Michigan must have an answer to this. I had not seen |
The following mapping document for organizations may also be helpful: https://docs.google.com/document/d/1ILpVfk5y3auLpBicflpGIJbf9bnNZ9qeJwkW48CZ1Xo/edit Looking over 974z, it seems roughly equivalent to the strings that appear in the "enum" attribute of the json (in the parts beyond the marc-xml blob). I'll see if I can contact someone at the California Digital Library on Monday. Zephir seems to be their brain child. It is odd that 974 doesn't appear in the HathiTrust's documentation but a different field 955 does. It seems possible that these two fields are doing something similar. |
Regarding 974 b and c -- this seems to indicate the source institution. For instance, UC--UCLA refers to the University of California, Los Angeles whereas UC--NRLF refers to the the Northern Regional Library Facility which is a part of the University of California, Berkeley Libraries. |
OK, thanks so much for looking into this. It's great to know where this is coming from. |
After today's conference call I wanted to bump this for @jjett: Here are the remaining questions I have about 974, restated from above with new information.
|
Have cc'd you and Peter on an email with your questions to Jonathan Rothman (@U_Mich) who is on the Zephir team that oversees the HTDL's MARC metadata. |
Copying Bill's answers here for the project's records. Hey Jacob (et. al.) The short answers are: There isn’t a published definition of the 974 — we treat it as an internal implementation detail, so we never bothered. -Bill- HathiTrust Collection Code mapping "mdp" => "University of Michigan", |
Following up: if you folks just throw an @billdueber in any issues where you have questions I might be able to answer, I'll get the ping and see what I can do. Don't guess when you can just ask :-) |
There's lots of good information in the Hathi-specific MARC field 97, but I don't totally understand it.
974b
and974c
both seem to be libraries. They're often, but not always the same. What's the difference?HVD
-> "Harvard University Libraries", or whatever it is?)974t
play compared to the record title field? Is it safe to smash them together?974y
play compared to the record publication date field? Is it safe to always overwrite the record date (which may be a serial beginning date) with974y
(which generally gives the year for the specific serial volume)?The text was updated successfully, but these errors were encountered: