Here are some notes about scope and assumptions for the migration of the OH microsite into the digital collections.
Common identifier
We'll match source and destination records on the "oral history transcript number", which appears in both. For the Rauscher OH, for instance, this ID is "0560": see the citation at https://digital.sciencehistory.org/works/gr4xnkk (destination) and "interview details" at https://oh.sciencehistory.org/oral-histories/rauscher-iii-frank-j (source).
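The matching step could be sketched roughly as follows. This is only an illustration, not the actual migration code: the record layout (plain dicts with a "transcript_number" key), the function names, and the choice to zero-pad IDs to four digits are all assumptions made for the example.

```python
def normalize_transcript_number(raw):
    """Keep only the digits and zero-pad to four places (assumed ID format)."""
    digits = "".join(ch for ch in str(raw) if ch.isdigit())
    return digits.zfill(4) if digits else None


def pair_records(source_records, destination_records):
    """Pair microsite (source) records with digital collections (destination)
    records that share a transcript number.

    Returns (matched, unmatched): matched is a list of (source, destination)
    tuples; unmatched lists source records with no destination counterpart.
    """
    destination_by_id = {}
    for record in destination_records:
        key = normalize_transcript_number(record.get("transcript_number"))
        if key:
            destination_by_id[key] = record

    matched, unmatched = [], []
    for record in source_records:
        key = normalize_transcript_number(record.get("transcript_number"))
        destination = destination_by_id.get(key)
        if destination is None:
            unmatched.append(record)
        else:
            matched.append((record, destination))
    return matched, unmatched


# The Rauscher pair comes from the notes above; "9999" is a made-up ID
# included only to show how an unmatched source record is reported.
source = [
    {"transcript_number": "0560",
     "url": "https://oh.sciencehistory.org/oral-histories/rauscher-iii-frank-j"},
    {"transcript_number": "9999", "url": None},
]
destination = [
    {"transcript_number": "0560",
     "url": "https://digital.sciencehistory.org/works/gr4xnkk"},
]
matched, unmatched = pair_records(source, destination)
```

Reporting the unmatched source records separately gives us a list to review by hand before any data is written.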
Migration:
We plan to migrate the following metadata elements from the Oral History microsite into the digital collections:
Interview
Sponsor field ⬅ correction: this was actually entered manually, and was not part of the migration.
Interviewee
Portrait
Birth and death
Date and place
Education
Date, institution, degree, subject
Career
Start and end dates, institution, role
Awards
Date, award
Interviewer
Name
Profile
Connection between interview and profile
Institutions -> FAST headings
Institution names under Education and Career will be converted, where possible, to the equivalent FAST term.
Create a list of all unique institutions in the current microsite data by automatic extraction from the database.
We'll have to find the corresponding FAST heading for each one and record the pairs in a table, perhaps in Google Docs. This is expected to be a manual process.
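A rough sketch of the two halves of this step: extracting the unique institution names, then applying the manually built table. The record layout, the CSV column names ("microsite_name", "fast_heading"), and the function names are assumptions for illustration, and the sample names in the comments are placeholders rather than real FAST headings.

```python
import csv
import io


def unique_institutions(interviewees):
    """Collect every distinct institution name found under education or career."""
    names = set()
    for person in interviewees:
        entries = person.get("education", []) + person.get("career", [])
        for entry in entries:
            name = (entry.get("institution") or "").strip()
            if name:
                names.add(name)
    return sorted(names)


def load_fast_table(csv_text):
    """Read the manually built mapping table, exported as CSV text.

    Rows with an empty fast_heading are skipped, so institutions without
    a FAST equivalent are simply left out of the table.
    """
    table = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        name = (row.get("microsite_name") or "").strip()
        heading = (row.get("fast_heading") or "").strip()
        if name and heading:
            table[name] = heading
    return table


def to_fast(name, table):
    """Return the FAST heading where one is mapped, else the original name."""
    cleaned = name.strip()
    return table.get(cleaned, cleaned)


# Placeholder usage:
#   table = load_fast_table("microsite_name,fast_heading\nExample Univ.,Example University\n")
#   to_fast("Example Univ.", table)
```

Falling back to the original name reflects the "where possible" wording above: an institution with no FAST equivalent is kept as it appears in the microsite.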
Overwriting data in production
The fields listed above are all currently BLANK in the digital collections. As we refine our migration code, they will be populated with successive versions of data harvested from the microsite, each replacing the previous one.
Careful: this means that if you enter any data *manually* into any of the destination fields in the digital collections, that data will be replaced with fresh microsite data the next time we run a migration.
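To make the overwrite behavior concrete, here is a minimal sketch. The field names and the function are hypothetical, and it assumes that a field missing from the harvested data is blanked rather than left alone; the real migration code may behave differently on that point.

```python
# Hypothetical names for the destination fields that the migration populates.
MIGRATED_FIELDS = ("portrait", "birth", "death", "education", "career",
                   "awards", "interviewer_profiles")


def apply_migration(destination, harvested):
    """Return a copy of the destination record with every migrated field
    replaced by the freshly harvested microsite value.

    Fields outside MIGRATED_FIELDS (for example a manually entered sponsor)
    are left untouched.
    """
    updated = dict(destination)
    for field in MIGRATED_FIELDS:
        # Assumption: a value missing from the harvest blanks the field.
        updated[field] = harvested.get(field)
    return updated
```

Under these assumptions, a manual edit to "awards" in the destination record would be lost on the next run, while a field that is not part of the migration would survive.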
Test plan
The idea here is to compare each sample interview on staging in the digital collections with its corresponding microsite record. If everything looks good, we’ll run the import in production during the first week of May. Careful: don’t actually change the metadata in response to this test, as it will be overwritten anyway.
Sample records:
Interviews with multiple interviewees: Cole/Verma, Aitchisons, …
Interviews with multiple interviewers: Hay, Ehrlich, …
Interviews with FAST headings changed: Yi, …
Other outliers: Schoemaker, …
Metadata to check
Interviewee portrait
Alt text
Caption
Interviewee bio
Birth and death
Date and place
Education
Date, institution (should be FAST), degree, subject
Career
Start and end dates, institution (should be FAST), role
Awards
Date, award
Interviewer
Name
Biography
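Part of this checklist could be run automatically before anyone inspects the records by eye. The sketch below compares one microsite record with its staging counterpart and lists the fields that differ. The field names, record layout, and the optional FAST table (a dict from microsite institution name to FAST heading) are assumptions for illustration only.

```python
# Hypothetical field names for the checklist items above.
SIMPLE_FIELDS = ("portrait_alt_text", "portrait_caption", "birth", "death",
                 "awards", "interviewers")
INSTITUTION_FIELDS = ("education", "career")


def expected_entries(entries, fast_table):
    """Copy microsite entries, swapping each institution name for its FAST
    heading where the table has one."""
    result = []
    for entry in entries or []:
        entry = dict(entry)
        if "institution" in entry:
            name = entry["institution"]
            entry["institution"] = fast_table.get(name, name)
        result.append(entry)
    return result


def compare_records(microsite, staging, fast_table=None):
    """Return the names of the fields whose staging value differs from what
    the microsite record leads us to expect."""
    fast_table = fast_table or {}
    mismatches = []
    for field in SIMPLE_FIELDS:
        if microsite.get(field) != staging.get(field):
            mismatches.append(field)
    for field in INSTITUTION_FIELDS:
        expected = expected_entries(microsite.get(field), fast_table)
        if expected != (staging.get(field) or []):
            mismatches.append(field)
    return mismatches
```

An empty list means the two records agree on every checked field. Anything else is something to report, not something to fix by hand, since manual changes would be overwritten by the next migration run.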