ArchivesSpace (or ASpace for short) is a server whose main purpose is to host a software program also named… ArchivesSpace. The program is “an open source archives information management application for managing and providing web access to archives, manuscripts and digital objects”. The server also hosts a few auxiliary programs who take the output from ArchivesSpace and convert it into various other formats, which are then made available via an Apache webserver on the same machine.

Child pages (Children Display)

We store digital descriptions of our archival collections in the following six places:

LocationFormat	Type of technology	Number of collections described	Source	Example	Who can see it?
`Shared/P/Othmer Library/Archives/Collections Inventories/Archival Finding Aids and Box Lists`	Word documents	Roughly 270, dates 1997 – present.	This is the initial description we create upon accessioning a collection.	`P/Othmer Library/Archives/Collections Inventories/Archival Finding Aids and Box Lists/Labovsky Collection Finding Aid.doc`	Institute staff
ArchivesSpace sitepublic user interface (PUI)	MySQL-backed website	Same as below		Tischler papers	Public
ArchivesSpace admin site	Same as above	Roughly 120 as of 2022	Entered manually based on the P drive Word files.	https://archives.sciencehistory.org/resources/81#tree::resource_81 Tischler papers	Only logged in ArchivesSpace users
S3 EAD bucket	EAD (xml format)	Roughly 120 as of 2022	Generated nightly from ArchivesSpace database	http://ead.sciencehistory.org/	Public	ArchivesSpace Apache front end	HTML	Roughly 120 as of 2022	Generated nightly from ArchivesSpace database. These will be replaced by pages in the PUI in summer 2022.https://archives.sciencehistory.org/2012-021.html	Public
OPAC	PDF	460; see complete list	Exported manually as PDF from the ArchivesSpace site, then attached to the OPAC record for the collection	https://othmerlib.sciencehistory.org/articles/1065801.15134/1.PDF	Public
https://guides.othmerlibrary.sciencehistory.org/friendly.php?s=CHFArchives	LibGuide	Most collections, categorized by subject.	Created and maintained by Ashley Augustyniak	Subject: nuclear chemistry	Technically public, but does not appear to be linked from anywhere.

Workflow

Finding aids are stored first written up as Word documents at Shared/P/Othmer Library/Archives/Collections Inventories/Archival Finding Aids and Box Lists.
Kent , Birch and Patrick B. enter enters the data in them, one by one, into ArchivesSpace. They finding aids are revised in the process.
Once they are in ArchivesSpace:
- They are automatically exported by https://chemheritagegithub.atlassian.net/wiki/spaces/HDCSD/pages/2151514113com/sciencehistory/export+_archivesspace+_xml to EAD files at http://ead.sciencehistory.org/They are also converted to HTML. Examples: Wotiz; Simon; Fenn; Carbogel; Brody. There is currently no Web page that lists these HTML files.
- Kent also exports them to a PDF, which he then sends to Caroline. These are entered into the OPAC. (see e.g. https://othmerlib.sciencehistory.org/articles/1065801.15134/1.PDF )
  - Note: the PDF has to be manually updated in the OPAC every time the metadata in ArchivesSpace changes.
- In certain cases the The OPAC record also points at the HTML file at to a PUI URL at https://archives.sciencehistory.org/ .
- Certain works in the Digital Collections also point to these HTML filesthe PUI. Example: https://digital.sciencehistory.org/works/81jkowj.
Finally, the exported EAD files are also ingested by University of Penn Libraries Special Collections and the Center for the History of Science, Technology, and Medicine (CHSTM).
- Penn, in turn, processes these EAD files on a nightly basis and adds them to the Philadelphia Area Archives Research Portal (PAARP)search portal, a service funded by PACSCL.
  Example: http://dla.library.upenn.edu/dla/pacscl/detail
  .
  html?id=PACSCL_SCIHIST_2012021USpaphchf
Likewise, CHSTM ingests these EADs and makes them searchable at its search portal.
- Example https://www.chstm.org/collections/search?text=Carbogel
Note that external links to our HTML finding aids are rare and can be disregarded. There should be no need to provide redirects to these URLS when we eliminate them.

OBSOLETE – Technical details about the server

This section should be removed later in 2022.

ArchivesSpace lives on an AWS S3 server ArchivesSpace-prod, at https://50.16.132.240/ (also found at https://archives.sciencehistory.org)

The current production version of Aspace is 3.0.1 .

Terminal access: ssh -i /path/to/production/pem_file.pem ubuntu@50.16.132.240

The ubuntu user owns all the admin scripts.

The relevant Ansible role is: /roles/archivesspace/ in the ansible-inventory codebase.

SSL is based on the following: http://www.rubydoc.info/github/archivesspace/archivesspace

The executables are at /opt/archivesspace/

The configuration file is /opt/archivesspace/config/config.rb
Logs are at: logs/archivesspace.out

Apache server is at /var/log/apache2/

Configuration for the Apache site is at /etc/apache2/sites-available/000-default.conf.

Main users

Kenton Jaehnig
Sarah Newhouse
Patrick Shea
Birch Mezzaroba (intern)
Patrick Burden (intern)

OBSOLETE – Startup

To start Archivesspace:
- /opt/archivesspace/archivesspace.sh start (as user ubuntu)
There may be a short delay as the server re-indexes data.

...

OBSOLETE –Restarting the server to fix Tomcat memory leak

We restart the ArchivesSpace program (not the server) using a cronjob that runs /opt/archivesspace/archivesspace.sh restart every night at 2 am. This prevents a chronic memory leak from eating up all the CPU credits for the machine.

When the server is restarted, Jetty creates a set of temporary files in /tmp

that look like this:

hsperfdata_ubuntu
jetty-0.0.0.0-8089-backend.war-_-any-3200460420275417425
jetty-0.0.0.0-8090-solr.war--any-_1669707332158985985
jetty-0.0.0.0-8091-indexer.war-_aspace-indexer-any-3026688914663148716
jetty-0.0.0.0-8080-frontend.war--any-3028692540497613460
jetty-0.0.0.0-8081-public.war--any-268053434795494538
jetty-0.0.0.0-8082-oai.war--any-_243630232179303838

Only the most recent set are used by Jetty, but the old ones accumulate rapidly if the server is restarted nightly.

A cron job removes obsolete ones nightly.

Export

The ArchivesSpace EADs are harvested by:

Institution	Liaison	Contact
Center for the History of Science, Technology, and Medicine (CHSTM)	Richard Shrake	shraker13@gmail.com
University of Penn Libraries Special Collections	Holly Mengel	hmengel@pobox.upenn.edu

Both institutions harvest the EADs by automatically scrapinghttp://ead.sciencehistory.org/.

OBSOLETE – Building the server

The server not yet fully ansible-ized.

What is missing from the ansible build:

The build doesn’t copy the scripts in /home/ubuntu over correctly. Passwords for the scripts also need to be provided.
All these directories under /var/www/html/ are also missing: css; ead; font-awesome-4.7.0; fonts; img; js.
The ubuntu user needs to be added to the www-data group
SSH keys are not loaded into /etc/ssl/private/
The archivesspace server is not actually started (sudo systemctl start archivesspace).

OBSOLETE – Backups

These consist of making backups of the sql database used by the ArchivesSpace program.

Place the Mysql database in /backup

mysql-backup.sh

Dumps the mysql database to /backup/aspace-backup.sql.
This script is run as a crontab by user ubuntu : 30 17 * * 1-5 /home/ubuntu/archivesspace_scripts/mysql-backup.sh

Sync /backup to an s3 bucket

s3-backup.sh

Runs an aws s3 sync command to place the contents of /backup at https://s3.console.aws.amazon.com/s3/object/chf-hydra-backup/Aspace/aspace-backup.sql?region=us-west-2&tab=overview.

This script is run as a crontab by user ubuntu : 45 17 * * 1-5 /home/ubuntu/archivesspace_scripts/s3-backup.sh

See Backups and Recovery (Historical notes) for a discussion of how the chf-hydra-backup s3 bucket is then copied to Dubnium and in-house storage.

OBSOLETE – Restoring from backup

You can get a recent backup of the database at https://s3.console.aws.amazon.com/s3/object/chf-hydra-backup/Aspace/aspace-backup.sql

Note that the create_aspace.yml playbook creates a minimal, basically empty aspace database with no actual archival data in it.

To restore from such a backup onto a freshly-created ArchivesSpace server,

copy your backup database to an arbitrary location on the new server
ssh in to the new server
Log into the empty archivesspace database:
- mysql archivesspace --password='the_archivessace_database_password' --user=the_user
Once at the mysql command prompt, load the database:
- mysql> \. /path/to/your/aspace-backup.sql

...

If you have a sciencehistory.orgaddress, you can get access to it by filling out a form.See also https://github.com/sciencehistory/ansible

Versions Compared

Old Version 75

New Version 76

Key

Workflow

OBSOLETE – Technical details about the server

Main users

OBSOLETE – Startup

OBSOLETE –Restarting the server to fix Tomcat memory leak

Export

OBSOLETE – Building the server

OBSOLETE – Backups

OBSOLETE – Restoring from backup

Page Comparison

Versions Compared

Old Version 75

New Version 76

Key

Workflow

OBSOLETE – Technical details about the server

Main users

OBSOLETE – Startup

OBSOLETE –Restarting the server to fix Tomcat memory leak

Export

OBSOLETE – Building the server

OBSOLETE – Backups

OBSOLETE – Restoring from backup