Backups and Recovery

Here is the current backup strategy as a diagram:

Recovery Options

Fedora:

S3:

Currently we are using the s3 sync tool (akin to rsync for S3) to pull over key fedora data into the chf-hydra-backup bucket. This is a slight misnomer as it handles backups for ArchivesSpace as well now, but Fedora data is pulled over into

FedoraBackup (contains all Fedora binary data)

PGSql (contains the Fedora Postgres database fcrepo_backup)

Both sets are needed to do a full restore.

Note: As a reminder while S3's visual interface uses folders, those locations are actually just the first step in a path of individual block stored objects, folders do not exist in S3.

Stop Tomcat
Download the Postgres database fcrepo_backup.sql to the Fedora machine.
Fedora might still have active connections to postgres, a restart sudo service postgresql restart will kill them.
Import the database psql fcrepo < fcrepo_backup.sql
1. If the database already exists, such as a sync, you will want to drop the existing database and then run the command.
Check that the user trilby has permissions to access and use the newly made fcrepo database.
Delete the existing folder(s) inside /opt/fedora-data (This step is not always required but makes it simpler)
Using screen or tmux start an aws s3 sync to copy all the data over in the FedoraBackup "folder" to /opt/fedora-data
Wait a while for all the data (>800 GB) to copy over.
Run chown -R tomcat7:tomcat7 /opt/fedora-data to give ownership on the new files to the tomcat user so Fedora can access them.
Restart Tomcat
This will restore the Fedora database. Current cost estimates (2/18) are about $.10 to do this restore.

Users:

Go to S3 and download the postgres backup files.
Stop Apache
Restart the postgres service, this should remove the default connection to the Hydra database that Hydra has when running so you can change it.
In Postgres delete the automatically generated chf_hydra database
1. Log in via psql -u postgres
  1. The postgres account password is in ansible-vault (groupvars/all)
2. Run: DROP DATABASE chf_hydra;
3. Run: CREATE DATABASE chf_hydra;
Then import the downloaded database
1. Either:
  1. pg_restore -d chf_hydra -U postgres chf_hydra.dump
  2. psql chf_hydra < chf_hydra_dump.sql
Then set permissions
1. psql -U postgres
2. GRANT Create,Connect,Temporary ON DATABASE chf_hydra TO chf_pg_hydra;
You may now restart postgres and Apache2

Minter:

The minter is now part of postgres, just restore the chf_hydra database for the users to app and the minter will be restored.

Redis

Redis keeps a database in memory which handles the transaction record data such as the history of edits on a record. It does not contain the actual data, simply the timeline of changes. Losing this causes the history of object edits to be lost, but the objects themselves will be fine.

Copy over redis-dump/dump.rdb to App
It must be changed to be owned by redis
1. sudo chmod -R redis:redis filename
Then you will need to stop the redis server
1. sudo service redis-server stop
Move redis-dump.rdb to /var/lib/redis/dump.rdb overwriting the existing file there called dump.rdb
Restart redis
1. sudo service redis-server start
When starting up redis will read the .rdb dump file and copy that data back to the in memory database.

Indexing:

The index is being backed up to speed up the time to recovery for DR or migrations. If you cannot access it, a manual reindex can be done with the instructions in Application administration. This process takes at least one business day, so is not recommended versus rebuilding from the backup.

In the chf-hydra-backup, pull down the solr-backup.tar.gz file under Solr to the Solr server.
Extract the archive
Use the solr restore commands at Application administration

Last steps:

Costs

A quick cost analysis has restoration costing $30-35 dollars, this is as of 6/11/2018 with approximately 1 TB of data. Approximately 66% of the cost was due to inter-region transfer fees (moving data from US-WEST to US-EAST). The rest is standard LIST, GET, and related fees.