General outline
- Move over minter
- Fedora Export - see below
- migrate postgres
- Fedora Import - see below
- run (currently nonexistent) verification job
- migrate dump.rdb
- Reindex solr
Spin up machine
Run ansible scripts
Box Build changes
The current build scripts in Ansible have problems with a few changes we need.
Creating Drives
- In the AWS visual interface, go to EC2
- Go to Volumes
- Select Create Volumes
- Make sure the volume is:
- General Purpose SSD
- 150 GB
- Availability Zone b
- Create 2 of these
- Once each one is made, select it and under Actions choose Attach Volume. Type the name or id of the machine and attach the volume.
- ssh into the box
- Run sudo fdisk -l
- You should see /dev/xvdg and /dev/xvdh
- If not, check if the volumes are attached
- Create the filesystem for each disk
- sudo mkfs.xfs /dev/xvdg
- sudo mkfs.xfs /dev/xvdh
- Mount each disk
- sudo mount /dev/xvdg /opt/fedora-data
- sudo mount /dev/xvdh /opt/sufia-project/releases/XXXX/tmp
- Edit the fstab file to retain these mounts
- sudo vi /etc/fstab
- /dev/xvdg /opt/fedora-data xfs defaults 0 0
- /dev/xvdh /opt/sufia-project/releases/XXXX/tmp xfs defaults 0 0
- Change the owner of the two mount locations
- sudo chown -R tomcat7:tomcat7 /opt/fedora-data
- sudo chown -R hydep:deploy /opt/sufia-project/releases/XXXX/tmp
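The format, mount, and chown steps above follow the same pattern for both volumes, so they can be captured in a small helper. This is a minimal sketch, not a tested provisioning script: the function names plan_volume and fstab_line are made up for illustration, and the helper only prints the commands so they can be reviewed before being run by hand.

```shell
#!/usr/bin/env bash
# Sketch only: prints the commands from the steps above instead of running them.
# plan_volume and fstab_line are hypothetical helper names, not existing tools.

# Build the /etc/fstab entry for one xfs volume.
fstab_line() {
  local device="$1" mountpoint="$2"
  printf '%s %s xfs defaults 0 0\n' "$device" "$mountpoint"
}

# Print the format, mount, and chown commands for one volume.
plan_volume() {
  local device="$1" mountpoint="$2" owner="$3"
  printf 'sudo mkfs.xfs %s\n' "$device"
  printf 'sudo mount %s %s\n' "$device" "$mountpoint"
  printf 'sudo chown -R %s %s\n' "$owner" "$mountpoint"
}

plan_volume /dev/xvdg /opt/fedora-data tomcat7:tomcat7
fstab_line /dev/xvdg /opt/fedora-data
```

The second volume works the same way, with /dev/xvdh, the sufia tmp directory of the release in use, and the hydep:deploy owner as arguments.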
Deploy Sufia
Restart Solr
- Solr now runs outside of tomcat, so if this is the first time sufia has been deployed, Solr needs to be restarted after deployment.
- sudo service solr restart
Ensure apache is off
We don't want anyone doing stuff before we're ready.
Activate maintenance mode on old server
Fedora export
In sufia 6 instance:
- Run audit script
RAILS_ENV=production bundle exec sufia_survey -v
- Run json export
$ RAILS_ENV=production bundle exec sufia_export --models GenericFile=Chf::Export::GenericFileConverter,Collection=Chf::Export::CollectionConverter
- Open up fedora port to the other server so it can grab the binaries
- Change all the 127.0.0.1 URIs to reflect actual host, e.g.
$ find tmp/export -type f -name "*.json" -print0 | xargs -0 sed -i "s/127\.0\.0\.1/staging.hydra.chemheritage.org/g"
- Move the resulting directory full of exported data from tmp/export to the new server's tmp/import (or wherever desired; this can be provided to the import script)
$ cd tmp; tar -czf json_export_201611141510.tgz export
- Then from your own machine:
$ scp staging:/opt/sufia-project/current/tmp/json_export_201611141510.tgz new_box_ip:/opt/sufia-project/current/tmp/.
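Before packing up the export, it may be worth confirming that the sed rewrite above caught every file. The following is a minimal sketch of such a check; the function name remaining_localhost_refs is hypothetical, and it assumes a grep that supports --include (GNU grep does).

```shell
#!/usr/bin/env bash
# Sketch: list exported JSON files that still mention the loopback address.
# remaining_localhost_refs is a hypothetical helper name.
remaining_localhost_refs() {
  local export_dir="$1"
  # -r: recurse, -l: print only file names, -F: match the address literally.
  # "|| true" keeps the exit status at 0 when nothing matches.
  grep -rlF --include='*.json' '127.0.0.1' "$export_dir" || true
}
```

Running remaining_localhost_refs tmp/export should print nothing once the rewrite is complete; any file names it prints still contain 127.0.0.1.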
Fedora import
On sufia 7 instance:
- Mount the /dev/xvdh drive on the tmp directory in sufia (/opt/sufia-project/releases/XXXXX/tmp)
- Change the owner of the tmp directory
- sudo chown hydep:deploy /opt/sufia-project/releases/XXXXX/tmp
- unpack the exported json files
cd /opt/sufia-project/current/tmp/
tar -xzf json_export_201611141510.tgz
mv export import
- configure sufia6_user and sufia6_password in config/application
- run the import
$ RAILS_ENV=production bundle exec sufia_import -d tmp/import --json_mapping Chf::Import::GenericFileTranslator=generic_file_,Sufia::Import::CollectionTranslator=collection_
- You can use the little bash script I wrote to create batches of files if you want. It's at /opt/sufia-project/batch_imports.sh
$ RAILS_ENV=production bundle exec sufia_import -d /opt/sufia-project/import/gf_batch_0 --json_mapping Chf::Import::GenericFileTranslator=generic_file_
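For readers without access to batch_imports.sh, the batching idea can be sketched as follows. This is a hypothetical illustration and not the contents of that script: the function name make_batches, the copy-rather-than-move behavior, and the batch size argument are all assumptions. Only the generic_file_ prefix and the gf_batch_N directory naming come from the commands above.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; NOT the contents of /opt/sufia-project/batch_imports.sh.
# Copies generic_file_*.json files into gf_batch_0, gf_batch_1, ... directories.
make_batches() {
  local src="$1" dest="$2" size="$3"
  local count=0 file batch_dir
  for file in "$src"/generic_file_*.json; do
    [ -e "$file" ] || continue          # glob matched nothing
    batch_dir="$dest/gf_batch_$(( count / size ))"
    mkdir -p "$batch_dir"
    cp "$file" "$batch_dir/"
    count=$(( count + 1 ))
  done
  echo "$count"                         # number of files copied
}
```

Under these assumptions, something like make_batches tmp/import /opt/sufia-project/import 500 would produce directories that can each be passed to sufia_import with -d, as in the batch command above. Files with other prefixes, such as collection_, are left where they are.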
Postgres export/Import
On Staging
- Run the following to generate the export.
pg_dump -U postgres chf_hydra -Fp > chf_hydra_dump.sql
On Migration
From your machine run
scp -3 -i /path/to/test.pem ubuntu@staging:~/chf_hydra_dump.sql ubuntu@new_box_ip:~
- Run this command to get into postgres (password for user is stored elsewhere)
psql -U postgres
- Inside Postgres generate the database and required permissions
CREATE DATABASE chf_hydra;
GRANT Create,Connect,Temporary ON DATABASE chf_hydra TO chf_pg_hydra;
- Then enter \q to quit
- Finally import the data you copied over with scp
psql -U postgres chf_hydra < chf_hydra_dump.sql
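Since the dump passes through scp between two boxes, one optional sanity check is to compare checksums on both ends before importing. This is a minimal sketch, assuming sha256sum (GNU coreutils) is installed on both servers; the function name file_digest is hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: print only the SHA-256 digest of a file so it can be compared across hosts.
# file_digest is a hypothetical helper name; sha256sum comes from GNU coreutils.
file_digest() {
  sha256sum "$1" | cut -d ' ' -f 1
}
```

Running file_digest ~/chf_hydra_dump.sql on staging and on the new box should print the same value if the copy arrived intact.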
How to check the statefile
There are 3 parts to the state: sequence, counters, and seed. You need the correct combination of all three in order to have the right state. However, if you know you have two valid state files with the same origin you can do a rough comparison of their equivalence just by checking the sequence. To check sequence in our 7.2-based application:
$ cd /opt/sufia-project/current
$ bin/rails c production
> sf = ActiveFedora::Noid::Minter::File.new
> state = sf.read
> state[:seq]
To check in our 6.7-based application:
$ cd /opt/sufia-project/current
$ bin/rails c production
> sm = ActiveFedora::Noid::SynchronizedMinter.new
> state = {}
> ::File.open(sm.statefile, ::File::RDWR|::File::CREAT, 0644) do |f|
>   f.flock(::File::LOCK_EX)
>   state = sm.send(:state_for, f)
> end
> state[:seq]
To check the sequence on a file that's not in the default location, pass the template and the filename when you create the object with 'new', e.g.:
> sm = ActiveFedora::Noid::SynchronizedMinter.new(".reeddeeddk", "/var/sufia/minter-state")
Misc.
Postgres
You can get a list of all tables and fields with the command:
SELECT * FROM information_schema.columns WHERE table_schema = 'public'
Cleanup
To clean up a server for a new migration test, take the following steps.
- Stop Tomcat and Solr
- Remove all the folders in /opt/fedora-data.
- Remove all the files in /var/solr/data/collection1/data/index/ and /var/sufia/derivatives
- Remove all the upload files in the tmp directory of the version of sufia used.
- Enter into postgres (psql -U postgres)
- Drop the fcrepo database (DROP DATABASE fcrepo;)
- Build a new fcrepo database (CREATE DATABASE fcrepo;)
- Grant the fcrepo user (currently tribly until we get a better user) all privileges on the fcrepo database. (GRANT ALL PRIVILEGES ON DATABASE fcrepo TO tribly;)
- Restart tomcat and solr
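The cleanup steps above could be collected into a script with a dry-run switch, so the commands can be reviewed before anything is deleted. This is a rough sketch under stated assumptions: the service name tomcat7 is a guess based on the tomcat7 user, and the helper names run and cleanup_server are hypothetical. The sufia tmp upload directory is left out because its path depends on the release in use.

```shell
#!/usr/bin/env bash
# Sketch of the cleanup steps with a dry-run switch. Assumptions: the service
# names tomcat7 and solr, and the hypothetical helper names run and cleanup_server.
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print commands; 0 = actually run them

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '+ %s\n' "$*"
  else
    "$@"
  fi
}

cleanup_server() {
  run sudo service tomcat7 stop
  run sudo service solr stop
  # Remove the folders directly under /opt/fedora-data, keeping the mount point.
  run sudo find /opt/fedora-data -mindepth 1 -maxdepth 1 -type d -exec rm -rf {} +
  # Remove the files below the Solr index and derivatives directories.
  run sudo find /var/solr/data/collection1/data/index -mindepth 1 -type f -delete
  run sudo find /var/sufia/derivatives -mindepth 1 -type f -delete
  # The sufia tmp upload directory depends on the release; clear it by hand.
  run psql -U postgres -c 'DROP DATABASE fcrepo;'
  run psql -U postgres -c 'CREATE DATABASE fcrepo;'
  run psql -U postgres -c 'GRANT ALL PRIVILEGES ON DATABASE fcrepo TO tribly;'
  run sudo service tomcat7 start
  run sudo service solr start
}
```

Calling cleanup_server with the default DRY_RUN=1 only prints each command prefixed with "+"; setting DRY_RUN=0 would execute them, so that mode should be used with care.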