Difference between revisions of "Dans Data Upload"
Revision as of 10:38, 10 May 2012
How to upload a DANS archive to the grid
- create a new directory with the name of the archive. As an example we use the 'Crome' archive. The name of the archive is held in the environment variable '$ARCHIVE':
export ARCHIVE=Crome
mkdir -p ~/dans/$ARCHIVE/
- in this directory create another directory with the same name; this directory will contain the files and directories that need to be uploaded
cd ~/dans/$ARCHIVE
mkdir $ARCHIVE
- copy over the scripts from the repository
cp -a ~/dans/scripts/* .
- generate a sorted list of files. Note: All further actions are done based on this list!
find -L $ARCHIVE -type f | sort > ${ARCHIVE}-files.txt
- check the list of files and, if desired, remove any unwanted entries such as '.Trash' folders.
- generate a list of tarball.lst files. Each tarball.lst file contains a subset of the entries in ${ARCHIVE}-files.txt that, when packed into a single .tar file, comes to roughly 8 GB.
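One possible way to do this clean-up non-interactively is sketched below. The helper name `filter_file_list` and the sample paths are made up for illustration and are not part of the DANS scripts. Because `grep -v` only drops lines, the remaining entries keep their sorted order.

```shell
#!/bin/sh
# Hypothetical helper (not part of the DANS scripts): copy a file list,
# leaving out every path that has a component starting with '.Trash'.
# Usage: filter_file_list INPUT OUTPUT
filter_file_list() {
    # grep -v prints only the lines that do NOT match the pattern
    grep -v '/\.Trash' "$1" > "$2"
}

# Demonstration on a small, made-up list in a scratch directory.
workdir=$(mktemp -d)
printf '%s\n' \
    'Crome/.Trash-1000/files/old.doc' \
    'Crome/data/a.txt' \
    'Crome/data/b.txt' > "$workdir/Crome-files.txt"

filter_file_list "$workdir/Crome-files.txt" "$workdir/Crome-files.filtered.txt"
cat "$workdir/Crome-files.filtered.txt"
```

After reviewing the filtered file, it can be moved over ${ARCHIVE}-files.txt so that the later steps work from the cleaned list.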
./gen-tar-list ${ARCHIVE}-files.txt
The output files are named '$ARCHIVE-<N>.tar.lst', where <N> is a four-digit counter starting at 1.
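The gen-tar-list script itself is not shown on this page. The following is only a sketch of how such a size-based split could work, not the actual script: the function name `split_list`, the zero-padded counter format, and the tiny demo limit are all assumptions made for illustration.

```shell
#!/bin/sh
# Sketch only: NOT the real gen-tar-list. It walks a file list in order and
# starts a new <prefix>-<N>.tar.lst whenever adding the next file would push
# the running byte total over the limit.
# Usage: split_list LIST PREFIX LIMIT_BYTES
split_list() {
    list=$1
    prefix=$2
    limit=$3
    n=1
    total=0
    out=$(printf '%s-%04d.tar.lst' "$prefix" "$n")   # assumed zero-padded counter
    : > "$out"
    while IFS= read -r path; do
        size=$(wc -c < "$path" | tr -d ' ')          # file size in bytes
        # Open the next list if this file does not fit (never leave a list empty).
        if [ "$total" -gt 0 ] && [ $((total + size)) -gt "$limit" ]; then
            n=$((n + 1))
            total=0
            out=$(printf '%s-%04d.tar.lst' "$prefix" "$n")
            : > "$out"
        fi
        printf '%s\n' "$path" >> "$out"
        total=$((total + size))
    done < "$list"
}

# Demonstration in a scratch directory: four 400-byte files, 1000-byte limit.
demo=$(mktemp -d)
cd "$demo" || exit 1
mkdir Crome
for name in a b c d; do
    printf '%400s' '' > "Crome/$name.dat"            # writes 400 spaces
done
find -L Crome -type f | sort > Crome-files.txt
split_list Crome-files.txt Crome 1000
wc -l Crome-*.tar.lst
```

In this demo the first two files land in Crome-0001.tar.lst and the last two in Crome-0002.tar.lst. For a real archive the limit would be on the order of 8 GB (for example 8589934592 bytes); the sketch counts file contents only and ignores the per-file header and padding that tar adds.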