Required for this procedure:
...
- Open the Excel template ETDtempDspace.xlsm.
- Run the macro "Delete_Everything" by using CTRL-X. This will delete the content of sheet 1 and any existing XML map. If there is not an existing XML map, it will make an error which can be ignore.
- Delete the sheet2 that has old content in it. Create a new worksheet and rename it sheet2 if not named that already.
- Return to sheet1, cell 1A. Use developer import to import the file you created using Editix.
- Press ctr-r to run the reformatter macro.
- Go to the sheet2.
- Separate the keywords by changing commas in the keyword field to || where appropriate.
- Ensure that there are no spaces in file names. If there are, you'll need to change them in the spreadsheet, and also change the actual file name to match it.
Sort by the collection field with the departments in it. Check for any department that didn't fill in and any that aren't the correct department names, using the .csv in the Collection File Program to find the definitive versions of the departments name to use. Fix with find and replace, both in the department field and dc.relation.ispartof field with departmental collections names. Add these find and replaces to the macro so that they don't have to be done manually each time.
- Change the dates to the year-month-date format (this is a UK format).
Prepare the files:
Check for anything unexpectedly left over. ETD's with extra files will unzip into folders or into additional zip files. These will require some initial manipulation to prepare them. Take a look at the extra files and handle each case as appropriate as follows:
...
For other problems with the files Proquest FTP's to us, ask Michelle to call Proquest technical support at 877-408-5027 or 800-889-3358 (or email at tsupport@proquest.com or
http://support.proquest.com/ ) to find a solution.
Adding Supplements to the metadata in Excel and Moving them to the PDF Directory
...
Make sure all rights fields have the header dcterms.accessRights .
Save your sheet2 (you must be on it) as a .csv file. While on the "save as" screen, change the character encoding to UTF8 by using the tools drop-down, selecting web options, then encoding, and UTF8.
- In the CSV file, make sure that dates are in the YYYY-MM-DD format. Find and replace to fix if necessary.
- Be sure to close Excel or the next steps won't work.
...
- Move the entire SimpleArchiveFormat directory into your CollectionFilesProgram directory.
- Go the dos prompt: in the start menu, click "Run," then type cmd in the box, and click ok.
- Change the directory to the CollectionFilesProgram directory by typing cd c:\ETD\CollectionFilesProgram.
- Run the program by typing "python safscript.py"
- Look at the log, saf_log.txt for any items skipped. If items have been skipped, they included an unknown program. Notify Michelle, and she'll manually create collection files for this set, and modify programming to include the unknown value in future sets.
...
Email the zipped files to DSS at CP:
Joseph Andrew Koivisto <jkoivist@umd.edu>