19 Feb 2018
I modified the archives to split up pj.Location, pj.Subject, pj.PersonOfInterst, pj.Author and pj.Event. These are the 'multivalued' metadata, where you can specify several values separated by ;. Current processing splits the values and stores them individually in the archives. Some values were added before the online processing could handle them, so I have split any existing values. A few I did by hand. The others were done by running the following several times, since each pass only splits a value at one ;.

find . -name "doc.xml" -exec sed -i -r 's%()([^;]+);\s*([^\s].+)()%\1\2\3\4\6\n\1\2\3\5\6%g' {} \;

? 2017
We have changed to using original tif files instead of jpg.

17 Aug 2017
I have made a new plugin, PJPlugin. It processes the item files and converts all metadata to pj.metadata (ignoring any that are empty). It also changes the image file extension from .tif to .jpg. Don't need to do any of the old sed stuff.
So I am using the jpg files instead of the tif files.
Copying .item, OCR/.txt, jpg/.jpg
Have done 3A*, 3B*, 3C*, 3D*
Ran import.pl -keepold to add new documents to archives.
buildcol.pl -site pei-jones written-works
mv building index
and reconfigure message router (or restart server)

**** Adding the new archives files to svn.
Want to add folder and doc.xml, NOT all the images.

1. Add the folder without any contents:
svn add --depth=empty foldername
You need to add all the folders first, eg
svn add --depth=empty 3A*

2. Set the svn:ignore property. Existing folders will already have it, but it needs to be set for the new folders. Run
svn propget svn:ignore .
(inside archives). It will look something like this:
>svn propget svn:ignore
*.jpeg
*.jpg
*.JPG
*.gif
*.png
*.tif
Then run propset (one pattern per line inside the quotes):
archives>svn propset svn:ignore "*.jpeg
*.jpg
*.JPG
*.gif
*.png
*.tif" . --recursive
Now when you do svn st, it should only show the doc.xml files as being unversioned.
3. Add them:
svn add */doc.xml
svn commit -m "adding the new archive files"

(NOTE: if you did "svn add foldername" by mistake and got all the images added, you can do "svn rm --keep-local foldername" to undo it.)

**************************
Old Notes.

Adding extra material from /nzdl-storage/other-projects/pei-jones/JOnes_Collection
Am just copying .item files, OCR/.txt files, tif/.tif files (not jpg/.jpg files or OCR/.html files).
Have done 3B*, 3C*, 3D*
Copied into import.
Ran sed to convert metadata to pj.metadata.

find . -name "*.item" -exec sed -i 's/<Title>/<pj.Title>/g' {} \;
find . -name "*.item" -exec sed -i 's/<Provenance>/<pj.Provenance>/g' {} \;
find . -name "*.item" -exec sed -i 's/<Year>/<pj.Year>/g' {} \;
find . -name "*.item" -exec sed -i 's/<Location>/<pj.Location>/g' {} \;
find . -name "*.item" -exec sed -i 's/<Media>/<pj.Document-Type>/g' {} \;
find . -name "*.item" -exec sed -i 's/<Notes>/<pj.Notes>/g' {} \;

Ran import.pl -keepold to add new documents to archives.
buildcol.pl -site pei-jones written-works
mv building index
and reconfigure message router (or restart server)

NOTE!! pj.metadata from the item file gets added as ex.pj.metadata into archives. For the docs mentioned above, I have run sed on the archives to remove the ex. But we need to modify the plugin process to not add the metadata as ex.pj. Actually, maybe the new plugin can take eg Location from the item file and store it as pj.Location directly.
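
Sketch of the multivalued split from the 19 Feb 2018 entry, in case it needs redoing. This is not the sed command that was actually run. It is an awk alternative that splits every ; in one pass. It assumes each value sits on a single line of the form <Metadata name="pj.Location">A; B</Metadata>, ending with the closing tag. That layout and the function name split_multivalued are assumptions for illustration, so check against a real doc.xml first.

```shell
# Sketch only. ASSUMPTION: one metadata element per line, ending with the
# closing tag, e.g. <Metadata name="pj.Location">A; B</Metadata>.
# split_multivalued is a hypothetical helper; it reads stdin, writes stdout.
split_multivalued() {
  awk '
  {
    # Look for the opening tag of one of the five multivalued fields.
    if (!match($0, /<Metadata name="pj\.(Location|Subject|PersonOfInterst|Author|Event)">/)) {
      print; next                              # unrelated line: unchanged
    }
    prefix = substr($0, 1, RSTART - 1)         # leading indentation
    tag    = substr($0, RSTART, RLENGTH)       # the opening tag
    rest   = substr($0, RSTART + RLENGTH)      # value plus closing tag
    stop   = index(rest, "</Metadata>")
    if (stop == 0) { print; next }             # no closing tag on this line
    value  = substr(rest, 1, stop - 1)
    if (index(value, ";") == 0) { print; next } # single value: unchanged
    n = split(value, parts, ";")
    for (i = 1; i <= n; i++) {
      gsub(/^[ \t]+|[ \t]+$/, "", parts[i])    # trim spaces around each value
      if (parts[i] != "") print prefix tag parts[i] "</Metadata>"
    }
  }'
}

# Example: show the split form of one file without writing anything back.
# split_multivalued < doc.xml
```

It only prints to stdout, so nothing in archives changes until the output has been checked and copied over the original by hand.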