<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE entry
  PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<entry>
    <url>http://library.syr.edu/find/scrc/collections/manuscript/index.php</url>
    <institution>Syracuse University Special Collections Research Center</institution>
    <updated>2007-04-17</updated>
    <desc>
      <p>In 2004 Syracuse University had approximately 12,000 pages of finding aids 
	  covering 800+ collections, most existing only in hard
        copy.  Immediate mass conversion of all finding aids was not an option, so we began by selectively 
		identifying finding aids for large/in-demand collections or sets of
        related collections and converting them; over time we moved on to lesser-known collections, and to 
		finding aids that were suspect, complicated or insufficient (e.g., those requiring incorporaton of annotations, 
		merging of two or more separate accessions, or other manual
        intervention as part of the conversion process).</p>
      <p>We began with a test set of 20 finding aids and hammered out many of the bugs both with
        encoding and XSLT conversion using that data set, though we continue to occasionally
        encounter oddities. This gave us two very useful results: the first was detailed, documented
        tagging specifications, most of which are embodied in the second, a skeleton EAD template
        that is pre-loaded with much of the standard coding such as institution name, <font face="courier">encodinganalog</font> attributes, etc.</p>
	  <p>The last step was to create collection-level EAD records for collections that had no
	  finding aid whatsoever.  We decided that something was better than nothing, and that we would 
	  go ahead and place on-line ALL finding aids, whether the collection was processed or not.  Finding 
	  aids for unprocessed collections included an access restriction note, "This collection is unprocessed 
	  and accessible by special permission only.  Please contact the repository listed above for more 
	  information."</p>
	<p>As of December 2010, with the assistance of numerous student workers and interns eager to 
	  get hands-on EAD experience, we have collection-level or better finding aids for every one of our 2000+ 
	  collections, large or small, processed or unprocessed.  A side benefit of this undertaking   
	  is that we now have a far more accurate and detailed sense of what we have, how much we have, 
	  and the processing status of each collection.</p>
    </desc>
    <delivery>
      <p>We had many discussions over whether to deliver XML with a style sheet, convert XML to HTML
        on the fly, or provide static HTML. (We also explored the capabilities of ContentDM and EAD,
        with somewhat disappointing results; Archivists' Toolkit and Archon were not available when we 
		embarked on this effort).  In the end, for a variety of reasons, we chose to
        maintain EAD as our source document and for search purposes, but to generate and deliver static HTML.</p>
      <p>Using <a href="http://saxon.sourceforge.net/">saxon</a> in conjunction with an XSLT style
        sheet (originally downloaded from the <a href="http://www.archivists.org/saagroups/ead/ead2002cookbook.html">EAD cookbook</a> but
        heavily revised in-house), we produce from the EAD both a full HTML version and a
        stripped-down printer-friendly HTML version. Our home page provides links to the HTML files,
        offering <a href="http://library.syr.edu/find/scrc/collections/manuscript/index.php">two browse options, two 
		search options, and a listing by subject area</a>.  The search is done using 
	    SWISH-E (<a href="http://www.swish-e.org/">http://www.swish-e.org/</a>), a free open-source XML indexing tool, for the 
		EAD indexing in conjunction with HTML forms and php scripting.</p>
		<p>While this search has proven surprisingly robust and effective, there are some limitations such as problems with 
		diacritics and the effort required to add features such as highlighting of search terms in context.  
		We are currently exploring the idea of moving our finding aids to an <a href="http://xtf.cdlib.org">XTF platform</a>.</p>
		<p>Included with our EAD process is the creation or update of MARC records in our OPAC; thus, as of December 2010, all 2000+ 
	  of our collections also had MARC records so that researchers who begin with the catalog have a good chance of discovering our 
	  material.  We use <a href="http://people.oregonstate.edu/~reeset/marcedit/html">marcedit</a> to produce
        MARC records from the EAD, which are then imported into our OPAC. We have had excellent
        results with this, though a small amount of manual editing (2-10 minutes) is still required for many
        files, particularly in adding/correcting subfields. This gives us a fully detailed MARC record complete with subject headings. 
		The <a href="http://www.oclc.org/bibformats/en/8xx/856.shtm">856 field</a> is populated with a link to the finding aid, 
		so our collections and inventories are easily and immediately accessible from the OPAC.</p>
      <p>In addition, our finding aids are
        harvested, indexed, and searchable via RLG's <a href="http://www.archivegrid.org">ArchiveGrid</a>. 
		Within the next twelve months we hope to have full search capability on
        our own site as well.</p>
    </delivery>
    <encoding>
      <p><u>Legacy finding aids (electronic)</u>: If a MARC record existed for the collection, a
        skeleton EAD template was generated from it using <a href="http://people.oregonstate.edu/~reeset/marcedit/html">marcedit</a>, a free
        product developed by Terry Reese at Oregon State; if not, we used XMetaL to create a skeleton
        EAD beginning with the pre-filled template and adding the necessary minimum information, including
        appropriate controlaccess elements.  Jason Casden's excellent 
		<a href="http://www.archivists.org/saagroups/ead/tools.html">tri-XMLdate-normalizer.pl script</a> saved us lots of time in 
		inserting the normal attribute in date elements.  Tagging of the inventory sections was done through a combination of
        search/replace in Word, strategic use of Excel, cut-and-paste, etc. 
        </p>
      <p><u>Legacy finding aids (paper)</u>: Text was OCR'd in-house, then the same process followed as
        for electronic.</p>
      <p><u>Outsourcing</u>: To jumpstart our effort we contracted out approximately 1500pp to a local
        company, <a href="http://www.amconresearch.com">Amcon Research</a>, for OCR and EAD encoding to
        our specs, with very good results.</p>
		<p><u>New finding aids</u>:  Created directly in EAD beginning with our template, using 
		<a href="http://www.oxygenxml.com/">oXygen</a> or <a href="http://na.justsystems.com/index.php">XMetaL</a>.  
		We also had one tech-savvy student who used the 
		<a href="http://www.archivists.org/saagroups/ead/tools.html">EAD Cookbook Tools for NoteTab Pro</a>.
		Any new collection immediately receives a collection-level EAD file and we use MarcEdit to generate a 
		MARC record for it.  Inventory sections are added as and when the collection is 
		processed.  Adhering to "more product, less process" means that if the collection is reasonably well 
		organized (i.e. foldered and labeled) we do a simple box list and encode that as the inventory section, 
		with a note that the collection has received minimal processing and that the inventory is a box list only.</p>
	<p><u>Entities</u>: We use entities for a few items such as the name of the institution, the
        name of the library, the link to our OPAC, the link to our subject listing of collections,
        and a few other items, so when/if those change we do not need to revise the EAD files.</p>
	  <p><u>All finding aids</u>: XMetaL or Oxygen validates the document against the EAD 2002 DTD, and we run each 
	  file through RLG's EAD Report Card to ensure compliance with RLG Best Practices.  We do a final visual 
	  quality check on the HTML output.  Where applicable, we identify related collections and use <font face="courier">extref</font>
        elements within <font face="courier">relatedmaterial</font> to add links between them.  For large inventories, we 
		regularly use Jason Casden's  
		<a href="http://www.archivists.org/saagroups/ead/tools.html">tri-XMLdate-normalizer.pl script</a> to insert the 
		<font face="courier">@normal</font> attribute for <font face="courier">unitdate</font> elements.</p>
    </encoding>
    <contact>Michele Combs, <a href="mailto:mrrothen@syr.edu">mrrothen@syr.edu</a>, Syracuse
      University, Syracuse NY.</contact>
    <rlg>Yes</rlg>
  </entry>
