Columbia University, Rare Book and Manuscript Library


URL: http://www.columbia.edu/cu/libraries/indiv/rare/guides/

Encoding Procedure:  

The processor either enters data directly into a database or creates a Word document. We do some OCR. If we start with a Word document, we import the data into the database by supplying the appropriate number of tabs for each entry (fourteen at this point). Much of the tab insertion is done with Find/Replace; however, the results have to be checked carefully, because a missing or extra tab puts data in the wrong field.
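The tab check described above could be automated along the following lines. This is only a minimal sketch, not part of the workflow reported here: it assumes one record per line in a tab-delimited text export, takes the count of fourteen tabs from the description above, and uses hypothetical function names.

```python
EXPECTED_TABS = 14  # "fourteen at this point", per the description above


def check_tab_counts(lines, expected_tabs=EXPECTED_TABS):
    """Return (line_number, tab_count) for each record with the wrong tab count.

    Assumes one record per line. Lines containing only whitespace
    (including tabs) are skipped as blank.
    """
    problems = []
    for number, line in enumerate(lines, start=1):
        text = line.rstrip("\r\n")
        if not text.strip():
            continue
        count = text.count("\t")
        if count != expected_tabs:
            problems.append((number, count))
    return problems


def pad_record(line, expected_tabs=EXPECTED_TABS):
    """Append trailing tabs so a short record reaches the expected count.

    This only helps when the missing fields are the last ones; a record
    with too many tabs is reported rather than silently truncated.
    """
    text = line.rstrip("\r\n")
    missing = expected_tabs - text.count("\t")
    if missing < 0:
        raise ValueError("record has more tabs than expected")
    return text + "\t" * missing
```

Running a check like this after each Find/Replace pass would flag the records that still need manual attention before import.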

Once the data is in the database, we can edit it further using the database's own editing features. The database is configured so that all fields are optional and the output will still parse. The database we use is ProCite, and the markup is an output style applied to each entry in the database. We also use the Berkeley Template to encode all of the higher-level information. The template gives us a text file into which we insert the text file output from ProCite.
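The idea of an output style with optional fields, followed by insertion into a template file, can be illustrated with a short Python sketch. It is not the ProCite output style or the Berkeley Template itself: the field names, the choice of EAD elements, and the placeholder comment are all assumptions made for the example.

```python
from xml.sax.saxutils import escape

# Hypothetical (database field, EAD element) pairs; the actual fields
# used in the database are not listed in this entry.
FIELD_TAGS = [
    ("container", "container"),
    ("title", "unittitle"),
    ("date", "unitdate"),
]

# Hypothetical marker showing where the generated entries go in the template.
PLACEHOLDER = "<!-- INVENTORY -->"


def render_entry(record):
    """Wrap the non-empty fields of one record in markup.

    Empty or missing fields are skipped, so every field is optional.
    A record with no content at all yields an empty string.
    """
    parts = []
    for field, tag in FIELD_TAGS:
        value = record.get(field, "").strip()
        if value:
            parts.append(f"<{tag}>{escape(value)}</{tag}>")
    if not parts:
        return ""
    return "<c01><did>" + "".join(parts) + "</did></c01>"


def fill_template(template, records):
    """Insert the rendered entries into the template at the placeholder."""
    if PLACEHOLDER not in template:
        raise ValueError("template has no placeholder")
    entries = [render_entry(record) for record in records]
    body = "\n".join(entry for entry in entries if entry)
    return template.replace(PLACEHOLDER, body, 1)
```

Skipping empty fields instead of emitting empty elements is one way to keep the generated markup parseable when a record is only partly filled in.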

We validate the document with ParserPlus, a Windows-based version of James Clark's SP parser, produced by CSW Informatics.

Delivery Mechanism:  

We deliver the documents in native SGML and HTML; both are hard-coded. We deliver the EAD via Panorama Free and RLG's Archival Resources project.

Contact:  

Patrick T. Lawlor, Curator, The Herbert H. Lehman Suite and Papers, Columbia University, lawlor@columbia.edu

RLG Member:  

Yes

Last updated:  Date unknown

Update information:
If any information concerning the above EAD implementation is incorrect or out of date, download the XML source file for this entry, make the required changes, and mail it back to levjen@umd.edu. Updated entries may be submitted only by the contact listed above.