Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
  1. The updating process begins when the Metadata/Copy Cataloging Unit Team Lead sends a list of terms (usually as a Word document) newly added to the local Historical Images thesaurus spreadsheet.
     
  2. Start OpenRefine and navigate to Create Project > Clipboard. Paste in the new terms. Click Next.



  3. Use the below parameters and click Create Project.



  4. Microsoft Word routinely changes double-dashes (–) into the longer emdash (—). These need to be replaced with the correct double-dashes.

    This change can be accomplished by navigating to Edit Cells > Transform:
     

     

    In the Transform window, using the following expression:  value.replace('—','--')


     
    Click okay. All emdashes should now be replaced.

  5. Return to Edit Cells to trim the whitespace in the new terms:


     

    Navigate to Reconcile > Start Reconciling
  6. :
    Image Removed 
     
    If you haven't already, add the LC Reconciliation Service. In the Reconciliation window, click on Add Standard Service and enter the following URL: http://lc-reconcile.cmh2166.webfactional.com/
    Image Removed
    Click on "LoC Reconciliation Service", make sure LoC is selected as the Type to match against, and click Start Reconciling.
    Image Removed
    Wait until the Reconciliation process is finished running.
    Image Removed
     The Reconciliation results will show up in a facet on the left hand side of the screen. Click on the Matched headings to see only those in the main window.
    Image Removed 
    Navigate to Edit Column > Add Column Based On This Column, and create a new column that extracts the LC URI from the Reconciliation results. Use this Expression: cell.recon.match.id
    Image Removed 
    Once the URIs are added in a new column, close the Judgement facets from the Reconciliation process and clear the Reconciliation data. Navigate to: Reconcile > Actions > Clear Reconciliation Data.
    Image Removed 
    Export your reconciled data to a new TSV file using the Export button at the top right hand part of the screen:



  7. Open the working Excel spreadsheet that generates RDF entities – you will find it on the Projects server at: \\Projects\AuthorityWork\RDF\HI\HI-Thesaurus-Updated.xlsx

  8.  Load your new terms and URIs from the TSV file into the spreadsheet . Place the new term(s) in column A and the matched URI into column D.

  9. The FORMULAS tab contains pre-written Excel FORMULAS to help you generate the RDF elements.

    NOTE: this is not real-world linked data. The formulas concoct bunk URIs which are necessary for the RDF to function.

    1. Increment the URI number (for colum column B),
    2. Generate URI (column C), and Generate RDF elements, with or without LC URIs from column D.
      Simply insert the correct starting row for each. Always be sure that any formulas are turned into VALUES once generated.
       
    3. Compile SKOS (column G) – generates RDF XML elements from columns A & C.
       
  10. Next, you will add the SKOS RDF XML elements generated in Column G. To start, open the the RDF file  – which can Once the RDF entities in column G are generated and stored as VALUES, copy the new terms and paste them in the RDF file can be found at \\Projects\AuthorityWork\RDF\HI\HI.rdf
    Paste  – using Oxygen XML editor.

    Copy the new terms and paste them in the RDF file just before the STATUS declaration at the end of the file. The pasted-in entities will look different than the ones that already exist in the RDF file:



    To fix this, click on the Format and Indent button in the toolbar:



  11. Your code should be indented and beautiful. Save the RDF file.

  12. Finish by emailing the Metadata/Copy Cataloging Unit Team Lead to say the RDF file is updated.