Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
  1. The updating process begins when the Metadata/Copy Cataloging Unit Team Lead sends a list of terms (usually as a Word document) newly added to the local Historical Images thesaurus spreadsheet.
     
  2. Start OpenRefine and navigate to Create Project > Clipboard. Paste in the new terms. Click Next.



  3. Use the below parameters and click Create Project.



  4. Microsoft Word routinely changes double-dashes (–) into the longer emdash (—). These need to be replaced with the correct double-dashes.

    This change can be accomplished by navigating to Edit Cells > Transform:
     

     

    In the Transform window, using the following expression:  value.replace('—','--')


     
    Click okay. All emdashes should now be replaced.

  5. Return to Edit Cells to trim the whitespace in the new terms:



     

  6. Navigate to Reconcile > Start Reconciling:

     
     
    If you haven't already, add the LC Reconciliation Service. In the Reconciliation window, click on Add Standard Service and enter the following URL: http://lc-reconcile.cmh2166.webfactional.com/



  7. Click on "LoC Reconciliation Service", make sure LoC is selected as the Type to match against, and click Start Reconciling.



  8. Wait until the Reconciliation process is finished running.


     
  9. The Reconciliation results will show up in a facet on the left hand side of the screen. Click on the Matched headings to see only those in the main window.

     

  10. Navigate to Edit Column > Add Column Based On This Column, and create a new column that extracts the LC URI from the Reconciliation results. Use this Expression: cell.recon.match.id

     

  11. Once the URIs are added in a new column, close the Judgement facets from the Reconciliation process and clear the Reconciliation data. Navigate to: Reconcile > Actions > Clear Reconciliation Data.

     

  12. Export your reconciled data to a new TSV file using the Export button at the top right hand part of the screen:



  13. Open the working spreadsheet that generates RDF entities – you will find it on the Projects server at: \\Projects\AuthorityWork\RDF\HI\HI-Thesaurus-Updated.xlsx

  14.  Load your new terms and URIs from the TSV file into the spreadsheet. Place the new term(s) in column A and the matched URI into column D.

  15. The FORMULAS tab contains pre-written Excel FORMULAS to help you generate the RDF elements:
    1. Increment the URI number (for colum B),
    2. Generate URI (column C), and 
    3. Generate RDF elements, with or without LC URIs from column D.

      Simply insert the correct starting row for each. Always be sure that any formulas are turned into VALUES once generated.
       
  16. Once the RDF entities in column G are generated and stored as VALUES, copy the new terms and paste them in the RDF file can be found at \\Projects\AuthorityWork\RDF\HI\HI.rdf

    Paste the new terms just before the STATUS declaration at the end of the file. The pasted-in entities will look different than the ones that already exist in the RDF file:



    To fix this, click on the Format and Indent button in the toolbar:



  17. Your code should be indented and beautiful. Save the RDF file.

  18. Finish by emailing the Metadata/Copy Cataloging Unit Team Lead to say the RDF file is updated.