Skip to Main Content

E-book & Streaming Media Management

This guide describes E-book and streaming media collection management resources and tips.

Update URLs

Due to changes of vendors, the links of e-resources may change as well. Some changes like server and domain change may make the old URLs not accessible. However, Alma does not have a URL validation job. While it has a export URLs job, the URLs exported may not be the portfolio link. Although vendor may provide MARC records with updated URLs, when import existing records to Alma, only bib records will be updated, not the portfolio information. For records with PO line associated or local fields and course reserve titles, deleting and re-importing should not be performed  just to update the portfolio URLs. 

The best way we have found is to check URLs collection by collection. This makes sense to us as one collection is usually come from the same vendor and the URL address should have the same "prefix." We use MARCEDIT and OpenRefine to extract only 020 and 856 fields from the vendor provided MARC record. Then, match the link in the MARC with the URLs in the collection portfolio list using ISBN. Then update the portfolios using porfolio loader. 

1. Extract ISBN + URL

To extract 020 and 856 fields

  • Open the vendor provided MARC records in MarcEdit. Select File > Export for > OpenRefine, it should generate a JSON file

  • Open the exported JSON file in OpenRefine. Check the following boxes
    • Preserve empty strings
    • Trim leading & trailing whitespace from strings
  • Create a project. Click the down angle of the tag column > choose Facet > Text facet. Find the rows of 020 and 856, click include on the right side of the two rows. Export CSV file.

2. Get Portfolio List

Export portfolio list in Alma and sort the URL column alphabetically. Find the rows with old URLs. Copy ISBN values of those rows to a new TXT file with "ISBN" header.

3. Get New URLs

Put isbn TXT file and OpenRefine exported CSV file in the same directory as this Python script: checklink.py. The script takes those two files and generate a new TXT with updated URLs.

To run this script, run the following line in the command: python checklink.py <name of the txt file> <name of the csv file> <name of the output new URL txt>.

Paste those new URLs back to the portfolio list generated from ALMA. Please be noted that ALMA generated portfolio list may contain errors: "." after ISBN or note after ISBN. ISBN should be numbers. Please correct the errors in the portfolio list before load new portfolio list.