Newspaper Spreadsheet Ingest Workflow

This guide offers a workflow for ingesting and migrating newspaper content into the CTDA. As with many things in the system, there are multiple ways that you can approach adding newspaper issues to the repository. We have found that the workflow set out below works best for ingesting newspaper content using the spreadsheet ingest module. 

The steps below outline how to configure three separate spreadsheets for ingest. In theory, all of this work can be done in one spreadsheet, and you may get to a point where you can do just that. But to start, we have found that working with each type of content individually (newspaper title objects, newspaper issue objects, and newspaper page objects) in its own spreadsheet causes less confusion.

You can work on all three spreadsheets at once, and then ingest them one after another on the same day, or you can work on one at a time, and ingest them days or weeks apart. How fast or slowly you work on newspapers is up to you and your schedule. But, you have to follow the order set out below (we don't like to use the word must at all, but for this process we must):

  • You must ingest a newspaper object in order to have newspaper issues display as desired in the repository.

  • You must ingest newspaper issues before newspaper pages if you want the pages to display as desired.

While doing these steps out of order will not be a complete disaster, it is difficult to get everything to appear as desired, and offers the potential for more errors in the process.


Steps needed to ingest newspapers using spreadsheet ingest


1. Ingest Newspaper Objects using the Newspaper Content Model (islandora:newspaperCModel)

Go to the Newspaper Object 2.0 Template, and make a copy to your own Google Drive

Fill out the template with the metadata for the newspaper object.

You will need fill out the following columns with data in order to ingest newspapers via spreadsheet ingest correctly:



Column Title

How to fill out the columns



Column Title

How to fill out the columns

1a

Label

Title of the newspaper object

1b

Title

Title of the newspaper object

1c

Heldby

Select your institution from the dropdown

1d

Rights

Select a rights status of the newspaper

1e

PID

Create a PID for your newspaper. PIDs begin with your instituional namespace followed by a colon and then any alphanumeric identifier you determine.

1f

parent_object

Enter the PID of the collection object the newspaper should appear under

1g

parent_predicate

Select isMemberOfCollection

1h

cmodel

Select islandora:newspaperCModel

After completing the template, follow the ingest instructions to ingest your newspaper objects.


2. Ingest Newspaper Issues using the Newspaper Issue Content Model (islandora:newspaperIssueCModel)

Go to the Newspaper Issue 2.0 Template, and make a copy to your own Google Drive

Fill out the template with the metadata for the newspaper object.

You will need fill out the following columns with data in order to ingest newspapers via spreadsheet ingest correctly:



Column Title

How to fill out the columns



Column Title

How to fill out the columns

2a

Label

Title of the newspaper issue

2b

Title

Title of the newspaper issue

2c

Heldby

Select your institution from the dropdown

2d

Rights

Select a rights status of the newspaper

2e

PID

Create a PID for your newspaper issue. PIDs begin with your instituional namespace followed by a colon and then any alphanumeric identifier you determine.

2f

parent_object

Enter the PID of the newspaper object to which the issue belongs.  You should use a PID listed in your newspaper object spreadsheet (look at 1e in the section above).

2g

parent_predicate

Select isMemberOf

2h

cmodel

Select islandora:newspaperIssueCModel

2i

isSequenceNumber

Enter the sequence number of each issue as it should appear in each newspaper object (i.e. issue 1 should be 1, issue 2 should be 2, ect.)

2j

dateIssued

Enter the date the newspaper issue was produced in the YYYY-MM-DD format.

After completing the template, follow the ingest instructions to ingest your newspaper issues.


3. Ingest Newspaper Pages using the Newspaper Page Content Model (islandora:newspaperPageCModel)

Go to the Paged Content 2.0 Template, and make a copy to your own Google Drive

Fill out the template with the metadata for the newspaper pages.

You will need fill out the following columns with data in order to ingest newspapers via spreadsheet ingest correctly:



Column Title

How to fill out the columns



Column Title

How to fill out the columns

3a

Label

Title of the newspaper page

3b

Title

Title of the newspaper page

3c

Heldby

Select your institution from the dropdown

3d

Rights

Select a rights status of the page

3e

parent_object

Enter the PID of the newspaper issue to which page belongs.  You should use a PID listed in your newspaper issue spreadsheet (look at 2e in the section above).

3f

isPageOf

Enter the PID of the newspaper issue to which each page belongs. The PID in this column should match the PID entered in the parent_object column (see 3e).

3g

parent_predicate

Select isMemberOf

3h

cmodel

Select islandora:newspaperIssueCModel

3i

isPageNumber

Enter the page number of each page as it should appear in each newspaper issue.

3j

isSequenceNumber

Enter the sequence number of each page as it should appear in each newspaper issue. This number should match the number entered in the isPageNumber column (see 3i above)

After completing the template, follow the ingest instructions to ingest your newspaper pages.