Datastreams

What are Datastreams?

Datastreams are the files that make up a repository archival information package. All datastreams are generated automatically when an object is added to the repository, Adding, updating or replacing datastreams allows you to manage your objects within CTDA.

In This Guide

 

Accessing and Downloading Datastreams

  • Navigate to your object and click the download button 

  • The icon will expand to display the datastreams associated with your object. Click on any of the links in the menu to download that datastream.

  • See the sections below if you need to manage, add or, replace a datastream.


Managing Datastreams

  • To view the datastreams associated with an object in the repository, navigate to that object's page. Click the "Manage" tab. 

  • Click "Datastreams".

  • After clicking "Datastreams" you will see this screen. 

  1. Datastream ID (DSID) - an internal unique identifier within the scope of the digital object, for example, RELS_EXT, MODS, or TN.

  2. Datastream Label - the human-readable title given to a data stream. It does not need to be unique but it should be informative enough for a user to understand the purpose of the data stream.

  3. Versions - used to view past versions of the datastreams and revert to an early version of a particular DS to correct a mistake.

  4. Operations - the options in this column allow you to manage each datastream.


Add or Replace a Datastream

All Fedora Digital Objects contain datastreams which can be added, replaced or sometimes deleted. Remember that some datastreams are required and cannot be deleted.

Replace a Datastream

  • Go to the object for which you want to replace a datastream. Click “Manage”.

  • Click “Datastreams”.

  • If the datastream already exists, you can select “replace”.

  • If you want to replace the TN, or thumbnail, datastream click on “replace” on the line for that datastream ID “TN”.

  • In the new window, click "Choose File" and find your new thumbnail. Click "Upload", then "Add Contents".

  • This is the same process to replace any datastream you have permissions to.

Add a Datastream

  • Not all content models come with thumbnail datastream. Sometimes, ingests don’t go as planned and you need to add a datastream rather than replace it.

  • Go to the digital asset where you want to add a datastream.

  • Click “Manage”.

  • Click “Datastreams”.

  • Click on “Add a datastream”.

  • In the new window, fill out:
    Datastream ID – the ID is an autocomplete field and datastream IDs are a controlled vocabulary. MODS is always MODS, TN is always TN, OBJ is always OBJ. If you are adding a datastream that is for preservation purposes, you can create a naming convention for that purpose, i.e. AM for Archival Master.
    Datastream Label – The label can be the filename or just left with the default such as MODS Record.

  • Upload your file and click “Add Datastream”

Deleting Datastreams

You have the ability to delete datastreams from your objects in CTDA, but we recommend that you do not.


Regenerate All Datastreams

  • Go to the object for which you want to regenerate all of the datastreams. Click the Manage tab.

  • Click Properties.

  • Click acquire the lock and then click the Confirm button on the next page.

  • After acquiring the lock, click the Regenerate all derivatives button at the bottom of the page.

  • Click the Regenerate button on the next screen to start the derivative regeneration process.

  • A progress bar will appear on the next page.

  • Once completed, you should see a green box with the message Derivatives successfully created.


Datastream List

ID

Label

Content Models

ID

Label

Content Models

RELS-EXT

Fedora Object to Object Relationship Metadata

All

MODS

MODS metadata record, created at time of ingest

All

DC

Dublin Core metadata record, created at time of ingest

All

OBJ

The original file ingested into the repository (File format for this DS will depend on the content model used)

All

TN

Thumbnail image, created at time of ingest

All

TECHMD

Technical Metadata

All

JP2

JPEG 2000 derivative, created at the time of ingest

Large Image

JPG

Medium-sized JPEG, created at the time of ingest and used in the standard image viewer

Large Image

MEDIUM_SIZE

A compressed version of the image, created at the time of ingest and used on the object's View page

Basic Image

PROXY_MP3

MP3 derivative, created at the time of ingest

Audio

PDF

PDF derivative, created at the time of ingest

Book, PDF, Newspaper

RELS-INT

Internal Fedora relationship metadata defining the dimensions of the JP2 datastream

Book Page, Newspaper Page

OCR

Optical Character Recognition file

Book Page, Newspaper Page

HOCR

A converted version of the OCR datastream, intended to be more human-readable

Book Page, Newspaper Page

COLLECTION_POLICY

Specifies which content models are associated with a particular collection

Collection

FULL_TEXT

Text of a PDF

Document, PDF

PREVIEW

Preview image, created at time of ingest

Document, PDF

MP4

MPEG-4 video derivative, created at time of ingest

Video

MKV

Matroska video derivative, created at time of ingest

Video