Bulk ingestion is a form of ingestion in which a specially crafted zip file is provided. This file contains metadata needed by the ingestion process, as well as file manifests that enable the system to confirm uncorrupted receipt of the correct information.
This zip file is in the BagIt format. The Library of Congress has built and shared the Bagger utility for constructing these files. BagIt zips contain the following contents:
Name | Purpose |
---|---|
bagit.txt | Declaration file |
bag-info.txt | Metadata file (unused by Packrat) |
manifest-sha1.txt | Data file manifest, with hashes for each data file in the SHA1 format |
tagmanifest-sha1.txt | Tag file manifest, with hashes for each metadata file in the SHA1 format |
data/* | Data files, stored in the data subfolder, potentially with additional file path elements |
capture_data_photo.csv | Optional photogrammetry Capture Data metadata |
models.csv | Optional Model metadata |
scenes.csv | Optional Scene metadata |
One of capture_data_photo.csv, models.csv, or scenes.csv must be supplied. These metadata files contain common data as well as object-specific metadata needed for ingestion. Some of these fields make use of Controlled Vocabularies:
Metadata Usage | Column Header | Meaning |
---|---|---|
All | subject_guid | Subject Identifier |
All | subject_name | Subject Name |
All | unit_guid | Unit Identifier |
All | unit_name | Unit Name |
All | item_guid | Media Group Identifier |
All | item_name | Media Group Name |
All | item_subtitle | Media Group Subtitle |
All | entire_subject | Is this Media Group the Entire Subject or Just a Portion? |
All | name | Name of Object |
Capture Data | date_captured | Date of Capture |
Capture Data | description | Description of Capture |
Capture Data | capture_dataset_type | Dataset Type (Controlled Vocabulary) |
Capture Data | capture_dataset_field_id | Field ID for Capture Data dataset |
Capture Data | item_position_type | Position Type (Controlled Vocabulary) |
Capture Data | item_position_field_id | Field ID for Position |
Capture Data | item_arrangement_field_id | Field ID for Arrangement |
Capture Data | focus_type | Focus Type (Controlled Vocabulary) |
Capture Data | light_source_type | Light Source Type (Controlled Vocabulary) |
Capture Data | background_removal_method | Background Removal Method (Controlled Vocabulary) |
Capture Data | cluster_type_type | Cluster Type (Controlled Vocabulary) |
Capture Data | cluster_geometry_field_id | Field ID for Cluster Geometry |
Models | model_subtitle | Model Subtitle |
Models | date_created | Creation Date |
Models | creation_method | Creation Method (Controlled Vocabulary) |
Models | modality | Model Modality (Controlled Vocabulary) |
Models | units | Model Units (Controlled Vocabulary) |
Models | purpose | Model Purpose (Controlled Vocabulary) |
Scenes | scene_subtitle | Scene Subtitle |
Scenes | posed_and_qcd | Set if the Scene has been Posed and QC’d |
Scenes | approved_for_publication | Set if the Scene has been Approved for Publication |
All | directory_path | Relative Path to Data File(s) Starting from the Data Folder |