Data Formats in DSP-API
As explained in What Is DSP and DSP-API (previous Knora)?, the DSP stores data in a small number of formats that are suitable for long-term preservation while facilitating data reuse.
The following is a non-exhaustive list of data formats and how their content can be stored and managed by DSP-API:
Original Format | Format in DSP |
---|---|
Text (XML, LaTeX, Microsoft Word, etc.) | Knora resources (RDF) containing Standoff/RDF |
Tabular data, including relational databases | Knora resources |
Data in tree or graph structures | Knora resources |
Images (JPEG, PNG, etc.) | JPEG 2000 files stored by Sipi |
Audio and video files | Audio and video files stored by Sipi (in archival formats to be determined) |
Can be stored by Sipi, but data reuse is improved by extracting the text for storage as Standoff/RDF |
Last update:
2022-06-10