Skip to content

Overview

DSP-API supports various ways of handling textual data:

Text in RDF

Textual data can be included directly in the data stored in DSP-API. This is the default way of handling text in the DSP. There are three ways of representing textual data in DSP-API, two of which are fully supported by DSP-APP and DSP-TOOLS.

Texts stored in RDF can be searched using both full-text search and structured queries.

Simple Text

If a text requires no formatting, it can simply be stored as a string in a knora-base:TextValue. This is sufficient in many cases, especially for shorter texts like names, titles, identifiers, etc.

Text with Formatting

For text requiring regular markup, knora-base:TextValue can be used in combination with the DSP's standard standoff markup.

This allows for the following markup:

  • structural markup
    • paragraphs
    • headings levels 1-6
    • ordered lists
    • unordered lists
    • tables
    • line breaks
    • horizontal rules
    • code blocks
    • block quotes
    • footnotes
  • typographical markup
    • italics
    • bold
    • underline
    • strikethrough
    • subscript
    • superscript
  • semantic markup
    • links
    • DSP internal links

DSP-APP provides a text editor for conveniently editing text with standard standoff markup.

More details can be found here.

Text with Custom Markup

It is possible to create custom XML-to-Schema mappings, which allows for creating project specific custom markup for text values. Details can be found here.

Info

Custom markup is not supported by DSP-TOOLS and is view-only in DSP-APP.
Creating custom markup is relatively involved, so that it should only be used by projects working with complex textual data.

File Based

Text files of various formats (Word, PDF, XML, etc.) can be uploaded to the media file server. For more details, see here

This allows for easy upload and retrieval of the file. However, it does not allow for searching within the file content.

TEI XML

All text values in DSP-API using standoff markup can be converted to TEI XML as described here.

Info

Improved support for TEI XML is in planning.