
Metadata enrichment in the TG digital library using information from the GND

The digital library contains links to the GND entities of the authors. The idea is to enrich the TG metadata of the work files by using the information from the GND entities and adding several metadata fields. The specific fields would be:

  1. Gender of the author (directly from the GND), in the relations/ns2:RDF/rdf:Description/eltec:authorGender element.
  2. Geographical area of the author (directly from the GND). We could use the subject element in the work element in the same way that we write other GND entities. However, in my opinion, this would be a misuse of the element because it implies that the work is about this country, which is not the case. Two other options are to use a new element, such as db:authorGeographicalArea, or to simply leave this information out. Personally, I would prefer the latter option.
  3. Basic classification, using the mapping of the author's geographical area and the information in K10plus. Multiple values per work are possible and are in fact the norm.
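
As a rough illustration of item 1, here is a minimal Python sketch that writes the gender value into relations/RDF/Description/authorGender. Several things in it are assumptions and not taken from the actual files: the TextGrid metadata namespace URI, the eltec namespace URI (a made-up placeholder), the helper names, the plain string "female" as the value, and the tiny sample document. It also assumes that the ns2:RDF element is in the standard RDF namespace.

```python
import xml.etree.ElementTree as ET

# Namespace URIs. Only the RDF one is the standard W3C URI.
TG_NS = "http://textgrid.info/namespaces/metadata/core/2010"  # assumption
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
ELTEC_NS = "http://example.org/eltec#"  # hypothetical placeholder, not the real URI

# Ask ElementTree to serialise the placeholder namespace with the "eltec" prefix.
ET.register_namespace("eltec", ELTEC_NS)


def get_or_create(parent: ET.Element, tag: str) -> ET.Element:
    """Return the first direct child with this tag, creating it if absent."""
    child = parent.find(tag)
    if child is None:
        child = ET.SubElement(parent, tag)
    return child


def add_author_gender(xml_text: str, gender: str) -> str:
    """Set relations/RDF/Description/authorGender to the given value."""
    root = ET.fromstring(xml_text)
    relations = root.find(f".//{{{TG_NS}}}relations")
    if relations is None:
        # Simplification: a real file may need the element in a specific place.
        relations = ET.SubElement(root, f"{{{TG_NS}}}relations")
    rdf = get_or_create(relations, f"{{{RDF_NS}}}RDF")
    description = get_or_create(rdf, f"{{{RDF_NS}}}Description")
    gender_elem = get_or_create(description, f"{{{ELTEC_NS}}}authorGender")
    gender_elem.text = gender  # overwrites an existing value, no duplicates
    return ET.tostring(root, encoding="unicode")


# Tiny stand-in for a work metadata file (not the real structure).
SAMPLE = f'<object xmlns="{TG_NS}"><relations/></object>'

if __name__ == "__main__":
    print(add_author_gender(SAMPLE, "female"))
```

Calling the function a second time on its own output replaces the existing value instead of adding a second element, so the enrichment could be re-run once the annotated data is final.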

I took an example (https://textgridlab.org/1.0/tgcrud-public/rest/textgrid:104rg.0/metadata) and made the changes to it: 1159_An_Erich_Bachmann.104rg.0.work.meta_enriched.xml

Here is the table of annotated data: 20250825_works_with_annotations.tsv. This data is not final; we still need to discuss some aspects with Susanne.

Edited by Jose Calvo Tello