French-German Meeting
on Copyrighted Works in Digital Libraries
Trier Center for Digital Humanities
Trier University, Germany
2023-12-14
URL: textplus.org
See: Rehm et al. (2007).
See: Haber (2012).
See: Jett et al. (2020).
The basic idea behind derived text formats is essentially the following: the starting point is collections of copyright-protected full texts […] to which an institution has legal access. Processing routines transform these text collections into so-called derived text formats. These routines combine targeted information enrichment (for example, linguistic annotation) with information reduction (for example, deleting word forms or removing sequence information).
The derived text formats are designed so that, on the one hand, the texts in their resulting form no longer fall within the scope of copyright and, on the other hand, they still support the widest possible range of quantitative analyses. […] Such datasets can be stored without restrictions, used in research, published, and reused by third parties. Like reuse and publication, the creation of derived text formats requires no permission, provided that there is legal access to the original material.
Source: Schöch et al. (2020); see also: Grisse (2020), Jotzo (2020).
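One kind of information reduction named in the quotation, the removal of sequence information, can be illustrated with a minimal sketch. It is not the procedure of Schöch et al. (2020). The function names `tokenize` and `segment_bags`, the regex tokenizer, and the `segment_size` parameter are assumptions made for this example. The text is cut into fixed-size segments and each segment is reduced to alphabetically sorted token counts, so word order inside a segment is no longer recoverable from the output.

```python
import re
from collections import Counter


def tokenize(text):
    # Lowercased word tokens; a deliberately simple stand-in for a real tokenizer.
    return re.findall(r"\w+", text.lower())


def segment_bags(text, segment_size=1000):
    """Reduce a text to per-segment token counts (hypothetical helper).

    Counts are sorted alphabetically by token, so the order in which the
    tokens occurred inside a segment is not preserved in the result.
    """
    tokens = tokenize(text)
    bags = []
    for start in range(0, len(tokens), segment_size):
        segment = tokens[start:start + segment_size]
        bags.append(dict(sorted(Counter(segment).items())))
    return bags


if __name__ == "__main__":
    sample = "The cat sat on the mat. The dog sat on the cat."
    for bag in segment_bags(sample, segment_size=6):
        print(bag)
```

Token frequencies per segment remain available for quantitative analysis, while the running text does not. Which segment size, if any, is sufficient in a given legal context is a question this sketch does not address; see the sources cited above.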