DROC

Aus Kallimachos
Version vom 16. Mai 2017, 12:47 Uhr von DeletedUser (Diskussion | Beiträge) (Die Seite wurde neu angelegt: „<div class="notab"> ==Deutscher Romankorpus (DROC)== This repository contains a manually annotated corpus for german literary novels. DROC contains 90 fragmen…“)
(Unterschied) ← Nächstältere Version | Aktuelle Version (Unterschied) | Nächstjüngere Version → (Unterschied)
Wechseln zu:Navigation, Suche

Deutscher Romankorpus (DROC)

This repository contains a manually annotated corpus for german literary novels. DROC contains 90 fragments of novels with an average length of about 200 sentences and a total length of 390.000 tokens.

DROC contains manually labeled annotations for:


Character References that refer to (usually human) entities appearing in the novel (about 50.000) Coreferences between those references Direct Speech annotations (about 2000) Speaker and Addressees for each direct speech