DROC

Aus Kallimachos
Wechseln zu:Navigation, Suche

Deutscher Romankorpus (DROC)

This repository contains a manually annotated corpus for german literary novels. DROC contains 90 fragments of novels with an average length of about 200 sentences and a total length of 390.000 tokens.

DROC contains manually labeled annotations for:

Character References that refer to (usually human) entities appearing in the novel (about 50.000) Coreferences between those references Direct Speech annotations (about 2000) Speaker and Addressees for each direct speech