Corpus of copy-edited texts
Lektor is an extensive collection of copy-edited texts and translations, intended for anyone interested in the process of copyediting. A corpus of this type makes it possible to identify the most frequent language errors in Slovene (excluding preferential and stylistic corrections). It includes modern non-literary texts, mostly technical and popular-science, all written by different authors and corrected by different copyeditors. It contains 30,258 copyedits, which are divided into 5 main categories (style, morphology, orthography, syntax and pragmatics) and 50 subcategories.