Trendi corpus

The Trendi Monitor corpus draws text from online media portals and is updated monthly

The Trendi monitor corpus contains text from more than 100 media websites and is the first monitor corpus for Slovene. The corpus is updated monthly and contains texts from 2019 to the present.

Its main purpose is to provide insight into current language use – to monitor the use of words and phrases, the emergence of new words and meanings.

The Trends Corpus exceeded one billion words at the beginning of 2025. Every month the corpus grows by about 15-20 million words.

LINKS AND CONTACT

dr. Iztok Kosem

Faculty of Computer and Information Science

Večna pot 113

1000 Ljubljana

  • E-mail: iztok.kosem@fri.uni-lj.si