PROJECT DESCRIPTION

The Collocations Dictionary of Modern Slovene is being compiled jointly by the Centre of Language Resources and Technologies at the University of Ljubljana, the Centre of Applied Linguistics at Trojina Institute, and part of the research team of the “Slovene Language – Basic, Contrastive, and Applied Studies” programme at the Faculty of Arts, University of Ljubljana. The project's results will also form part of the data used for the future description of Slovene lexis in the Dictionary of Modern Slovene.

Collocations are the most frequent and typical combinations of words in real language as it is used by native speakers. The dictionary is intended for professional writers such as translators, journalists, and authors; for learners of Slovene as a second or foreign language; and for all users interested in natural, typical language use and in how words behave in their immediate context.

In the first phase, the Collocations Dictionary Database was produced; it contains data automatically extracted from the Gigafida text corpus. Collocations were identified using special functionalities of the Sketch Engine tool adapted for Slovene. This automatically extracted data is the first step toward a comprehensive dictionary of Slovene collocations. The final version will contain manually checked collocation data, arranged by frequency and relevance and described with semantic indicators and other relevant types of information.
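
To give a rough idea of what automatic extraction of collocation candidates can involve, the sketch below ranks word pairs from a toy token list by the logDice association score, a measure commonly used in Sketch Engine word sketches. It is an illustration only, not the project's actual pipeline: the function names, the `min_pair_freq` threshold, and the sample tokens are assumptions, and adjacent word pairs stand in for the grammatical relations a real word sketch would use.

```python
import math
from collections import Counter


def log_dice(pair_freq, freq_a, freq_b):
    """logDice score: 14 + log2(2 * f(a, b) / (f(a) + f(b)))."""
    return 14 + math.log2(2 * pair_freq / (freq_a + freq_b))


def rank_collocations(tokens, min_pair_freq=2):
    """Rank adjacent word pairs by logDice (hypothetical helper).

    Assumption: adjacency is used as a stand-in for a grammatical
    relation; pairs seen fewer than min_pair_freq times are dropped.
    """
    word_freq = Counter(tokens)
    pair_freq = Counter(zip(tokens, tokens[1:]))
    scored = [
        (pair, count, log_dice(count, word_freq[pair[0]], word_freq[pair[1]]))
        for pair, count in pair_freq.items()
        if count >= min_pair_freq
    ]
    # Highest association score first.
    return sorted(scored, key=lambda item: item[2], reverse=True)


# Toy input, invented for illustration (not taken from Gigafida).
tokens = "dobra volja in dobra volja ter slaba volja".split()
ranked = rank_collocations(tokens)
for pair, count, score in ranked:
    print(pair, count, round(score, 2))
```

In a sketch like this, the frequency threshold filters out one-off pairs before scoring, and the score then orders the remaining candidates; manual checking, as planned for the final dictionary, would follow such a ranking.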