TOOLS AND RESOURCES

Digital dictionaries are designed with the characteristics of digital media and their users in mind.
A lexicon includes information about words, eg. parts of speech, inflections and accents.

Slovenian-Hungarian dictionary 2.0

Comprehensive Slovenian-Hungarian dictionary

The Comprehensive Slovenian-Hungarian dictionary is a general bilingual dictionary that is being compiled at the Centre for Language Resources and Technologies of the University of Ljubljana (CJVT UL). Version 2.0 contains 15,362 headwords, 61,190 translations, 28,748 collocations and other word combinations, and 7,741 examples. The Comprehensive Slovenian-Hungarian dictionary is a growing dictionary, which means that new headwords will be added in regular intervals.

Thesaurus of Modern Slovene 2.0

Thesaurus of Modern Slovene

The Thesaurus of Modern Slovene 2.0 contains 100,837 cues and 362,828 synonyms. Most of the entries are fully machine-prepared. In this version, we have manually prepared the semantic breakdown for 3,421 cues, described it with short semantic indicators and classified the synonyms under the corresponding meanings. Another novelty is the addition of manually checked antonyms, available for 3,599 cues. The meaning-structured entries and antonyms indicate the direction in which we would like to expand the dictionary in the future.

Collocations 2.0

The Collocations Dictionary of Modern Slovene

The second version of the Collocation Dictionary of Modern Slovene contains 81,443 cues and 4,491,958 collocations. The dictionary is produced using advanced computational methods for collocation recognition, which we have already tested for Slovene. They are being regularly improved and upgraded. In the current version, 2090 cues have been manually completed and 11227 already contain semantic segmentations and a combination of manually checked and automatic collocations. New features include a transparent three-tier ranking of entries according to their completion and a new way of user participation in the improvement of the dictionary.

 

Slovenian-Hungarian dictionary 1.0

Comprehensive Slovenian-Hungarian dictionary

The Comprehensive Slovenian-Hungarian dictionary is a general bilingual dictionary that is being compiled at the Centre for Language Resources and Technologies of the University of Ljubljana (CJVT UL). Version 1.0 contains 10,946 headwords, 33,298 translations, 15,265 collocations and other word combinations, and 2,416 examples. The dictionary compilation is co-funded by the Slovenian Research Agency (ARRS) through CJVT-dedicated funds as part of the Network of Infrastructure Centres at the University of Ljubljana. The Comprehensive Slovenian-Hungarian dictionary is a growing dictionary, which means that new headwords will be added in regular intervals.

Sloleks 2.0

Morphological Lexicon of Slovene

Sloleks 2.0 contains essential information on Slovene words, e.g. their part-of-speech category and their grammatical features. For each word, the database also contains all its inflected forms. All word form have been assigned accents and IPA transcriptions using automatic methods (using the eBralec speech synthesis system).

The Sloleks lexicon is intended both for language users who can browse the database through an online interface and find information on the inflectional, grammatical, and accentual features of Slovene word forms, as well as for developers of language technology applications.

Collocations 1.0

The Collocations Dictionary of Modern Slovene

This dictionary contains 35,989 headwords and 7,338,801 collocations. It is the first dictionary of collocations for Slovene and represents the first step towards filling the gap in the field of language resources for Slovene, particularly those aimed at facilitating language production. It is characterized by a phase-based entry representation, collocational data in context, and numerous filtering and sorting options.
It is a responsive dictionary, meaning that the compilation of the dictionary database is immediately followed by providing the language community with access to a large amount of relevant, albeit somewhat noisy language information.

Thesaurus of Modern Slovene 1.0

The Thesaurus of Modern Slovene

This thesaurus is the largest automatically generated open-access collection of Slovene synonyms and the first responsive dictionary that dynamically responds to chnages in language as well as the public opinion – it namely enables its users to add missing synonyms to the dictionary or vote on already existing entries. Unlike other similar language resources, the Thesaurus is based on a range of different databases and enables users to compare different synonyms based on their collocates and check their use in the Gigafida reference corpus of modern Slovene.

LBS

The Slovene lexical database
This lexical database is comprehensive syntactic and semantic description of a selected set of Slovene words. The description was based exclusively on the analysis of reference corpora of Slovene. Its wordlist was selected from 5,000 most frequent word in FidaPLUS and Gigafida corpora. In addition, we also considered a selection of words from school books in order to accommodate to the needs of school population. The database served as the primary source for preparing the concept of a new dictionary of modern Slovene.

Sloleks 1.0

Slovene Morphological Lexicon 1.0

Sloleks 1.0 is a lexicon of Slovene word forms. It contains, in an XML database, basic information on 100,000  Slovene words, especially their word class and related features. For each word, all its word forms are provided. Since Slovene is morphologically very rich language, each word has many word forms. Declension is typical of nouns, adjectives, pronouns, numerals, verbs and adverbs. The first version of the lexicon was made as part of the project Communication in Slovene.