As part of the project From Citizen Science to a Digital Dictionary Database, we will lexicographically validate the synonym contributions submitted by citizens and integrate them into the Digital Dictionary Database for Slovenian. Synonyms and antonyms will be reviewed and validated by lexicographers, who will ensure that any problematic entries (e.g., hateful or malicious suggestions) are not included in the Digital Dictionary Database for Slovenian. To ensure accuracy and clarity, we will use appropriate dictionary labels, which are particularly important when dealing with sensitive or negatively connoted vocabulary. This approach preserves the richness and diversity of the language while addressing potentially sensitive content in a thoughtful and respectful manner. We will integrate the cleaned data into the Digital Dictionary Database for Slovenian, which will significantly increase its value for the Large Language Models for Digital Humanities project.

We will also update the Thesaurus interface, enabling citizens and the research community to make better use of and gain insight into the project results. The validated data will contain at least 25,000 synonyms and antonyms collected with the help of citizens. The data will be published on the Clarin.si repository, ensuring open and long-term access for research and development.