{"id":946,"date":"2020-03-31T22:42:40","date_gmt":"2020-03-31T20:42:40","guid":{"rendered":"https:\/\/www.cjvt.starkmat.si\/kolos\/project\/"},"modified":"2026-03-08T14:33:37","modified_gmt":"2026-03-08T13:33:37","slug":"project","status":"publish","type":"page","link":"https:\/\/www.cjvt.si\/prop\/en\/","title":{"rendered":"About the project"},"content":{"rendered":"<div class=\"flex_column av_one_full  flex_column_div av-zero-column-padding first  avia-builder-el-0  avia-builder-el-no-sibling  \" style='border-radius:0px; '><div  class=\"togglecontainer  av-minimal-toggle  avia-builder-el-1  avia-builder-el-no-sibling \" >\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-1\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-1-container\">Project scope<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-1-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p>The aim of the project <em>Empirical foundations for digitally-supported development of writing skills<\/em> (PROP) is to support teachers who correct and grade student writing. The development of writing competency involves practising writing skills \u2013 however, more writing also means more work for teachers. Research has shown that giving individualised, goal-oriented and formative feedback leads to best literacy results. On the other hand, it is time consuming and demands support in the form of adequate descriptors, indicators and, not least, information about modern language use in various communication situations. In many languages, including Slovene, these conditions are not met, which is why there is (too) little school writing, while feedback often remains limited to surface corrections of errors, such as grammatical mistakes.<\/p>\n<p>We believe the solution lies in digital support of teachers\u2019 work. On the one hand, automatic identification and substantive categorisation of grammatical errors would free teachers from routine corrections and give them more time to pursue advanced teaching objectives. On the other, a digitally-supported model of providing feedback, based on empirically founded indicators and descriptors, would ease the preparation of corrective study materials and allow for peer assessment and long-term development monitoring. Advances in the field of natural language processing, machine learning, and corpus linguistics make this an attainable goal, which is attested by a variety of digital tools, prototypes, and portals currently being designed. The innovative aspect of the project, which necessitates interdisciplinary collaboration, lies in its proposal of solutions that are based on empirical data: authentic teacher practices coupled with data on real, modern language use.<\/p>\n<p>Pertaining to the latter, Slovene may have a relative advantage over other languages, since it already possesses a corpus of student texts from Slovene primary and secondary schools, which also includes teacher corrections categorised by type of language problem. This corpus represents untapped potential for empirical analyses of authentic school written production and language corrections and for development of a tool that would automatically categorise language problems based on real-life principles of teacher correction.<\/p>\n<p>During the course of the project, we will use automatic extraction and analyses of richly annotated corpora to collect empirical data needed for specifying developmental indicators and descriptors for various educational stages. We will then use this information to design feedback scenarios for different language levels: spelling, morphology, vocabulary, and syntax. We will expand the corpus of school texts by including examples of student writing from the tertiary level and empirically research the specifics of providing feedback in higher education settings. Next, we will develop a tool that automatically identifies language problems in a given text, taking into account the level of the writer. Given the rarity of language resources such as corpus \u0160olar, we will also test the performance of the tool on other comparable training datasets, adapt it to be more independent and thus make the methodology applicable to other languages.<\/p>\n<p>User research is a key part of our project: we will examine existing practices of providing teacher feedback in the development of writing skills, first by conducting a web survey and then by recording teachers\u2019 screens while they are correcting in a digital environment. Teachers and students will furthermore conduct user evaluations of the solutions, developed in the project. Lastly, we intend to combine findings in formative assessment with crowdsourcing and apply this to language didactics in order to develop a strategy of digitally-supported development of writing skills, which will take into consideration all the necessary didactical and ethical issues related to the field.<\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-2\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-2-container\">Project goals<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-2-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p><strong>Corpora and corpus data:<\/strong> richly annotate a corpus of student writing, a corpus of school textbooks, and a corpus of literature aimed at youngsters and young adults; use data extraction and corpus analyses to facilitate empirical foundations for developmental indicators and descriptors for various educational stages; use these results to develop feedback scenarios on the levels of spelling, morphology, vocabulary, and syntax; build a pilot corpus of student academic writing and include it in all of the above steps.<\/p>\n<p><strong>Software module:<\/strong> develop a software module which automatically identifies and categorises language errors on different language levels; adapt the software to teachers\u2019 needs and the specifics of providing feedback at different educational stages; create the foundation for applying the methodology to other languages.<\/p>\n<p><strong>User research:<\/strong> empirically research existing teaching practices of providing feedback for the development of writing skills with the aid of a) an online survey and b) screen recording of teacher corrections in a digital environment; create the basis for comparable research in other languages; include teachers\/lecturers in the evaluation of the software module and teachers\/lecturers and pupils\/students in the evaluation of the corpus-based feedback scenarios.<\/p>\n<p><strong>Models and strategies:<\/strong> combine findings from the fields of formative assessment and crowdsourcing for the needs of language education and create a model for providing digitally-supported feedback to help the development of writing skills; form a strategy of digitally-supported development of writing skills.<\/p>\n<p><strong>Dissemination of research results:<\/strong> ensure that the results are published in keeping with the National strategy for open access to scientific publications and research data in Slovenia; inform the scientific and general public about project results (scientific publications, events, website) and encourage further exploitation of the results.<\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-3\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-3-container\">Project group<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-3-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p><strong>University of Ljubljana, Faculty of Arts<\/strong><br \/>\n&#8211; \u0160pela Arhar Holdt, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opdescr=search&amp;opt=2&amp;subopt=1&amp;code1=cmn&amp;code2=auto&amp;search_term=arhar%20holdt,%20%C5%A1pela\">27674<\/a><br \/>\n&#8211; Iztok Kosem, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=iztok%20kosem\">33796<\/a><br \/>\n&#8211; Polona Gantar, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=apolonija%20gantar\">16313<\/a><br \/>\n&#8211; Marko Stabej, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=marko%20stabej\">11651<\/a><br \/>\n&#8211; Teja Goli, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=teja%20goli\">52176<\/a><br \/>\n&#8211; Magdalena Gapsa, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=magdalena%20gapsa\">53628<\/a><br \/>\n&#8211; Mija Bon, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=mija%20bon\">51891<\/a><br \/>\n&#8211; <em>Eva Pori,\u00a0<a href=\"https:\/\/cris.cobiss.net\/ecris\/si\/sl\/researcher\/47727\">51456<\/a><\/em><br \/>\n&#8211; <em>Tina Munda<\/em><\/p>\n<p><strong>University of Ljubljana, Faculty of Computer and Information Science<\/strong><br \/>\n&#8211; Marko Robnik-\u0160ikonja, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=marko%20robnik%20%C5%A1ikonja\">15295<\/a><br \/>\n&#8211; Simon Krek, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=simon%20krek\">26166<\/a><br \/>\n&#8211; Matej Ul\u010dar, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=matej%20ul%C4%8Dar\">55173<\/a><br \/>\n&#8211; Ale\u0161 \u017dagar, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=ale%C5%A1%20%C5%BEagar\">56007<\/a><br \/>\n&#8211; Matej Klemen, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/rsr.aspx?lang=slv&amp;opdescr=search&amp;opt=2&amp;subopt=300&amp;code1=cmn&amp;code2=auto&amp;psize=10&amp;hits=4&amp;page=1&amp;count=1&amp;id=52572&amp;slng=slv&amp;search_term=matej+klemen&amp;order_by=\">55754<\/a><br \/>\n&#8211; Martin Bo\u017ei\u010d,\u00a0<a href=\"https:\/\/cris.cobiss.net\/ecris\/si\/sl\/researcher\/55408\">58277<\/a><br \/>\n&#8211; Ga\u0161per Jelov\u010dan,\u00a0<a href=\"https:\/\/cris.cobiss.net\/ecris\/si\/sl\/researcher\/56881\">59561<\/a><br \/>\n&#8211; Tadej \u0160kvorc,\u00a0<a href=\"https:\/\/cris.cobiss.net\/ecris\/si\/sl\/researcher\/46947\">50769<\/a><br \/>\n&#8211; <em>Sara Sever<\/em><br \/>\n&#8211; <em>Tinca Lukan<\/em><\/p>\n<p><strong>University of Ljubljana, Faculty of Public Administration<\/strong><br \/>\n&#8211; Tadeja Rozman, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/rsr.aspx?lang=slv&amp;opdescr=search&amp;opt=2&amp;subopt=300&amp;code1=cmn&amp;code2=auto&amp;psize=10&amp;hits=2&amp;page=1&amp;count=1&amp;id=18963&amp;slng=slv&amp;search_term=tadeja+rozman&amp;order_by=\">25578<\/a><\/p>\n<p><strong>University of Ljubljana, Faculty of Education<\/strong><br \/>\n&#8211; Karmen Pi\u017eorn, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=karmen%20pi%C5%BEorn\">21612<\/a><br \/>\n&#8211; Alenka Rot Vrhovec, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=Alenka%20Rot%20Vrhovec\">34816<\/a><br \/>\n&#8211; Lara Godec Sor\u0161ak, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=lara%20godec\">25590<\/a><br \/>\n&#8211; Milena Ko\u0161ak Babuder, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/search_basic.aspx?lang=slv&amp;opt=2&amp;subopt=1&amp;opdescr=search&amp;code1=cmn&amp;code2=auto&amp;search_term=milena%20ko%C5%A1ak%20babuder\">26199<\/a><br \/>\n&#8211; Toma\u017e Petek, <a href=\"https:\/\/www.sicris.si\/public\/jqm\/rsr.aspx?lang=slv&amp;opdescr=search&amp;opt=2&amp;subopt=300&amp;code1=cmn&amp;code2=auto&amp;psize=10&amp;hits=2&amp;page=1&amp;count=2&amp;id=35226&amp;slng=slv&amp;search_term=toma%c5%be+petek&amp;order_by=\">32433<\/a><\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-4\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-4-container\">Project timeline<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-4-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p><strong>1. Corpus analysis of written production at various educational stages<\/strong><br \/>\n&#8211; Corpus data preparation for linguistic and machine tasks [M1-6]<br \/>\n&#8211; Compiling a pilot corpus of student academic texts [M1-12]<br \/>\n&#8211; Quantitative and qualitative linguistic analyses of student writing [M1-18]<br \/>\n&#8211; Empirical data for developmental indicators on levels of vocabulary and syntax [M7-18]<br \/>\n<strong>2. Practice-based digitally-supported development of writing skills<\/strong><br \/>\n&#8211; Questionnaire survey about teaching practices used to develop writing skills [M1-18]<br \/>\n&#8211; Recording teacher corrections of student writing and semi-structured interviews [M10-24]<br \/>\n&#8211; Designing a strategy for digitally-supported development of writing skills [M19-35]<br \/>\n<strong>3. Development and evaluation of automatic identification and categorisation of language problems<\/strong><br \/>\n&#8211; Designing a model for automatic error annotation [M7-35]<br \/>\n&#8211; Testing the applicability of methodology to other languages [M25-35]<br \/>\n&#8211; Linguistic and teacher evaluation of automatic annotation of texts [M13-35]<br \/>\n<strong>4. Providing feedback in digital environment<\/strong><br \/>\n&#8211; Developing a model combining formative assessment and crowdsourcing [M7-18]<br \/>\n&#8211; Corpus-based and scaffolded feedback scenarios [M13-24]<br \/>\n&#8211; Testing feedback with target user groups [M22-35]<br \/>\n<strong>5. Coordination and dissemination<\/strong><br \/>\n&#8211; Coordination, reporting and dissemination [M1-36]<br \/>\n&#8211; Scientific publications and research data [M1-36]<\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-5\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-5-container\">Corpus analysis of written production at various educational stages<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-5-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p><b>Corpus data preparation for linguistic and machine tasks<\/b><\/p>\n<p>As part of the project, we prepared three text corpora specialized for educational use: the corpus of student writing <em>\u0160olar 3.0<\/em>, the open-access school textbook corpus <em>ccU\u010dbeniki 1.0<\/em>, and the youth literature corpus <em>ccMaks 1.0<\/em>. Prior to the project, <em>\u0160olar<\/em> was available in version 2.0, <em>Maks<\/em> was only accessible via concordancers (not as a full database), and the <em>U\u010dbeniki<\/em> corpus was not publicly available. All three resources had previously been linguistically annotated, but using older automatic tagging tools. In this project, we uniformly re-annotated the corpora using the <em>CLASSLA v1.1.1 <\/em>tagger, covering tokenization, sentence segmentation, lemmatization, and morphosyntactic tagging according to the <em>MULTEXT-East<\/em> v6 standard, as well as dependency syntax (<em>JOS<\/em>) and named entity recognition.\u00a0In collaboration with other projects, we also ensured that two additional corpora for Slovene as a second language were prepared using the same standards: the <em>KUUS<\/em> textbook corpus and the <em>KOST<\/em> corpus of Slovene as a foreign language. This uniformity allows for more reliable data comparison, while higher-level linguistic annotations support more advanced linguistic and computational analyses, as well as better data usability for machine learning applications.<\/p>\n<p>The corpora are openly available in the CLARIN.SI repository:<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, ROZMAN, Tadeja, STRITAR KU\u010cUK, Mojca, KREK, Simon, KRAP\u0160 VODOPIVEC, Irena, STABEJ, Marko, PORI, Eva, GOLI, Teja, LAVRI\u010c, Polona, LASKOWSKI, Cyprian Adam, KOCJAN\u010cI\u010c, Polonca, KLEMENC, Bojan, KRSNIK, Luka, KOSEM, Iztok. <em>Developmental corpus \u0160olar 3.0.<\/em> Slovenian language resource repository CLARIN.SI, ISSN 2820-4042,\u00a0<a href=\"http:\/\/hdl.handle.net\/11356\/1589\">http:\/\/hdl.handle.net\/11356\/1589<\/a>. [COBISS.SI-ID 124160003]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KOSEM, Iztok, PORI, Eva, \u017dAGAR, Ale\u0161, ARHAR HOLDT, \u0160pela. <em>Corpus of Slovenian textbooks ccU\u010dbeniki 1.0<\/em>. Slovenian language resource repository CLARIN.SI, ISSN 2820-4042,\u00a0<a href=\"http:\/\/hdl.handle.net\/11356\/1693\">http:\/\/hdl.handle.net\/11356\/1693<\/a>. [COBISS.SI-ID 129443843]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">VERDONIK, Darinka, MAJNINGER, Sandi, DOBROVOLJC, Kaja, ANTLOGA, \u0160pela, Z\u00d6GLING MARKU\u0160, Aleksandra, VOR\u0160I\u010c, Ines, ZEMLJAK JONTES, Melita, KOLETNIK, Mihaela, VALH LOPERT, Alenka, \u0160EK, Polonca, KOSEM, Iztok, MAJHENI\u010c, Simona, FERME, Marko, \u017dAGAR, Ale\u0161, ARHAR HOLDT, \u0160pela. <em>Corpus of Slovenian texts for pedagogical purposes ccMAKS 1.0<\/em>. Slovenian language resource repository CLARIN.SI, ISSN 2820-4042,\u00a0<a href=\"http:\/\/hdl.handle.net\/11356\/1692\">http:\/\/hdl.handle.net\/11356\/1692<\/a>. [COBISS.SI-ID 129467395]<\/p>\n<p>The corpus preparation process\u2014which is particularly demanding for corpora containing linguistic corrections, such as <em>\u0160olar 3.0<\/em>\u2014was presented at conferences, in a monograph, and in the prestigious journal <em>Language Resources and Evaluation<\/em>.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, KOSEM, Iztok. \u0160olar, the developmental corpus of Slovene.\u00a0<i><span style=\"font-weight: 400;\">Language resources and evaluation<\/span><\/i><span style=\"font-weight: 400;\">. 2024, str. 1-27. DOI: <\/span><a href=\"https:\/\/dx.doi.org\/10.1007\/s10579-024-09758-4\"><span style=\"font-weight: 400;\">10.1007\/s10579-024-09758-4<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">204228867<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, KOSEM, Iztok, STRITAR KU\u010cUK, Mojca. Metode in orodja za la\u017ejo pripravo korpusov usvajanja jezika. V: PIRIH SVETINA, Nata\u0161a (ur.), FERBE\u017dAR, Ina (ur.).\u00a0<i><span style=\"font-weight: 400;\">Na sti\u010di\u0161\u010du svetov : sloven\u0161\u010dina kot drugi in tuji jezik<\/span><\/i><span style=\"font-weight: 400;\">. 1. natis. Ljubljana: Zalo\u017eba Univerze, 2022. Str. 23-30. Zbirka Obdobja, 41.<\/span><span style=\"font-weight: 400;\">\u00a0DOI:\u00a0<\/span><a href=\"https:\/\/dx.doi.org\/10.4312\/Obdobja.41.23-30\"><span style=\"font-weight: 400;\">10.4312\/Obdobja.41.23-30<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0129063939]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, PORI, Eva, KOSEM, Iztok. Prihodnost korpusa \u0160olar. V: ARHAR HOLDT, \u0160pela (ur.), KREK, Simon (ur.).\u00a0<i><span style=\"font-weight: 400;\">Razvoj sloven\u0161\u010dine v digitalnem okolju<\/span><\/i><span style=\"font-weight: 400;\">. Ljubljana: Zalo\u017eba Univerze, 2023. Str. 61-91. Sporazumevanje. <\/span><a href=\"https:\/\/ebooks.uni-lj.si\/ZalozbaUL\/catalog\/view\/522\/852\/9442\"><span style=\"font-weight: 400;\">https:\/\/ebooks.uni-lj.si\/ZalozbaUL\/catalog\/view\/522\/852\/9442<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0185543683]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KLEMEN, Matej, ARHAR HOLDT, \u0160pela, POLLAK, Senja, KOSEM, Iztok, HUBER, Damjan, LUTAR, Mateja. Korpus u\u010dbenikov za u\u010denje sloven\u0161\u010dine kot drugega in tujega jezika. V: PIRIH SVETINA, Nata\u0161a (ur.), FERBE\u017dAR, Ina (ur.).\u00a0<i><span style=\"font-weight: 400;\">Na sti\u010di\u0161\u010du svetov : sloven\u0161\u010dina kot drugi in tuji jezik<\/span><\/i><span style=\"font-weight: 400;\">. Ljubljana: Zalo\u017eba Univerze, 2022. Str. 165-174. Zbirka Obdobja, 41. <\/span><span style=\"font-weight: 400;\">DOI:\u00a0<\/span><a href=\"https:\/\/dx.doi.org\/10.4312\/Obdobja.41.165-174\"><span style=\"font-weight: 400;\">10.4312\/Obdobja.41.165-174<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0129975811]<\/span><\/p>\n<p><b>Compiling a pilot corpus of student academic texts<\/b><\/p>\n<p>We developed a new pilot corpus of student writing, <em>KO\u0160,<\/em> which includes written texts by students from the Faculty of Public Administration and the Faculty of Education at the University of Ljubljana. The corpus contains 426 texts (542,066 tokens). The texts were collected following the <em>\u0160olar<\/em> corpus preparation methodology, which involves recording all relevant metadata, including teacher-provided language corrections, ensuring legal compliance for open access, and formatting the data in a compatible format.<\/p>\n<blockquote>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%; padding-left: 10px;\">Text collection agreement for students: <a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/01\/PROP_KOS_Pogodba-studenti-digitalni.pdf\">digital signature<\/a> \/\u00a0<a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/01\/PROP_KOS_Pogodba-studenti-tiskana.pdf\">manual signature<\/a>.<\/p>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%; padding-left: 10px;\">Text collection agreement for professors: <a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/01\/PROP_KOS_Pogodba-profesorji-digitalni.pdf\">digital signature<\/a> \/\u00a0<a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/01\/PROP_KOS_Pogodba-profesorji-tiskana.pdf\">manual signature<\/a>.<\/p>\n<\/blockquote>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ROZMAN, Tadeja, ARHAR HOLDT, \u0160pela, \u017dAGAR, Ale\u0161, STABEJ, Marko, PERME, Kaja, ZUPAN, Ne\u017ea, GODEC SOR\u0160AK, Lara.\u00a0<em>Pilot corpus of student academic texts KO\u0160 1.0.<\/em>\u00a0Slovenian language resource repository CLARIN.SI, ISSN 2820-4042, http:\/\/hdl.handle.net\/11356\/2048.\u00a0[COBISS.SI-ID\u00a0258382339]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ROZMAN, Tadeja, ARHAR HOLDT, \u0160pela. Gradnja Korpusa \u0161tudentskih besedil KO\u0160. V: FI\u0160ER, Darja (ur.), ERJAVEC, Toma\u017e (ur.).\u00a0<i><span style=\"font-weight: 400;\">Jezikovne tehnologije in digitalna humanistika: zbornik konference: 15.-16. september 2022, Ljubljana, Slovenija. <\/span><\/i><span style=\"font-weight: 400;\">Ljubljana: In\u0161titut za novej\u0161o zgodovino, 2022. Str. 267-270. <\/span><a href=\"https:\/\/nl.ijs.si\/jtdh22\/pdf\/JTDH2022_Rozman_ArharHoldt_Gradnja-Korpusa-studentskih-besedil-KOS.pdf\"><span style=\"font-weight: 400;\">https:\/\/nl.ijs.si\/jtdh22\/pdf\/JTDH2022_Rozman_ArharHoldt_Gradnja-Korpusa-studentskih-besedil-KOS.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">131012099<\/span><span style=\"font-weight: 400;\">] <a href=\"https:\/\/nl.ijs.si\/jtdh22\/video\/JTDH2022_Rozman_ArharHoldt_Gradnja-Korpusa-studentskih-besedil-KOS.mp4\">Posnetek predstavitve.<\/a><\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ROZMAN, Tadeja. Pilotni korpus KO\u0160 in smernice za gradnjo korpusa \u0161tudentskih besedil. <em>Sloven\u0161\u010dina 2.0: empiri\u010dne, aplikativne in interdisciplinarne raziskave<\/em>. 2025, letn. 13, \u0161t. 1, str. 120-137<\/span><span style=\"font-weight: 400;\">, DOI: <\/span><a href=\"https:\/\/dx.doi.org\/10.4312\/slo2.0.2025.1.120-137\"><span style=\"font-weight: 400;\">10.4312\/slo2.0.2025.1.120-137<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID <\/span><span style=\"font-weight: 400;\">263403523<\/span><span style=\"font-weight: 400;\">]\u00a0<\/span><\/p>\n<p>Since writing and the provision of feedback at the tertiary level differ somewhat from the secondary level\u2014captured in the<em> \u0160olar<\/em> corpus\u2014we upgraded the methodology accordingly. To further explore current feedback practices, we conducted a survey involving as many as 459 educators teaching at Slovenian public universities and independent higher education institutions. The survey results were presented to the public, and the research data were published in open access.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ROZMAN, Tadeja, ARHAR HOLDT, \u0160pela, STABEJ, Marko.\u00a0<i><span style=\"font-weight: 400;\">Podajanje povratnih informacij o \u0161tudentskih besedilih: raziskovalni podatki = Feedback on student writing: research data<\/span><\/i><span style=\"font-weight: 400;\">. Ljubljana: [s. n.], 2023. <\/span><span style=\"font-weight: 400;\">Repozitorij Univerze v Ljubljani \u2013 RUL<\/span><span style=\"font-weight: 400;\">, DOI:\u00a0<\/span><a href=\"https:\/\/dx.doi.org\/20.500.12556\/RUL-152767\"><span style=\"font-weight: 400;\">20.500.12556\/RUL-152767<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">176945923<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ROZMAN, Tadeja, STABEJ, Marko. Univerzitetno pisanje in popravljanje besedil: prakse in stali\u0161\u010da. V: \u0160TUMBERGER, Sa\u0161ka (ur.). <i>Predpis in norma v jeziku<\/i>. Ljubljana: Zalo\u017eba Univerze, 2024. Str. 285-292. Zbirka Obdobja, 43. DOI:\u00a0<a href=\"https:\/\/dx.doi.org\/10.4312\/Obdobja.43.285-292\" target=\"_blank\" rel=\"noopener\">10.4312\/Obdobja.43.285-292<\/a>. [COBISS.SI-ID 215458307]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ROZMAN, Tadeja. Slovenske empiri\u010dne raziskave o razvoju strokovne sporazumevalne zmo\u017enosti na univerzi: kje smo in kako naprej?. V: KOVA\u010cEVI\u0106, Borko (ur.).\u00a0<em>Modern approaches to old and new challenges: book of abstracts: the 8th international congress of Applied Linguistics Today<\/em>: Faculty of Philology, University of Belgrade: 23\u201325 May 2025. Belgrade: University, Faculty of Philology, 2025. Str. 161-162.\u00a0<a href=\"https:\/\/alt8.fil.bg.ac.rs\/bookOfAbstracts\">https:\/\/alt8.fil.bg.ac.rs\/bookOfAbstracts<\/a>. [COBISS.SI-ID 245534211]<\/p>\n<p><b>Quantitative and qualitative linguistic analyses of student writing<\/b><\/p>\n<p>For both quantitative and qualitative analyses of school writing, high-quality and reliable annotation of language corrections in student texts is essential. In this project, we improved the annotation methodology, particularly by upgrading the <em data-start=\"301\" data-end=\"308\">\u0160olar<\/em> annotation scheme and the annotation tool <a href=\"https:\/\/orodja.cjvt.si\/svala\/#\"><em>CJVT Svala<\/em><\/a>. These advancements were presented at the established national symposium <em data-start=\"437\" data-end=\"446\">Obdobja<\/em> and at the international <em data-start=\"472\" data-end=\"478\">LREC<\/em> conference.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, LAVRI\u010c, Polona, ROBLEK, Rebeka, GOLI, Teja, BON, Mija, 2023:\u00a0<em>Categorizing Teachers\u2019 Corrections: Guidelines for Annotating the \u0160olar Corpus.<\/em> Version 1.2. Prepared in the project Empirical foundations for digitally-supported development of writing skills. <a href=\"https:\/\/wiki.cjvt.si\/books\/11-developmental-corpus-solar\/page\/annotation-guidelines\">https:\/\/wiki.cjvt.si\/books\/11-developmental-corpus-solar\/page\/annotation-guidelines.<\/a><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, ERJAVEC, Toma\u017e, KOSEM, Iztok, VOLODINA, Elena. Towards an ideal tool for learner error annotation. V: CALZOLARI, Nicoletta (ur.).\u00a0<i><span style=\"font-weight: 400;\">The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024): main conference proceedings: 20-25 May, 2024, Torino, Italia<\/span><\/i><span style=\"font-weight: 400;\">. [Paris]: ELRA Language Resources Association (ELRA); [Stroudsburg]: International Committee on Computational Linguistics, cop. 2024. Str. 16392-16398. <\/span><a href=\"https:\/\/aclanthology.org\/2024.lrec-main.1424.pdf\"><span style=\"font-weight: 400;\">https:\/\/aclanthology.org\/2024.lrec-main.1424.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">199958019<\/span><span style=\"font-weight: 400;\">]. <a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/06\/POSTER-LREC_2024-Towards-an-Ideal-Tool-for-Learner-Error-Annotation.pdf\">POSTER PDF<\/a><\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, POPI\u010c, Damjan, STRITAR KU\u010cUK, Mojca. Primerjava sistemov za ozna\u010devanje jezikovnih popravkov v \u0161tirih slovenskih besedilnih korpusih. V: \u0160TUMBERGER, Sa\u0161ka (ur.).\u00a0<i>Predpis in norma v jeziku<\/i>. Ljubljana: Zalo\u017eba Univerze, 2024. Str. 11-20. Zbirka Obdobja, 43.\u00a0DOI:\u00a0<a href=\"https:\/\/dx.doi.org\/10.4312\/Obdobja.43.11-20\" target=\"_blank\" rel=\"noopener\">10.4312\/Obdobja.43.11-20<\/a>. [COBISS.SI-ID\u00a0215306243]<\/p>\n<p>Using advanced data extraction from the <em data-start=\"97\" data-end=\"108\">\u0160olar 3.0<\/em> corpus, we compiled a frequency list of language issues containing 36,570 sentences from student writing, each corrected by a teacher. The corrections were manually categorized into 180 distinct types based on their content. Each sentence is annotated with metadata such as the type of source text, the educational level of the author, and the type and region of the school where the text was produced. The dataset reveals which issues teachers at various educational levels focus on most, how they correct them, which problems are most frequent, and which are regionally conditioned. We conducted statistical analyses of the data and presented the most persistent language difficulties\u2014those that remain present in student writing up to the end of secondary school\u2014at the <em data-start=\"882\" data-end=\"888\">TALC<\/em> conference. For the qualitative linguistic analysis, which focused both on typical writing difficulties and correction patterns, we selected two topics: comma usage and related errors, and language variants in the use of multi-word expressions.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, ROZMAN, Tadeja, STRITAR KU\u010cUK, Mojca, KREK, Simon, KRAP\u0160 VODOPIVEC, Irena, STABEJ, Marko, PORI, Eva, GOLI, Teja, LAVRI\u010c, Polona, LASKOWSKI, Cyprian Adam, KOCJAN\u010cI\u010c, Polonca, KLEMENC, Bojan, KRSNIK, Luka, \u017dAGAR, Ale\u0161, KOSEM, Iztok.\u00a0<i><span style=\"font-weight: 400;\">Frequency list of language problems from \u0160olar 3.0<\/span><\/i><span style=\"font-weight: 400;\">. Slovenian language resource repository CLARIN.SI, ISSN 2820-4042, <a href=\"http:\/\/hdl.handle.net\/11356\/1716\">http:\/\/hdl.handle.net\/11356\/1716<\/a>.<\/span><span style=\"font-weight: 400;\">\u00a0[COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">130413571<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela. Leveraging frequency list of language problems from \u0160olar 3.0. V:\u00a0<i><span style=\"font-weight: 400;\">TaLC 2024: 16th Teaching and Language Corpora Conference: July 7th to 10th 2024, Manchester Metropolitan University, Manchester, UK<\/span><\/i><i><span style=\"font-weight: 400;\">: book of abstracts<\/span><\/i><span style=\"font-weight: 400;\">. [Manchester: Manchester Metropolitan University], 2024. Str. [121]. <\/span><a href=\"https:\/\/talc2024.co.uk\/wp-content\/uploads\/2024\/07\/book-of-abstracts-talc-2024_final-4.pdf\"><span style=\"font-weight: 400;\">https:\/\/talc2024.co.uk\/wp-content\/uploads\/2024\/07\/book-of-abstracts-talc-2024_final-4.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">204247555<\/span><span style=\"font-weight: 400;\">] <a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/06\/POSTER-TALC_2024-Leveraging-Frequency-List-of-Language-Problems-from-Solar-3.0.pdf\">POSTER PDF<\/a><\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">BON, Mija, GAPSA, Magdalena. Analiza napak pri rabi vejice v \u0161olskih spisih. V: MARU\u0160I\u010c, Franc (ur.), et al.\u00a0<i>\u0160krab\u010devi dnevi 13: zbornik prispevkov s simpozija 2023. Nova Gorica<\/i>. Nova Gorica: Zalo\u017eba univerze, 2025. Str. 1\u201315.\u00a0<a href=\"https:\/\/ung.si\/media\/publishing\/2025\/03\/12\/08\/24\/17\/Zbornik-SD13-2025-koncna.pdf\">https:\/\/ung.si\/media\/publishing\/2025\/03\/12\/08\/24\/17\/Zbornik-SD13-2025-koncna.pdf<\/a>. <span style=\"font-weight: 400;\">[COBISS.SI-ID <\/span><span style=\"font-weight: 400;\">247838723<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">GANTAR, Polona, BON, Mija. Dati skozi ali prestati?: Napake in jezikovne variante v rabi ve\u010dbesednih enot pri samostojnem tvorjenju besedil v osnovni in srednji \u0161oli. <em>Sodobna pedagogika,<\/em> okt. 2025, letn. 76 = 142, \u0161t. 3, str. 39-58,<\/span><span style=\"font-weight: 400;\">\u00a0DOI: <\/span><a href=\"https:\/\/dx.doi.org\/10.63384\/sptB5_z789s\"><span style=\"font-weight: 400;\">10.63384\/sptB5_z789s<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID <\/span><span style=\"font-weight: 400;\">259193859<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p><b>Empirical data for developmental indicators on levels of vocabulary and syntax<\/b><\/p>\n<p>We established a methodology for extracting core vocabulary lists from pedagogical corpora, which included updating the corpus extraction tool <em data-start=\"200\" data-end=\"206\">LIST<\/em> to version 1.3. We also developed and described a methodology for extracting syntactic information from pedagogical corpora. We generated frequency lists of lemmas from the textbook corpus and compiled core vocabulary lists for levels A1, A2, and B1, based on the Common European Framework of Reference for Languages (CEFR). Special attention was given to vocabulary at the A1 level, for which we developed a lexical description concept that includes both authentic and pedagogically adapted corpus examples and collocations. All resources and tools were published openly on the CLARIN.SI repository, and the results were presented at an international conference.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KRSNIK, Luka, ARHAR HOLDT, \u0160pela, \u010cIBEJ, Jaka, DOBROVOLJC, Kaja, KLJU\u010cEV\u0160EK, Aleksander, KREK, Simon, ROBNIK \u0160IKONJA, Marko. <i><span style=\"font-weight: 400;\">Corpus extraction tool LIST 1.3<\/span><\/i><span style=\"font-weight: 400;\">. Slovenian language resource repository CLARIN.SI, ISSN 2820-4042,\u00a0<a href=\"http:\/\/hdl.handle.net\/11356\/1964\">http:\/\/hdl.handle.net\/11356\/1964<\/a>.<\/span><span style=\"font-weight: 400;\">\u00a0[COBISS.SI-ID <\/span><span style=\"font-weight: 400;\">218014211<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KOSEM, Iztok, PORI, Eva, ARHAR HOLDT, \u0160pela.\u00a0<i><span style=\"font-weight: 400;\">Frequency list of textbook vocabulary by level of education in elementary and secondary schools<\/span><\/i><span style=\"font-weight: 400;\">. Slovenian language resource repository CLARIN.SI, ISSN 2820-4042,\u00a0<a href=\"http:\/\/hdl.handle.net\/11356\/1719\">http:\/\/hdl.handle.net\/11356\/1719<\/a>.<\/span><span style=\"font-weight: 400;\">\u00a0[COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">192040707<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KLEMEN, Matej, ARHAR HOLDT, \u0160pela, POLLAK, Senja.\u00a0<i><span style=\"font-weight: 400;\">Core vocabulary for Slovenian as L2 1.0<\/span><\/i><span style=\"font-weight: 400;\">. Slovenian language resource repository CLARIN.SI, ISSN 2820-4042,\u00a0<a href=\"http:\/\/hdl.handle.net\/11356\/1697\">http:\/\/hdl.handle.net\/11356\/1697<\/a>.<\/span><span style=\"font-weight: 400;\">\u00a0[COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">130844419<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">PORI, Eva, KNEZ, Mihaela, KOSEM, Iztok, ARHAR HOLDT, \u0160pela, KLEMEN, Matej, GANTAR, Polona, ZGAGA, Karolina, ROBLEK, Rebeka.\u00a0<i><span style=\"font-weight: 400;\">A1 core vocabulary with lexical information for Slovenian 1.0<\/span><\/i><span style=\"font-weight: 400;\">. Slovenian language resource repository CLARIN.SI, ISSN 2820-4042,\u00a0<a href=\"http:\/\/hdl.handle.net\/11356\/1896\">http:\/\/hdl.handle.net\/11356\/1896<\/a>.<\/span><span style=\"font-weight: 400;\">\u00a0[COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">192040963<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KLEMEN, Matej, ARHAR HOLDT, \u0160pela, POLLAK, Senja, KOSEM, Iztok, PORI, Eva, GANTAR, Polona, KNEZ, Mihaela. Building a CEFR-labeled core vocabulary and developing a lexical resource for Slovenian as a second and foreign language. V: MEDVE\u010e, Marek (ur.), et al.\u00a0<i><span style=\"font-weight: 400;\">eLex 2023: electronic lexicography in the 21st century (eLex 2023): proceedings of the eLex 2023 conference: [Brno], 27\u201329 June 2023<\/span><\/i><span style=\"font-weight: 400;\">. Brno: Lexical Computing CZ, 2023. Str. 654-668. Electronic lexicography in the 21st century, <\/span><a href=\"https:\/\/elex.link\/elex2023\/wp-content\/uploads\/118.pdf\"><span style=\"font-weight: 400;\">https:\/\/elex.link\/elex2023\/wp-content\/uploads\/118.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">158856451<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p>For syntactic studies, we developed a methodology for extracting syntactic information from pedagogical corpora. We published openly accessible frequency lists of collocations from the <em data-start=\"186\" data-end=\"197\">\u0160olar 3.0<\/em> corpus and the <em data-start=\"213\" data-end=\"227\">U\u010dbeniki 1.0<\/em> corpus in the CLARIN.SI repository, as well as frequency lists of syntactic structures from both corpora. These data can serve as a basis for developing empirically grounded developmental benchmarks, descriptors, and other materials for learning Slovene. As part of the project, we also prepared two linguistic analyses comparing the characteristics of student writing and textbooks across educational levels, focusing on both vocabulary and syntax.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">MUNDA, Tina, ARHAR HOLDT, \u0160pela, ROZMAN, Tadeja, STRITAR KU\u010cUK, Mojca, KREK, Simon, KRAP\u0160 VODOPIVEC, Irena, STABEJ, Marko, PORI, Eva, GOLI, Teja, LAVRI\u010c, Polona, LASKOWSKI, Cyprian Adam, KOCJAN\u010cI\u010c, Polonca, KLEMENC, Bojan, KRSNIK, Luka, KOSEM, Iztok. <i>Frequency list of collocations from the \u0160olar 3.0 corpus<\/i>. Ljubljana: University of Ljubljana, Centre for Language Resources and Technologies: University of Ljubljana, Faculty of Arts, 2025. CLARIN.SI data &amp; tools. ISSN 2820-4042. <a href=\"http:\/\/hdl.handle.net\/11356\/2011\" target=\"_blank\" rel=\"noopener\">http:\/\/hdl.handle.net\/11356\/2011<\/a>. [COBISS.SI-ID\u00a0225465859]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">MUNDA, Tina, ARHAR HOLDT, \u0160pela, KOSEM, Iztok, PORI, Eva, KREK, Simon.\u00a0<i>Frequency list of collocations from the U\u010dbeniki 1.0 corpus<\/i>. Ljubljana: University of Ljubljana, Centre for Language Resources and Technologies: University of Ljubljana, Faculty of Arts, 2025. CLARIN.SI data &amp; tools. ISSN 2820-4042. <a href=\"http:\/\/hdl.handle.net\/11356\/2012\" target=\"_blank\" rel=\"noopener\">http:\/\/hdl.handle.net\/11356\/2012<\/a>. [COBISS.SI-ID\u00a0225461251]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">\u00a0MUNDA, Tina, ARHAR HOLDT, \u0160pela, DOBROVOLJC, Kaja, ROZMAN, Tadeja, STRITAR KU\u010cUK, Mojca, KREK, Simon, KRAP\u0160 VODOPIVEC, Irena, STABEJ, Marko, PORI, Eva, GOLI, Teja, LAVRI\u010c, Polona, LASKOWSKI, Cyprian Adam, KOCJAN\u010cI\u010c, Polonca, KLEMENC, Bojan, KRSNIK, Luka, KOSEM, Iztok.\u00a0<i>Frequency lists of syntactic structures from the \u0160olar 3.0 corpus<\/i>. Ljubljana: University of Ljubljana, Centre for Language Resources and Technologies: University of Ljubljana, Faculty of Arts, 2025. CLARIN.SI data &amp; tools. ISSN 2820-4042. <a href=\"http:\/\/hdl.handle.net\/11356\/2009\" target=\"_blank\" rel=\"noopener\">http:\/\/hdl.handle.net\/11356\/2009<\/a>. [COBISS.SI-ID\u00a0225469443]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">MUNDA, Tina, ARHAR HOLDT, \u0160pela, DOBROVOLJC, Kaja, KOSEM, Iztok, PORI, Eva, KREK, Simon.\u00a0<i>Frequency lists of syntactic structures from the U\u010dbeniki 1.0 corpus<\/i>. Ljubljana: University of Ljubljana, Centre for Language Resources and Technologies: University of Ljubljana, Faculty of Arts, 2025. CLARIN.SI data &amp; tools. ISSN 2820-4042. <a href=\"http:\/\/hdl.handle.net\/11356\/2010\" target=\"_blank\" rel=\"noopener\">http:\/\/hdl.handle.net\/11356\/2010<\/a>. [COBISS.SI-ID\u00a0225467395]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">MUNDA, Tina, ARHAR HOLDT, \u0160pela. Na poti k skladenjskim analizam \u0161olskega pisanja: skladenjski vzorci v korpusu \u0160olar 3.0. V: ARHAR HOLDT, \u0160pela (ur.), ERJAVEC, Toma\u017e (ur.). <i><span style=\"font-weight: 400;\">Jezikovne tehnologije in digitalna humanistika: zbornik konference: 19.-20. september 2024, Ljubljana, Slovenija = Language technologies and digital humanities: proceedings of the conference: 19-20 September 2024, Ljubljana, Slovenia<\/span><\/i><span style=\"font-weight: 400;\">. Ljubljana: In\u0161titut za novej\u0161o zgodovino: = Institute of Contemporary History, 2024. Str. 577-588. <\/span><a href=\"https:\/\/zenodo.org\/records\/13912515\"><span style=\"font-weight: 400;\">https:\/\/zenodo.org\/records\/13912515.<\/span><\/a><span style=\"font-weight: 400;\">\u00a0[COBISS.SI-ID <\/span><span style=\"font-weight: 400;\">212016387<\/span><span style=\"font-weight: 400;\">] <a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/06\/POSTER-JT-DH_2024-Na-poti-k-skladenjskim-analizam-solskega-pisanja.pdf\">POSTER PDF<\/a><\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">MUNDA, Tina, ARHAR HOLDT, \u0160pela. First Insights into the Syntax of Slovene Student Writing: A Statistical Analysis of \u0160olar 3.0 vs. U\u010dbeniki 1.0. In\u00a0<i>Proceedings of the Third Workshop on Quantitative Syntax (QUASY, SyntaxFest 2025)<\/i>, str. 105\u2013114, Ljubljana, Slovenia. Association for Computational Linguistics.\u00a0<a href=\"https:\/\/aclanthology.org\/2025.quasy-1.13\/\">https:\/\/aclanthology.org\/2025.quasy-1.13\/<\/a><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">KOSEM, Iztok, PORI, Eva. Prvi koraki do seznama temeljnega \u0161olskega besedi\u0161\u010da. <\/span><i><span style=\"font-weight: 400;\">Sodobna pedagogika<\/span><\/i><span style=\"font-weight: 400;\">, okt. 2025, letn. 76 = 142, \u0161t. 3, str. 9-38, DOI: <\/span><a href=\"https:\/\/dx.doi.org\/10.63384\/sptB53z794s\"><span style=\"font-weight: 400;\">10.63384\/sptB53z794s<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID 259189507]<\/span><\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-6\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-6-container\">Practice-based digitally-supported development of writing skills<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-6-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p><b>Questionnaire survey about teaching practices used to develop writing skills<\/b><\/p>\n<p>We conducted a large-scale study among teachers who guide the production of written texts through language corrections and other forms of feedback. We investigated practices related to the correction of written texts, including: how much time participants devote to correction on a weekly or monthly basis; what types of feedback they provide; the format of texts and corrections (written or digital); which tools and resources they use for correction; and which aspects of their current practices they consider most problematic. The questionnaire was prepared in two language versions\u2014Slovene and English\u2014which will enable comparable studies in other countries. The project collected a total of 1,024 valid responses, including 609 fully completed questionnaires. The results were statistically analyzed, and the appropriately anonymized research data were published in open access. The findings were presented to both the research and teaching communities, and a journal article is in preparation.<\/p>\n<blockquote>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%; padding-left: 10px;\">Validated survey questionnaire in <a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/01\/Validirani-vprasalnik-popravljanje-solskih-besedil-slovenski.pdf\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Slovene<\/span><\/a><span style=\"font-weight: 400;\"> and <\/span><a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/01\/Validated-Questionnaire-Correcting-School-Texts-English.pdf\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">English<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<\/blockquote>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ROT VRHOVEC, Alenka, ARHAR HOLDT, \u0160pela, PI\u017dORN, Karmen, GODEC SOR\u0160AK, Lara. <i><span style=\"font-weight: 400;\">Popravljanje pisnih besedil u\u010dencev\/dijakov: raziskovalni podatki<\/span><\/i><span style=\"font-weight: 400;\">. Ljubljana: Pedago\u0161ka fakulteta: Filozofska fakulteta, 2024.<\/span><a href=\"https:\/\/repozitorij.uni-lj.si\/IzpisGradiva.php?lang=slv&amp;id=153481\">\u00a0<span style=\"font-weight: 400;\">https:\/\/repozitorij.uni-lj.si\/IzpisGradiva.php?lang=slv&amp;id=153481<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID<\/span> <span style=\"font-weight: 400;\">206148867<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ROT VRHOVEC, Alenka. <i><span style=\"font-weight: 400;\">Kako u\u010ditelji popravljajo besedila u\u010dencev\/dijakov?: <\/span><\/i><span style=\"font-weight: 400;\">predavanje na [konferenci] Popravljanje jezika in besedil \u2013 u\u010diteljska povratna informacija v \u0161olski praksi, Fakulteta za upravo, Univerza v Ljubljani, 5. 4. 2023. <a href=\"https:\/\/ebooks.uni-lj.si\/ZalozbaUL\/catalog\/view\/500\/832\/9274\">https:\/\/ebooks.uni-lj.si\/ZalozbaUL\/catalog\/view\/500\/832\/9274<\/a>. [COBISS.SI-ID <\/span><span style=\"font-weight: 400;\">149745667<\/span><span style=\"font-weight: 400;\">] <\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">GODEC SOR\u0160AK, Lara, ROT VRHOVEC, Alenka. Popravljanje pisnih besedil u\u010dencev in dijakov pri razli\u010dnih predmetih: vpogled v rezultate ankete u\u010diteljev. <i>Sodobna pedagogika, <\/i>okt. 2025, letn. 76, \u0161t. 3, str. 59\u201385, DOI: <a href=\"https:\/\/dx.doi.org\/10.63384\/sptB53z792s\">10.63384\/sptB53z792s<\/a>. [COBISS.SI-ID 257058819]<\/span><\/p>\n<p><b>Recording teacher corrections of student writing and semi-structured interviews<\/b><\/p>\n<p>This study focused on the use of existing digital tools for providing feedback to students. We recruited 18 Slovene language teachers from different types of schools (6 from primary schools, 6 from vocational and technical schools, and 6 from gymnasiums). As part of the study, they corrected two pre-selected authentic student texts, during which we recorded their screen, minimal work environment (face), and think-aloud commentary. This independent work with texts was followed by interviews, where we explored the characteristics of their work, the capabilities and limitations of correction tools, and the participants\u2019 wishes regarding additional functionalities. The results were transcribed, and the appropriately anonymized data were published in open access. An article for a scientific journal is also in preparation, presenting the results together with evaluations of automated error correction conducted as part of the work package <em data-start=\"177\" data-end=\"273\">Development and evaluation of automatic identification and categorization of language problems<\/em> (see below).<\/p>\n<blockquote>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%; padding-left: 10px;\">Teacher interview questionnaire in <a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/06\/Vprasalnik-za-ucitelje-1-Popravljanje-solske-pisne-produkcije.pdf\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Slovene<\/span><\/a><span style=\"font-weight: 400;\"> and <\/span><a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/06\/Teacher-Questionnaire-1-Teacher-Corrections-of-Student-Writing.pdf\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">English<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<\/blockquote>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ARHAR HOLDT, \u0160pela, MUNDA, Tina.\u00a0<i>U\u010diteljsko popravljanje \u0161olskih besedil v digitalnem okolju: intervjuji z u\u010ditelji slovenskih O\u0160 in S\u0160<\/i>. Ljubljana: Zaklju\u010dena znanstvena zbirka raziskovalnih podatkov. 2025.\u00a0<a href=\"https:\/\/repozitorij.uni-lj.si\/IzpisGradiva.php?lang=slv&amp;id=169549\">https:\/\/repozitorij.uni-lj.si\/IzpisGradiva.php?lang=slv&amp;id=169549<\/a>. <\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ARHAR HOLDT, \u0160pela, MUNDA, Tina. Jezikovno popravljanje v digitalnem okolju: kvalitativna \u0161tudija z u\u010diteljicami in u\u010ditelji sloven\u0161\u010dine. <i>Sodobna pedagogika,<\/i> okt. 2025, letn. 76 = 142, \u0161t. 3, str. 86-106, DOI:\u00a0<a href=\"https:\/\/dx.doi.org\/10.63384\/sptB53s791s\" target=\"_blank\" rel=\"noopener\">10.63384\/sptB53s791s<\/a>. [COBISS.SI-ID\u00a0259204611]<\/span><\/p>\n<p><b>Designing a strategy for digitally-supported development of writing skills<\/b><\/p>\n<p data-start=\"456\" data-end=\"899\">An increasingly important part of contemporary language use takes place in digital environments. However, the integration of digital media into teaching must be age-appropriate, inclusive for all groups of learners, and effective in order to avoid unnecessary screen time. During the course of the project, generative artificial intelligence tools for text co-creation also became easily accessible, raising numerous new questions.\u00a0The strategy we developed\u2014in dialogue with the guiding principles for the renewal of Slovene language curricula in primary and secondary schools\u2014emphasizes the need for the school system to be prepared for the impact of generative AI. It also underscores the importance of open educational materials and thoughtfully designed, problem-oriented digital solutions.<\/p>\n<p data-start=\"1277\" data-end=\"1640\">Teachers must be empowered not only to use but also to co-create digital linguistic resources, tools, and technologies. Shifts in communication practices\u2014such as increasing informality and anonymity\u2014call for new understandings and approaches to language education, including the development of stylistic awareness and the ability to critically evaluate texts.\u00a0We also highlight the need for a thorough reform of Slovene teacher education, so that in the future, teachers can implement new approaches to teaching and assessment more effectively and inclusively.<\/p>\n<p data-start=\"1277\" data-end=\"1640\">The strategy was published in a thematic issue of the scientific journal <em data-start=\"1921\" data-end=\"1940\">Jezik in slovstvo<\/em>, dedicated to curriculum reform. Selected topics were also presented in a panel discussion marking the publication of the thematic issue, and at the professional conference \u201cCorrecting Language and Texts \u2013 Teacher Feedback in School Practice.\u201d<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, FERBE\u017dAR, Ina, KALIN GOLOB, Monika, KREK, Simon, PAVLE, Andreja, ROZMAN, Tadeja, STABEJ, Marko. Nova sloven\u0161\u010dina.\u00a0<i>Jezik in slovstvo<\/i>. [Tiskana izd.]. 2024, letn. 69, \u0161t. 3, str. 117-138. DOI:\u00a0<a href=\"https:\/\/dx.doi.org\/10.4312\/jis.69.3.117-138\" target=\"_blank\" rel=\"noopener\">10.4312\/jis.69.3.117-138<\/a>. [COBISS.SI-ID\u00a0210323971]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">\u017dBOGAR, Alenka, AHA\u010cI\u010c, Kozma, HARAMIJA, Dragica, MIKOLI\u010c, Vesna, STABEJ, Marko, TIVADAR, Hotimir. <i>Sloven\u0161\u010dina v \u0161oli: izzivi in prilo\u017enosti sloven\u0161\u010dine kot materin\u0161\u010dine na primarni in sekundarni stopnji vzgoje in izobra\u017eevanja:<\/i> okrogla miza Oddelka za slovenistiko Filozofske fakultete ob izidu tematske \u0161tevilke revije Jezik in slovstvo (69\/3, 2024), posve\u010dene prenovi u\u010dnih na\u010drtov za sloven\u0161\u010dino v osnovnih in srednjih \u0161olah v okviru Tedna Univerze v Ljubljani, Filozofska fakulteta, Ljubljana, 3. 12. 2024. [COBISS.SI-ID\u00a0219169539]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">STABEJ, Marko. Kdo ali kaj naj koga ali kaj, kako in zakaj?: (prosti spis o popravljanju in povratni informaciji). V: PORI, Eva (ur.), ARHAR HOLDT, \u0160pela (ur.). <i>Popravljanje jezika in besedil &#8211; u\u010diteljska povratna informacija v \u0161olski praksi: zbornik konference<\/i>. Ljubljana: Zalo\u017eba Univerze, 2023. Str. 22-24. <a href=\"https:\/\/ebooks.uni-lj.si\/ZalozbaUL\/catalog\/view\/500\/832\/9312\" target=\"_blank\" rel=\"noopener\">https:\/\/ebooks.uni-lj.si\/ZalozbaUL\/catalog\/view\/500\/832\/9312<\/a>. [COBISS.SI-ID\u00a0193563139]<\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-7\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-7-container\">Development and evaluation of automatic identification and categorisation of language problems<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-7-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p><b>Designing a model for automatic error annotation<\/b><\/p>\n<p class=\"\" data-start=\"0\" data-end=\"696\">We explored the potential of new methodologies for the automatic correction of Slovene texts. The machine learning models are based on large pre-trained language models, which we adapted to the task of correcting student writing using selected authentic and synthetically prepared datasets. We first examined the applicability of the language models multilingual <em>BERT, CroSloEngual BERT,<\/em> and <em>SloBERTa<\/em>. For the task of machine question answering, we tested several pre-trained encoder-decoder models of the <em>T5<\/em> type. T5-type models were also tested for the automatic generation of the correct form of misspelt words, and we investigated the effectiveness of procedures for machine-generated explanations. Using an optimized neural methodology, we addressed spelling, orthographic, morphological, and syntactic errors, achieving particularly strong results for the first two categories. We then developed <em>SloNSpell<\/em>, a neural spellchecker for Slovene that currently delivers the best results for the language. Its key advantage is the ability to detect not only traditional spelling errors but also instances where a misspelling results in a legitimate word form.<\/p>\n<p data-start=\"1455\" data-end=\"1550\">The various stages of model development were documented in conference and journal publications.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">UL\u010cAR, Matej, ROBNIK \u0160IKONJA, Marko. Sequence-to-sequence pretraining for a less-resourced Slovenian language.\u00a0<i>Frontiers in artificial intelligence<\/i>. Mar. 2023, vol. 6, str. 1-13, DOI:\u00a0<a class=\"visited\" href=\"https:\/\/dx.doi.org\/10.3389\/frai.2023.932519\" target=\"_blank\" rel=\"noopener\">10.3389\/frai.2023.932519<\/a>. [COBISS.SI-ID\u00a0147683587]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KMECL, Tim, ROBNIK \u0160IKONJA, Marko. Logi\u010dno sklepanje v naravnem jeziku za sloven\u0161\u010dino.\u00a0<i>Sloven\u0161\u010dina 2.0: empiri\u010dne, aplikativne in interdisciplinarne raziskave<\/i>. 2024, letn. 12, \u0161t. 1, str. 1-53,\u00a0DOI:\u00a0<a class=\"visited\" href=\"https:\/\/dx.doi.org\/10.4312\/slo2.0.2024.1.1-53\" target=\"_blank\" rel=\"noopener\">10.4312\/slo2.0.2024.1.1-53<\/a>. [COBISS.SI-ID\u00a0206551299]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">LOGAR, Katja, ROBNIK \u0160IKONJA, Marko. Unified question answering in Slovene. V: LU\u0160TREK, Mitja (ur.), GAMS, Matja\u017e (ur.), PILTAVER, Rok (ur.).\u00a0<i>Slovenska konferenca o umetni inteligenci = Slovenian Conference on Artificial Intelligence: Informacijska dru\u017eba &#8211; IS 2022 = Information Society &#8211; IS 2022: zbornik 25. mednarodne multikonference = proceedings of the 25th international multiconference: zvezek A = volume A: 11. oktober 2022, 11 October 2022, Ljubljana, Slovenija<\/i>. Ljubljana: Institut &#8220;Jo\u017eef Stefan&#8221;, 2022. Str. 23-26, <a href=\"https:\/\/doi.org\/10.48550\/arXiv.2211.09159\">https:\/\/doi.org\/10.48550\/arXiv.2211.09159<\/a>. [COBISS.SI-ID\u00a0129718275]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KLEMEN, Matej, BO\u017dI\u010c, Martin, ARHAR HOLDT, \u0160pela, ROBNIK \u0160IKONJA, Marko. Neural spell-checker: beyond words with synthetic data generation. V: N\u00d6TH, Elmar (ur.), HOR\u00c1K, Ale\u0161 (ur.), SOJKA, Petr (ur.). <i><span style=\"font-weight: 400;\">Text, speech, and dialogue. Part 1: 27th International Conference, TSD 2024, Brno, Czech Republic, September 9\u201313, 2024: proceedings<\/span><\/i><span style=\"font-weight: 400;\">. Cham: Springer, cop. 2024. Str. 85-96. Lecture notes in computer science, SL7, Lecture notes in artificial intelligence,<\/span><span style=\"font-weight: 400;\">\u00a0DOI: <\/span><a href=\"https:\/\/dx.doi.org\/10.1007\/978-3-031-70563-2_7\"><span style=\"font-weight: 400;\">10.1007\/978-3-031-70563-2_7, <\/span><\/a><span style=\"font-weight: 400;\">dostopno na <a href=\"https:\/\/doi.org\/10.48550\/arXiv.2410.23514\">https:\/\/doi.org\/10.48550\/arXiv.2410.23514<\/a>. [COBISS.SI-ID 213519107] <\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">PETRI\u010c, Timotej, ARHAR HOLDT, \u0160pela, ROBNIK \u0160IKONJA, Marko. Pomembnost realisti\u010dne evalvacije: primer popravkov sklona in \u0161tevila v sloven\u0161\u010dini z velikim jezikovnim modelom. Sloven\u0161\u010dina 2.0: empiri\u010dne, aplikativne in interdisciplinarne raziskave. 2024, letn. 12, \u0161t. 1, str. 106-130, DOI: <a href=\"https:\/\/doi.org\/10.4312\/slo2.0.2024.1.106-130\">https:\/\/doi.org\/10.4312\/slo2.0.2024.1.106-130<\/a>. [COBISS.SI-ID 227633411] <\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">KLEMEN, Matej, BO\u017dI\u010c, Martin, ARHAR HOLDT, \u0160pela, ROBNIK \u0160IKONJA, Marko. Grammatical error correction of Slovenian school essays using large language models. <i>Sodobna pedagogika,<\/i> okt. 2025, letn. 76, \u0161t. 3, str. 162\u2013176, DOI: <a href=\"https:\/\/dx.doi.org\/10.63384\/sptB53z793a\">10.63384\/sptB53z793a<\/a>. [COBISS.SI-ID 259208195]\u00a0 <\/span><\/p>\n<p><span style=\"font-weight: 400;\">We published the modules for different linguistic levels under an open license on the <em>HuggingFace<\/em> platform, enabling their further use and development.<\/span><\/p>\n<blockquote>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%;\"><a href=\"https:\/\/huggingface.co\/cjvt\/SloBERTa-slo-word-spelling-annotator\">Modul that identifies orthographic and spelling errors<\/a>.<\/p>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%;\"><a href=\"https:\/\/huggingface.co\/cjvt\/t5-slo-word-spelling-corrector\">Modul that corrects orthographic and spelling errors<\/a>.<\/p>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%;\"><a href=\"https:\/\/huggingface.co\/cjvt\/t5-slo-word-form-corrector\">Modul that corrects morphosyntactic erros.<\/a><\/p>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%;\"><a href=\"https:\/\/huggingface.co\/cjvt\/t5-slo-word-order-corrector\">Modul that corrects word order errors<\/a>.<\/p>\n<\/blockquote>\n<p><b>Testing the applicability of the methodology to other languages<\/b><\/p>\n<p data-start=\"276\" data-end=\"741\">Following the development of large language models and technologies such as ChatGPT, it became evident that there is a critical lack of well-designed and high-quality evaluation datasets that would enable reliable assessment of neural approaches for various tasks, including grammatical error correction (GEC). This is particularly true for Slovene and other low-resource languages, where the amount of available text and structured linguistic resources is limited.\u00a0At the same time, it has been shown that cross-lingual approaches work well for less-resourced languages, as large language models transfer (linguistic) knowledge across all languages they have been trained on. For this reason, we joined the MultiGEC-2025 shared task, under which uniformly designed evaluation datasets for grammatical error correction were developed for 12 languages.\u00a0Developers participating in the task competed in automatic grammatical correction across all included languages\u2014including Slovene\u2014and reported on the performance of various approaches. The dataset is publicly available for further use and will be integrated into future activities. Our participation in the shared task was presented in a technical report and a scientific article.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">MASCIOLINI, Arianna, ARHAR HOLDT, \u0160pela, \u017dAGAR, Ale\u0161, et al. An overview of grammatical error correction for the twelve MultiGEC-2025 languages. Go\u0308teborg: Faculty of Humanities, Department of Swedish, Multilingualism, Language Technology, 2025. GU-ISS Forskningsrapporter fr\u00e5n Institutionen f\u00f6r svenska, flerspr\u00e5kighet och spr\u00e5kteknologi (2011-), ISSN 1401-5919, <a href=\"https:\/\/gupea.ub.gu.se\/bitstream\/handle\/2077\/84800\/2025_MultiGEC_GEC_overview.pdf?sequence=1&amp;isAllowed=y\">https:\/\/gupea.ub.gu.se\/bitstream\/handle\/2077\/84800\/2025_MultiGEC_GEC_overview.pdf?sequence=1&amp;isAllowed=y<\/a>. [COBISS.SI-ID 232510723]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">MASCIOLINI, Arianna, CAINES, Andrew, ARHAR HOLDT, \u0160pela, \u017dAGAR, Ale\u0161, et al. Towards better language representation in natural language processing: a multilingual dataset for text-level grammatical error correction. <em>International journal of learner corpus research,<\/em> 2025, vol. 11, iss. 2, pp. 309-335, ISSN 2215-1478, <a href=\"https:\/\/doi.org\/10.1075\/ijlcr.24033.mas\">DOI: 10.1075\/ijlcr.24033.mas<\/a>. [COBISS.SI-ID 234594051]<\/p>\n<p>In cooperation with other research projects, several articles were produced focusing on the development of language resources, the evaluation of large language models for less-resourced languages, the creation of specialized tools for processing Slovene texts, and the testing of cross-lingual approaches for various NLP tasks. These studies are indirectly connected to the development of grammar correction methods for Slovene, as they establish methodological and data-related frameworks essential for the effective use of large language models in processing and generating Slovene texts.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">YADAV, Anjali, GARG, Tanya, KLEMEN, Matej, UL\u010cAR, Matej, AGARWAL, Basant, ROBNIK \u0160IKONJA, Marko. From translation to generative LLMs: classification of code-mixed affective tasks.\u00a0<em>IEEE transactions on affective computing<\/em>. 2025, vol. 12, str. pp. 2090-2101. DOI:\u00a0<a href=\"https:\/\/www.computer.org\/csdl\/journal\/ta\/2025\/03\/10938193\/25mYwuU9XDG\">10.1109\/TAFFC.2025.3553399<\/a>. [COBISS.SI-ID 232748291]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">UL\u010cAR, Matej, \u017dAGAR, Ale\u0161, ARMENDARIZ, Carlos S., REPAR, Andra\u017e, POLLAK, Senja, PURVER, Matthew, ROBNIK \u0160IKONJA, Marko. Mono- and cross-lingual evaluation of representation language models on less-resourced languages.<em>\u00a0Computer speech &amp; language<\/em>. Jan. 2026, vol. 95, [article no.] 101852, 1-29, DiRROS \u2013 Digitalni repozitorij raziskovalnih organizacij Slovenije,\u00a0<a href=\"https:\/\/doi.org\/10.1016\/j.csl.2025.101852\">https:\/\/doi.org\/10.1016\/j.csl.2025.101852<\/a>. [COBISS.SI-ID 241622275]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">\u017dAGAR, Ale\u0161, KLEMEN, Matej, KOSEM, Iztok, ROBNIK \u0160IKONJA, Marko. SENTA: sentence simplification system for Slovene. V: CALZOLARI, Nicoletta (ur.).\u00a0<em>The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)<\/em>: main conference proceedings: 20-25 May, 2024, Torino, Italia. [Paris]: ELRA Language Resources Association (ELRA); [Stroudsburg]: International Committee on Computational Linguistics, cop. 2024. Str. 14687-14692,\u00a0<a href=\"https:\/\/aclanthology.org\/2024.lrec-main.1279.pdf\">https:\/\/aclanthology.org\/2024.lrec-main.1279.pdf<\/a>. [COBISS.SI-ID 197916675]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">MIOK, Kristian, HIDALGO TENORIO, Encarnaci\u00f3n, OSENOVA, Petja, BEN\u00cdTEZ-CASTRO, Miguel-\u00c1ngel, ROBNIK \u0160IKONJA, Marko. Multi-aspect multilingual and cross-lingual parliamentary speech analysis.\u00a0<em>Intelligent data analysis.<\/em>\u00a0[Print ed.]. Feb. 2024, vol. 28, no. 1, str. 239-260,\u00a0<a href=\"https:\/\/doi.org\/10.3233\/IDA-227347\">https:\/\/doi.org\/10.3233\/IDA-227347<\/a>. [COBISS.SI-ID 178091523]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KLEMEN, Matej, \u017dAGAR, Ale\u0161, \u010cIBEJ, Jaka, ROBNIK \u0160IKONJA, Marko. SI-NLI: a Slovene natural language inference dataset and its evaluation. V: CALZOLARI, Nicoletta (ur.).<em>\u00a0The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)<\/em>: main conference proceedings: 20-25 May, 2024, Torino, Italia. [Paris]: ELRA Language Resources Association (ELRA); [Stroudsburg]: International Committee on Computational Linguistics, cop. 2024. Str. 14859-14870,\u00a0<a href=\"https:\/\/aclanthology.org\/2024.lrec-main.1294.pdf\">https:\/\/aclanthology.org\/2024.lrec-main.1294.pdf<\/a>. [COBISS.SI-ID 197916931]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">\u0110OKOVI\u0106, Lazar, ROBNIK \u0160IKONJA, Marko. Sarcasm detection in a less-resourced language. V: LU\u0160TREK, Mitja (ur.), GAMS, Matja\u017e (ur.), PILTAVER, Rok (ur.).\u00a0<em>Slovenian Conference on Artificial Intelligence. Vol. A : proceedings of the 27th International Multiconference Information Society \u2013 IS 2024 : 10\u201311 October 2024, Ljubljana, Slovenia<\/em>. Ljubljana: Institut \u201cJo\u017eef Stefan\u201d, 2024. Str. 19-22. Informacijska dru\u017eba.\u00a0<a href=\"https:\/\/is.ijs.si\/wp-content\/uploads\/2024\/11\/IS2024_Volume-A.pdf\">https:\/\/is.ijs.si\/wp-content\/uploads\/2024\/11\/IS2024_Volume-A.pdf<\/a>. [COBISS.SI-ID 216268291]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">\u017dAGAR, Ale\u0161, ROBNIK \u0160IKONJA, Marko. One model to rule them all: ranking Slovene summarizers. V: EK\u0160TEIN, Kamil (ur.), P\u00c1RTL, Franti\u0161ek (ur.), KONOP\u00cdK, Miloslav (ur.).\u00a0<em>Text, speech, and dialogue: 26th International Conference, TSD 2023, Pilsen, Czech Republic, September 4\u20136, 2023 : proceedings.<\/em>\u00a0Cham: Springer, cop. 2023. Str. 15-24. Lecture notes in computer science (Internet), Lecture notes in artificial intelligence, 14102.\u00a0<a href=\"https:\/\/link.springer.com\/chapter\/10.1007\/978-3-031-40498-6_2\">https:\/\/link.springer.com\/chapter\/10.1007\/978-3-031-40498-6_2<\/a>. [COBISS.SI-ID 165084419]<\/p>\n<p><b>Linguistic and teacher evaluation of automatic annotation of texts<\/b><\/p>\n<p data-start=\"288\" data-end=\"681\">We developed a reference dataset for both quantitative and qualitative evaluation of automatic grammatical error correction. The dataset is based on the <em>\u0160olar 3.0<\/em> corpus, but instead of using teacher corrections\u2014which are often adapted to the learner&#8217;s developmental level and can vary in form\u2014it contains consistently and systematically annotated corrections.\u00a0The new dataset, named <em>\u0160olar-Eval 1.0,<\/em> includes 109 texts produced in Slovene primary and secondary schools. The texts were linguistically analyzed, and 9,808 language issues were manually annotated across multiple linguistic levels. The dataset has been published under an open license in the CLARIN.SI repository and described in a peer-reviewed article. <em>\u0160olar-Eval 1.0<\/em> was used for both machine and linguistic evaluation of the models developed within the project, with results reported on the <em>HuggingFace<\/em> platform and in the articles listed above.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ARHAR HOLDT, \u0160pela, GANTAR, Polona, BON, Mija, GAPSA, Magdalena, LAVRI\u010c, Polona, KLEMEN, Matej.\u00a0<i><span style=\"font-weight: 400;\">Dataset for evaluation of Slovene spell- and grammar-checking tools \u0160olar-Eval 1.0<\/span><\/i><span style=\"font-weight: 400;\">. Slovenian language resource repository CLARIN.SI, ISSN 2820-4042,\u00a0<a href=\"http:\/\/hdl.handle.net\/11356\/1902\">http:\/\/hdl.handle.net\/11356\/1902<\/a><\/span><span style=\"font-weight: 400;\">. [COBISS.SI-ID <\/span><span style=\"font-weight: 400;\">185626115<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">GANTAR, Polona, BON, Mija, GAPSA, Magdalena, ARHAR HOLDT, \u0160pela. \u0160olar-Eval: evalvacijska mno\u017eica za strojno popravljanje jezikovnih napak v slovenskih besedilih. <i><span style=\"font-weight: 400;\">Jezik in slovstvo<\/span><\/i><span style=\"font-weight: 400;\">. [Tiskana izd.]. 2023, letn. 68, \u0161t. 4, pp. 89-108,<\/span><span style=\"font-weight: 400;\">\u00a0DOI:\u00a0<\/span><a href=\"https:\/\/dx.doi.org\/10.4312\/jis.68.4.89-108\"><span style=\"font-weight: 400;\">10.4312\/jis.68.4.89-108<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">187559683<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p data-start=\"0\" data-end=\"343\">We conducted the teacher evaluation with the same team that participated in the work package <em data-start=\"93\" data-end=\"159\">Practice-based digitally-supported development of writing skills<\/em>: 18 teachers of Slovene from different types of schools. In the study, participants examined two authentically produced school texts that had been automatically corrected by the system and presented in a simple interface. During the evaluation, we recorded their screen activity, a minimal view of their working environment (face), and their think-aloud commentary. The independent evaluation of the automatic corrections was followed by interviews focusing on the capabilities and limitations of the developed correction models, as well as participants\u2019 wishes regarding additional functionalities. The results of the study were transcribed and, after appropriate anonymization, published in the RUL repository. The findings, including the evaluation report and specifications for further tool development, were published in a scientific journal. In addition, the tool for automatic comma placement in Slovene was evaluated separately with a group of students, and the results were presented at a conference.<\/p>\n<blockquote>\n<p style=\"text-align: justified; font-size: 16px; line-height: 100%; padding-left: 10px;\">Teacher evaluation questionnaire in <a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/06\/Vprasalnik-za-ucitelje-2-Uciteljska-evalvacija-strojnega-oznacevanja-besedil.pdf\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Slovene<\/span><\/a><span style=\"font-weight: 400;\"> and <\/span><a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/06\/Teacher-Questionnaire-2-Teacher-Evaluation-of-Automatic-Text-Annotation.pdf\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">English<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<\/blockquote>\n<p>We evaluated the comma correction tool <em>CJVT Vejice <\/em>among students and presented the results at a conference.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ARHAR HOLDT, \u0160pela, MUNDA, Tina. <\/span><i><span style=\"font-weight: 400;\">U\u010diteljsko popravljanje \u0161olskih besedil v digitalnem okolju: intervjuji z u\u010ditelji slovenskih O\u0160 in S\u0160<\/span><\/i><span style=\"font-weight: 400;\">. Ljubljana: Zaklju\u010dena znanstvena zbirka raziskovalnih podatkov. 2025.<\/span><a href=\"https:\/\/repozitorij.uni-lj.si\/IzpisGradiva.php?lang=slv&amp;id=169549\"><span style=\"font-weight: 400;\"> https:\/\/repozitorij.uni-lj.si\/IzpisGradiva.php?lang=slv&amp;id=169549<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID <\/span><a href=\"https:\/\/plus-legacy.cobiss.net\/cobiss\/si\/sl\/bib\/248986115\"><span style=\"font-weight: 400;\">248986115<\/span><\/a><span style=\"font-weight: 400;\">] <\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ARHAR HOLDT, \u0160pela, MUNDA, Tina. Jezikovno popravljanje v digitalnem okolju: kvalitativna \u0161tudija z u\u010diteljicami in u\u010ditelji sloven\u0161\u010dine. <i>Sodobna pedagogika,<\/i> okt. 2025, letn. 76 = 142, \u0161t. 3, str. 86-106, DOI: <a href=\"https:\/\/dx.doi.org\/10.63384\/sptB53s791s\">10.63384\/sptB53s791s<\/a>. [COBISS.SI-ID 259204611]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">GODEC SOR\u0160AK, Lara. Raba vejice v pisnih besedilih \u0161tudentov in uporabnost spletnega orodja Vejice 1.0. V: \u0160TUMBERGER, Sa\u0161ka (ur.). <i><span style=\"font-weight: 400;\">Predpis in norma v jeziku<\/span><\/i><span style=\"font-weight: 400;\">. Ljubljana: Zalo\u017eba Univerze, 2024. Str. 103-111. Zbirka Obdobja, 43.<\/span><span style=\"font-weight: 400;\">\u00a0DOI: <\/span><a href=\"https:\/\/dx.doi.org\/10.4312\/Obdobja.43.103-111\"><span style=\"font-weight: 400;\">10.4312\/Obdobja.43.103-111<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID <\/span><span style=\"font-weight: 400;\">215559939<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-8\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-8-container\">Providing feedback in digital environment<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-8-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p><b>Developing a model combining formative assessment and crowdsourcing<\/b><\/p>\n<p data-start=\"327\" data-end=\"817\">In this project activity, we reviewed literature and tools related to digital monitoring of writing competence, with a particular focus on the effects of automated written corrective feedback provided by digital writing support tools. A systematic review of 22 studies showed that these tools are especially beneficial due to their fast, accurate, and varied feedback formats, with hybrid approaches\u2014combining automated tools with teacher support\u2014proving most effective.\u00a0We also addressed the challenges faced by students with specific learning difficulties in higher education, highlighting the importance of digitally supported learning and UDL\u00a0to enhance accessibility, equity, and flexibility in pedagogical approaches.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">PI\u017dORN, Karmen, LEMUT BAJEC, Melita. Systematic review of digital writing assistants in EFL writing instruction. <i>Sodobna pedagogika<\/i>, okt. 2025, letn. 76, \u0161t. 3, str. 141\u2013161, DOI: <a href=\"https:\/\/dx.doi.org\/10.63384\/sptB5_z796a\">10.63384\/sptB5_z796a<\/a>. [COBISS.SI-ID 256918531]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">KO\u0160AK BABUDER, Milena, POREDO\u0160, Mojca, PI\u017dORN, Karmen. Digitalno podprto u\u010denje \u0161tudentov s specifi\u010dnimi u\u010dnimi te\u017eavami v visoko\u0161olskem izobra\u017eevanju. <i>Sodobna pedagogika<\/i>, okt. 2025, letn. 76, \u0161t. 3, str. 107\u2013125, 177-197, DOI: <a href=\"https:\/\/dx.doi.org\/10.63384\/sptB53s795as\">10.63384\/sptB53s795as<\/a>. [COBISS.SI-ID 256927747]<\/span><\/p>\n<p data-start=\"1616\" data-end=\"1990\">We explored new solutions in digital collaborative practices between teachers and learners. As a model for linking formative assessment with crowdsourcing, we implemented a gamified crowdsourcing approach for reviewing and validating pedagogically appropriate corpus examples, which serve as the foundation for learning materials, exercises, and assessments.\u00a0The aim is to save time through collaborative content creation and to develop a large, openly accessible, and carefully reviewed collection of language examples. The game, named <em>CrowLL<\/em> (<em>Crowdsourcing for Language Learning<\/em>), supports Slovene and several other languages and was presented at conferences and in a peer-reviewed journal article.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ZINGANO KUHN, Tanara, ARHAR HOLDT, \u0160pela, KOSEM, Iztok, TIBERIUS, Carole, KOPPEL, Kristina, ZVIEL-GIRSHIN, Rina. Data preparation in crowdsourcing for pedagogical purposes : the case of the CrowLL game.\u00a0<i><span style=\"font-weight: 400;\">Sloven\u0161\u010dina 2.0: empiri\u010dne, aplikativne in interdisciplinarne raziskave<\/span><\/i><span style=\"font-weight: 400;\">. 2022, letn. 10, \u0161t. 2, str. 62-100, <\/span><span style=\"font-weight: 400;\">DOI:\u00a0<\/span><a href=\"https:\/\/dx.doi.org\/10.4312\/slo2.0.2022.2.62-100\"><span style=\"font-weight: 400;\">10.4312\/slo2.0.2022.2.62-100<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">146362883<\/span><span style=\"font-weight: 400;\">] <\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ZINGANO KUHN, Tanara, TIBERIUS, Carole, ARHAR HOLDT, \u0160pela, KOPPEL, Kristina, KOSEM, Iztok, ZVIEL-GIRSHIN, Rina, LU\u00cdS, Ana R. Developing manually annotated corpora for teaching and learning purposes of Brazilian Portuguese, Dutch, Estonian, and Slovene (the CrowLL Project). V: LIND\u00c9N, Krister (ur.), NIEMI, Jyrki (ur.), KONTINO, Thalassia (ur.).\u00a0<i><span style=\"font-weight: 400;\">CLARIN annual conference proceedings 2023: 16 \u2013 18 October 2023 Leuven, Belgium<\/span><\/i><span style=\"font-weight: 400;\">. [S. l.: s. n.], 2023. Str. 173-177. CLARIN Annual Conference Proceedings. <\/span><a href=\"https:\/\/office.clarin.eu\/v\/CE-2023-2328_CLARIN2023_ConferenceProceedings.pdf\"><span style=\"font-weight: 400;\">https:\/\/office.clarin.eu\/v\/CE-2023-2328_CLARIN2023_ConferenceProceedings.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">200002819<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">ZINGANO KUHN, Tanara, KOPPEL, Kristina, ARHAR HOLDT, \u0160pela, TIBERIUS, Carole, ZVIEL-GIRSHIN, Rina, KOSEM, Iztok. Annotating corpora for language learning and lexicography with the Crowdsourcing for Language Learning (CrowLL) game. V: MEDVE\u010e, Marek (ur.), et al. <i><span style=\"font-weight: 400;\">eLex 2023 : electronic lexicography in the 21st century (eLex 2023): invisible lexicography: <\/span><\/i><i><span style=\"font-weight: 400;\">book of abstracts<\/span><\/i><i><span style=\"font-weight: 400;\"> : Brno, 27\u201329 June 2023<\/span><\/i><span style=\"font-weight: 400;\">. Brno: Lexical Computing CZ, 2023. Str. 13-14.\u00a0<\/span><a href=\"https:\/\/elex.link\/elex2023\/wp-content\/uploads\/elex2023_book_of_abstracts.pdf\"><span style=\"font-weight: 400;\">https:\/\/elex.link\/elex2023\/wp-content\/uploads\/elex2023_book_of_abstracts.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">184965379<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p data-start=\"3757\" data-end=\"4219\">We also investigated the attitudes of the teacher community toward crowdsourcing, using the <em>Thesaurus of Modern Slovene<\/em>\u2014the first Slovene thesaurus to include user-contributed synonym candidates\u2014as a case study. Although the thesaurus has strong potential for use in education, no prior studies had examined how dictionary users\u2014especially Slovene teachers\u2014evaluate user participation compared to the lexicographers who designed the resource.\u00a0Our results show that teachers consider user-contributed synonyms to be both relevant and useful. At the same time, the findings emphasize the importance of involving users not only as data contributors but also as evaluators in the development of language resources.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">GAPSA, Magdalena, ARHAR HOLDT, \u0160pela. How lexicographers evaluate user contributions in the Thesaurus of Modern Slovene in comparison to dictionary users. V: MEDVE\u010e, Marek (ur.), et al.\u00a0<i>eLex 2023: electronic lexicography in the 21st century (eLex 2023): proceedings of the eLex 2023 conference : [Brno], 27\u201329 June 2023<\/i>. Brno: Lexical Computing CZ, 2023. Str. 178-200. Electronic lexicography in the 21st century. Proceedings of eLex &#8230; conference. <a href=\"https:\/\/elex.link\/elex2023\/wp-content\/uploads\/47.pdf\" target=\"_blank\" rel=\"noopener\">https:\/\/elex.link\/elex2023\/wp-content\/uploads\/47.pdf<\/a>. [COBISS.SI-ID\u00a0162928387]<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">GAPSA, Magdalena (2024). U\u010diteljske ocene uporabni\u0161ko dodanih sopomenk v Slovarju sopomenk sodobne sloven\u0161\u010dine. <i>Jezik in Slovstvo<\/i>,\u00a0<i>69<\/i>(4), 35-50. <a href=\"https:\/\/doi.org\/10.4312\/jis.69.4.35-50\">https:\/\/doi.org\/10.4312\/jis.69.4.35-50<\/a><\/p>\n<p><b>Corpus-based and scaffolded feedback scenarios<\/b><\/p>\n<p data-start=\"398\" data-end=\"890\">Text corpora and responsive lexical resources provide data on real-life contemporary language use across broader contexts and various communicative situations, making them a fundamental reference for literacy education. In the project, we analysed the challenges and limitations of corpus-based responsive dictionaries\u2014specifically, the <em data-start=\"743\" data-end=\"772\">Thesaurus of Modern Slovene<\/em> and the <em data-start=\"781\" data-end=\"824\">Collocations Dictionary of Modern Slovene<\/em>\u2014for use in educational settings, and introduced key improvements.\u00a0For the <em data-start=\"900\" data-end=\"911\">\u0160olar 3.0<\/em> corpus, which serves as the basis for corpus-based study of language problems and corrections in student writing, we developed a <a href=\"https:\/\/viri.cjvt.si\/solar\/en\/\">powerful and specialized concordancer interface<\/a>. This significantly improved access to corpus data for a broader audience\u2014for example, teachers preparing teaching materials and exercises, as well as students training to become language educators. The new concordancer allows for targeted searches of specific linguistic corrections across various language levels, and supports examination of how teacher feedback differs across educational stages, school types, and Slovene regions. These innovations were presented at academic conferences, and in a journal article.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">KOSEM, Iztok, ARHAR HOLDT, \u0160pela, GANTAR, Polona, KREK, Simon. Collocations Dictionary of Modern Slovene 2.0. V: MEDVE\u010e, Marek (ur.), et al.\u00a0<\/span><i><span style=\"font-weight: 400;\">eLex 2023 : electronic lexicography in the 21st century (eLex 2023) : proceedings of the eLex 2023 conference : [Brno], 27\u201329 June 2023<\/span><\/i><span style=\"font-weight: 400;\">. Brno: Lexical Computing CZ, 2023. Str. 491-507, ilustr. Electronic lexicography in the 21st century. Proceedings of eLex &#8230; conference. ISSN 2533-5626.\u00a0<\/span><a href=\"https:\/\/elex.link\/elex2023\/wp-content\/uploads\/100.pdf\"><span style=\"font-weight: 400;\">https:\/\/elex.link\/elex2023\/wp-content\/uploads\/100.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">158852867<\/span><span style=\"font-weight: 400;\">]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ARHAR HOLDT, \u0160pela, GANTAR, Polona, KOSEM, Iztok, PORI, Eva, ROBNIK \u0160IKONJA, Marko, KREK, Simon. Thesaurus of Modern Slovene 2.0. V: MEDVE\u010e, Marek (ur.), et al.\u00a0<\/span><i><span style=\"font-weight: 400;\">eLex 2023 : electronic lexicography in the 21st century (eLex 2023) : proceedings of the eLex 2023 conference : [Brno], 27\u201329 June 2023<\/span><\/i><span style=\"font-weight: 400;\">. Brno: Lexical Computing CZ, 2023. Str. 366-381, ilustr. Electronic lexicography in the 21st century. Proceedings of eLex &#8230; conference. ISSN 2533-5626.\u00a0<\/span><a href=\"https:\/\/elex.link\/elex2023\/wp-content\/uploads\/82.pdf\"><span style=\"font-weight: 400;\">https:\/\/elex.link\/elex2023\/wp-content\/uploads\/82.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">158818819<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ARHAR HOLDT, \u0160pela, KOSEM, Iztok, STRITAR KU\u010cUK, Mojca. Developing a specialised concordancer for corpora with language corrections. V:\u00a0<\/span><i><span style=\"font-weight: 400;\">TaLC 2024 : 16th Teaching and Language Corpora Conference : July 7th to 10th 2024, Manchester Metropolitan University, Manchester, UK : <\/span><\/i><i><span style=\"font-weight: 400;\">book of abstracts<\/span><\/i><span style=\"font-weight: 400;\">.<\/span><span style=\"font-weight: 400;\"> [Manchester: Manchester Metropolitan University], 2024. Str. [77].\u00a0<\/span><a href=\"https:\/\/talc2024.co.uk\/wp-content\/uploads\/2024\/07\/book-of-abstracts-talc-2024_final-4.pdf\"><span style=\"font-weight: 400;\">https:\/\/talc2024.co.uk\/wp-content\/uploads\/2024\/07\/book-of-abstracts-talc-2024_final-4.pdf<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">204245507<\/span><span style=\"font-weight: 400;\">] <\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">KOSEM, Iztok, STRITAR KU\u010cUK, Mojca, ARHAR HOLDT, \u0160pela. Corplus: a new concordancer for exploring authentic texts with language corrections. <i>Journal of responsible technology<\/i>. Mar. 2026, vol. 25, [article no.] 100144, str. 1-9, DOI: <a href=\"https:\/\/dx.doi.org\/10.1016\/j.jrt.2025.100144\">10.1016\/j.jrt.2025.100144<\/a>. [COBISS.SI-ID 263757059]<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Corpora and corpus-based resources for Slovene\u2014such as <em data-start=\"55\" data-end=\"68\">Sloleks 2.0<\/em>, <em data-start=\"70\" data-end=\"85\">Sopomenke 2.0<\/em>, <em data-start=\"87\" data-end=\"103\">Kolokacije 2.0<\/em>, the reference corpus <em data-start=\"126\" data-end=\"140\">Gigafida 2.0<\/em>, and <em data-start=\"146\" data-end=\"157\">\u0160olar 3.0<\/em>\u2014can be used in various ways: by directing students to specific entries in a language resource; by integrating and displaying selected datasets within a digital tool; or by incorporating explanations, examples, and exercises based on linguistic analyses. With the emergence of generative artificial intelligence, it is now also important to consider machine-generated feedback, provided it supports teachers and aligns with their expectations and needs. We presented AI-supported selection of pedagogically appropriate corpus examples as a model for future applications at a recent international conference.<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">KOSEM, Iztok, ZINGANO KUHN, Tanara, ARHAR HOLDT, \u0160pela, KOPPEL, Kristina, TIBERIUS, Carole, ZVIEL-GIRSHIN, Rina. <em data-start=\"394\" data-end=\"485\">Examining the potential of AI in the annotation of corpus examples for language learning.<\/em> In: CILC2024: XV Congreso Internacional de Ling\u00fc\u00edstica de Corpus, Las Palmas de Gran Canaria, Espa\u00f1a = 15th International Corpus Linguistics Conference, Las Palmas de Gran Canaria, Spain, 22\u201324 May 2024: [book of abstracts]. [S. l.]: Aelinco, 2024. pp. 93\u201395. <a class=\"\" href=\"https:\/\/drive.google.com\/file\/d\/1rHS4OwztEPvYOPwHE5Mxn-lnK2ErbiHV\/view\" target=\"_new\" rel=\"noopener\" data-start=\"746\" data-end=\"890\">https:\/\/drive.google.com\/file\/d\/1rHS4OwztEPvYOPwHE5Mxn-lnK2ErbiHV\/view<\/a>. [COBISS.SI-ID 199984643]<\/p>\n<p><b>Testing feedback with target user groups<\/b><\/p>\n<p>Project participants from the teaching community expressed a desire for a professional conference where they could share their experiences, practices, and perspectives on providing feedback with a broader audience of educators. As the project\u2019s first public event, we therefore organized a professional conference titled <em data-start=\"676\" data-end=\"745\">Correcting Language and Texts \u2013 Teacher Feedback in School Practice<\/em>. The event was very well attended, and as a result, a conference volume was compiled, featuring 25 peer-reviewed contributions by teachers addressing topics such as language correction, feedback, formative assessment, and other themes relevant to the project.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\">PORI, Eva (urednik), ARHAR HOLDT, \u0160pela (urednik).\u00a0<i><span style=\"font-weight: 400;\">Popravljanje jezika in besedil &#8211; u\u010diteljska povratna informacija v \u0161olski praksi: zbornik konference<\/span><\/i><span style=\"font-weight: 400;\">. 1. izd. Ljubljana: Zalo\u017eba Univerze, 2023. Spletni vir, 374 strani<\/span><span style=\"font-weight: 400;\">, DOI:\u00a0<\/span><a href=\"https:\/\/dx.doi.org\/10.4312\/9789612972394\"><span style=\"font-weight: 400;\">10.4312\/9789612972394<\/span><\/a><span style=\"font-weight: 400;\">. [COBISS.SI-ID\u00a0<\/span><span style=\"font-weight: 400;\">178525187<\/span><span style=\"font-weight: 400;\">] <\/span><\/p>\n<p data-start=\"1325\" data-end=\"2000\">Questions regarding the preferred form of digital feedback were included in teacher interviews described in the activity <em data-start=\"1446\" data-end=\"1510\">Linguistic and Teacher Evaluation of Automatic Annotation of Texts<\/em>. We explored preferences for the visualisation of feedback in digital tools, with results showing the importance of didactically appropriate presentation. Particularly valued were links to relevant language resources and the integration of statistics on textual features. However, an excessive number of corrections may demotivate students, while automated corrections may reduce their active engagement. Teachers thus emphasised the need for graded correction options.\u00a0This concept of <em data-start=\"2018\" data-end=\"2035\">graded feedback<\/em> was interpreted in different ways: as adjustable strictness of corrections for different user groups, the possibility of choosing between automatic corrections or simple colour highlighting of errors, or even staged correction across language levels\u2014for instance, from structural issues to orthographic ones. The findings specifying user needs and preferences for future model upgrades were published in a peer-reviewed journal.<\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ARHAR HOLDT, \u0160pela, MUNDA, Tina.\u00a0<i>U\u010diteljsko popravljanje \u0161olskih besedil v digitalnem okolju: intervjuji z u\u010ditelji slovenskih O\u0160 in S\u0160<\/i>. Ljubljana: Zaklju\u010dena znanstvena zbirka raziskovalnih podatkov. 2025.\u00a0<a href=\"https:\/\/repozitorij.uni-lj.si\/IzpisGradiva.php?lang=slv&amp;id=169549\">https:\/\/repozitorij.uni-lj.si\/IzpisGradiva.php?lang=slv&amp;id=169549<\/a>. [COBISS.SI-ID 248986115]<\/span><\/p>\n<p style=\"text-align: justified; font-size: 12px; line-height: 100%; padding-left: 10px;\"><span style=\"font-weight: 400;\">ARHAR HOLDT, \u0160pela, MUNDA, Tina. Jezikovno popravljanje v digitalnem okolju: kvalitativna \u0161tudija z u\u010diteljicami in u\u010ditelji sloven\u0161\u010dine. <i>Sodobna pedagogika,<\/i> okt. 2025, letn. 76 = 142, \u0161t. 3, str. 86-106, DOI: <a href=\"https:\/\/dx.doi.org\/10.63384\/sptB53s791s\">10.63384\/sptB53s791s<\/a>. [COBISS.SI-ID 259204611]<\/span><\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<section class=\"av_toggle_section\"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\"  >    <div role=\"tablist\" class=\"single_toggle\" data-tags=\"{All} \"  >        <p data-fake-id=\"#toggle-id-9\" class=\"toggler  av-inherit-border-color \"  itemprop=\"headline\"  style='border-color: #93c83f; '  role=\"tab\" tabindex=\"0\" aria-controls=\"toggle-id-9-container\">Project events and lectures<span class=\"toggle_icon\" >        <span class=\"vert_icon\"><\/span><span class=\"hor_icon\"><\/span><\/span><\/p>        <div id=\"toggle-id-9-container\" class=\"toggle_wrap \"  >            <div class=\"toggle_content invers-color  av-inherit-border-color \"  itemprop=\"text\"  style='border-color: #93c83f; ' ><p><b>Teacher conference<\/b><\/p>\n<p>On <strong>April 5, 2023,<\/strong> we organized the professional conference <em>Correcting Language and Texts \u2013 Teacher Feedback in School Practice<\/em> (<em>Popravljanje jezika in besedil \u2013 u\u010diteljska povratna informacija v \u0161olski praksi) <\/em>at the Faculty of Public Administration, University of Ljubljana.<\/p>\n<ul>\n<li>You can view the programme and event photos <a href=\"https:\/\/www.cjvt.si\/blog\/konferenca-prop-popravljanje-jezika-in-besedil-uciteljska-povratna-informacija-v-solski-praksi\/\">at this link<\/a>.<\/li>\n<li>The conference proceedings volume with peer-reviewed contributions is available\u00a0<a href=\"https:\/\/ebooks.uni-lj.si\/ZalozbaUL\/catalog\/book\/500\">here<\/a>.<\/li>\n<\/ul>\n<p><b>Teacher training<\/b><\/p>\n<p>On <strong>November 24, 2023,<\/strong> a teacher training session was held at the Faculty of Arts, University of Ljubljana, organized by the Department of Slovene Studies. Among the topics presented was <em data-start=\"594\" data-end=\"670\">Preparing Teaching Materials Using the \u0160olar 3.0 Corpus of Student Writing (Priprava u\u010dnih gradiv s korpusom \u0161olskih pisnih izdelkov \u0160olar 3.0)<\/em>. The following materials are available below:<\/p>\n<ul>\n<li>slides (<a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/11\/Priprava-ucnih-gradiv-s-korpusom-solskih-pisnih-izdelkov-Solar-3.0.pdf\">PDF<\/a>)<\/li>\n<li>guidelines for annotating language corrections in the \u0160olar corpus (<a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/11\/Smernice-za-oznacevanje-korpusa-Solar-v1.1.pdf\">PDF<\/a>)<\/li>\n<li>frequency list of language problems from the \u0160olar corpus (<a href=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2023\/11\/Izobrazevanje-uciteljev-Frekvencni-seznam-jezikovnih-popravkov-Solar-3.0.xlsx\">XLSX<\/a>)<\/li>\n<li>CJVT concordancer demo (<a href=\"https:\/\/solar.cjvt.si\/\">povezava<\/a>)<\/li>\n<li>noSketch Engine concordancer on Clarin.si (<a href=\"https:\/\/www.clarin.si\/noske\/run.cgi\/first_form?corpname=solar30_orig;align=\">povezava<\/a>)<\/li>\n<\/ul>\n<p><b>Invited lectures<\/b><\/p>\n<ul>\n<li style=\"text-align: justified; font-size: 12px; line-height: 100%;\">ARHAR HOLDT, \u0160pela. <em>Leveraging error-annotated corpora and the Svala Tool: the case of Slovene<\/em>: guest talk at Department of Swedish, Multilingualism, Language Technology, University of Gothenburg, Sweden, 20 June 2023. [COBISS.SI-ID\u00a0171445507]<\/li>\n<li style=\"text-align: justified; font-size: 12px; line-height: 100%;\">ARHAR HOLDT, \u0160pela. <em>A specialised concordancer for corpora with annotated language corrections<\/em>: invited presentation at Department of Swedish, Multilingualism, and Language Technology, University of Gothenburg, 23rd of April 2024, Gothenburg, Sweden. [COBISS.SI-ID 214135299]<\/li>\n<li style=\"text-align: justified; font-size: 12px; line-height: 100%;\">ARHAR HOLDT, \u0160pela. From developmental corpus to developed applications: the journey of \u0160olar 3.0: presentation at the Faculty of Arts and Humanities of the University of Coimbra, Coimbra, Portugal, 29 Nov. 2024. [COBISS.SI-ID 229553923]<\/li>\n<li style=\"text-align: justified; font-size: 12px; line-height: 100%;\">KOSEM, Iztok. Corpus tools and language resources at the University of Ljubljana: purposes and people behind the development: presentation at the Faculty of Arts and Humanities of the University of Coimbra, Coimbra, Portugal, 29 Nov. 2024. [COBISS.SI-ID 229749251]<\/li>\n<\/ul>\n<p data-start=\"273\" data-end=\"623\"><strong>Final Project Event<\/strong><\/p>\n<p data-start=\"273\" data-end=\"623\">The final event of the research project took place on <strong data-start=\"327\" data-end=\"348\">23 September 2025<\/strong> in <em>Zborni\u010dna dvorana <\/em>of the University of Ljubljana. During the event, the project team presented an overview of key outcomes, selected studies, resources, and tools developed throughout the project. All lectures will be made available on the <strong data-start=\"598\" data-end=\"615\">VideoLectures<\/strong> portal.<\/p>\n<p data-start=\"625\" data-end=\"639\"><strong data-start=\"625\" data-end=\"639\">Programme:<\/strong><\/p>\n<ul data-start=\"641\" data-end=\"1834\">\n<li data-start=\"641\" data-end=\"787\">\n<p data-start=\"643\" data-end=\"787\"><strong data-start=\"643\" data-end=\"658\">10:30\u201310:40<\/strong>\u202f Welcome and opening remarks by the Dean of the Faculty of Arts, University of Ljubljana, Prof. Dr. Mojca Schlamberger Brezar [<a href=\"https:\/\/videolectures.net\/videos\/PROP2025_schlamberger_brezar\">VIDEO<\/a>]<\/p>\n<\/li>\n<li data-start=\"788\" data-end=\"931\">\n<p data-start=\"790\" data-end=\"931\"><strong data-start=\"790\" data-end=\"805\">10:40\u201311:00 <\/strong>\u202f\u0160pela Arhar Holdt: <em data-start=\"829\" data-end=\"931\">Results of the project \u201cEmpirical Foundations for Digitally-Supported Development of Writing Skills\u201d <\/em>[<a href=\"https:\/\/videolectures.net\/videos\/PROP2025_arhar_holdt\">VIDEO<\/a>]<\/p>\n<\/li>\n<li data-start=\"932\" data-end=\"1026\">\n<p data-start=\"934\" data-end=\"1026\"><strong data-start=\"934\" data-end=\"949\">11:00\u201311:15<\/strong>\u202f\u00a0 Iztok Kosem: <em data-start=\"967\" data-end=\"1026\">First steps towards a core vocabulary list for school use\u00a0<\/em>[<a href=\"https:\/\/videolectures.net\/videos\/PROP2025_kosem_koraki\">VIDEO<\/a>]<\/p>\n<\/li>\n<li data-start=\"1027\" data-end=\"1096\">\n<p data-start=\"1029\" data-end=\"1096\"><strong data-start=\"1029\" data-end=\"1044\">11:15\u201311:30<\/strong>\u202f Tadeja Rozman: <em data-start=\"1064\" data-end=\"1096\">The student writing corpus KO\u0160\u00a0<\/em>[<a href=\"https:\/\/videolectures.net\/videos\/PROP2025_rozman_korpus_kos\">VIDEO<\/a>]<\/p>\n<\/li>\n<li data-start=\"1097\" data-end=\"1220\">\n<p data-start=\"1099\" data-end=\"1220\"><strong data-start=\"1099\" data-end=\"1114\">11:30\u201311:45<\/strong>\u202f Matej Klemen: <em data-start=\"1133\" data-end=\"1220\">Automatic correction of language errors using language models: from data to solutions\u00a0<\/em>[<a href=\"https:\/\/videolectures.net\/videos\/PROP2025_rozman_modeli\">VIDEO<\/a>]<\/p>\n<\/li>\n<li data-start=\"1221\" data-end=\"1248\">\n<p data-start=\"1223\" data-end=\"1248\"><strong data-start=\"1223\" data-end=\"1238\">11:45\u201312:00<\/strong>\u202f<strong data-start=\"1239\" data-end=\"1248\">Break<\/strong><\/p>\n<\/li>\n<li data-start=\"1249\" data-end=\"1362\">\n<p data-start=\"1251\" data-end=\"1362\"><strong data-start=\"1251\" data-end=\"1266\">12:00\u201312:15<\/strong>\u202f Alenka Rot Vrhovec: <em data-start=\"1291\" data-end=\"1362\">Insights from the teacher survey on the correction of student writing\u00a0<\/em>[VIDEO]<\/p>\n<\/li>\n<li data-start=\"1363\" data-end=\"1497\">\n<p data-start=\"1365\" data-end=\"1497\"><strong data-start=\"1365\" data-end=\"1380\">12:15\u201312:30<\/strong>\u202f Tina Munda: <em data-start=\"1397\" data-end=\"1497\">Language correction in the digital environment: a qualitative study with Slovene language teachers\u00a0<\/em>[<a href=\"https:\/\/videolectures.net\/videos\/PROP2025_rot_vrhovec\">VIDEO<\/a>]<\/p>\n<\/li>\n<li data-start=\"1498\" data-end=\"1627\">\n<p data-start=\"1500\" data-end=\"1627\"><strong data-start=\"1500\" data-end=\"1515\">12:30\u201312:45<\/strong>\u202f Karmen Pi\u017eorn: <em data-start=\"1535\" data-end=\"1627\">The role of digital tools in developing EFL writing skills: a systematic literature review\u00a0<\/em>[<a href=\"https:\/\/videolectures.net\/videos\/PROP2025_pizorn_vloga_orodij\">VIDEO<\/a>]<\/p>\n<\/li>\n<li data-start=\"1628\" data-end=\"1753\">\n<p data-start=\"1630\" data-end=\"1753\"><strong data-start=\"1630\" data-end=\"1645\">12:45\u201313:00<\/strong>\u202f Milena Ko\u0161ak Babuder: <em data-start=\"1672\" data-end=\"1753\">Digitally-supported learning among students with specific learning difficulties\u00a0<\/em>[<a href=\"https:\/\/videolectures.net\/videos\/PROP2025_kosak_babuder\">VIDEO<\/a>]<\/p>\n<\/li>\n<li data-start=\"1754\" data-end=\"1834\">\n<p data-start=\"1756\" data-end=\"1834\"><strong data-start=\"1756\" data-end=\"1771\">13:00\u201314:00<\/strong>\u202f <strong data-start=\"1772\" data-end=\"1834\">Audience discussion, refreshments, and networking<\/strong><\/p>\n<\/li>\n<\/ul>\n<p><img decoding=\"async\" class=\"alignnone wp-image-2506 size-full\" src=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804.jpeg\" alt=\"\" width=\"800\" height=\"600\" srcset=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804.jpeg 800w, https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804-300x225.jpeg 300w, https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804-768x576.jpeg 768w, https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804-705x529.jpeg 705w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/p>\n            <\/div>        <\/div>    <\/div><\/section>\n<\/div><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_relevanssi_hide_post":"","_relevanssi_hide_content":"","_relevanssi_pin_for_all":"","_relevanssi_pin_keywords":"","_relevanssi_unpin_keywords":"","_relevanssi_related_keywords":"","_relevanssi_related_include_ids":"","_relevanssi_related_exclude_ids":"","_relevanssi_related_no_append":"","_relevanssi_related_not_related":"","_relevanssi_related_posts":"","_relevanssi_noindex_reason":"","inline_featured_image":false,"episode_type":"","audio_file":"","podmotor_file_id":"","podmotor_episode_id":"","cover_image":"","cover_image_id":"","duration":"","filesize":"","filesize_raw":"","date_recorded":"","explicit":"","block":"","itunes_episode_number":"","itunes_title":"","itunes_season_number":"","itunes_episode_type":"","footnotes":""},"class_list":["post-946","page","type-page","status-publish","hentry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>About the project - Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti<\/title>\n<meta name=\"description\" content=\"The aim of the project is to support teachers who correct and grade student writing.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.cjvt.si\/prop\/en\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"About the project - Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti\" \/>\n<meta property=\"og:description\" content=\"The aim of the project is to support teachers who correct and grade student writing.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.cjvt.si\/prop\/en\/\" \/>\n<meta property=\"og:site_name\" content=\"Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti\" \/>\n<meta property=\"article:modified_time\" content=\"2026-03-08T13:33:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804.jpeg\" \/>\n\t<meta property=\"og:image:width\" content=\"800\" \/>\n\t<meta property=\"og:image:height\" content=\"600\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"13 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/\",\"url\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/\",\"name\":\"About the project - Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/wp-content\\\/uploads\\\/sites\\\/23\\\/2025\\\/10\\\/1758648860804.jpeg\",\"datePublished\":\"2020-03-31T20:42:40+00:00\",\"dateModified\":\"2026-03-08T13:33:37+00:00\",\"description\":\"The aim of the project is to support teachers who correct and grade student writing.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/wp-content\\\/uploads\\\/sites\\\/23\\\/2025\\\/10\\\/1758648860804.jpeg\",\"contentUrl\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/wp-content\\\/uploads\\\/sites\\\/23\\\/2025\\\/10\\\/1758648860804.jpeg\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"About the project\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/#website\",\"url\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/\",\"name\":\"Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti\",\"description\":\"Podprt razvoj pisanja\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.cjvt.si\\\/prop\\\/en\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"About the project - Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti","description":"The aim of the project is to support teachers who correct and grade student writing.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.cjvt.si\/prop\/en\/","og_locale":"en_US","og_type":"article","og_title":"About the project - Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti","og_description":"The aim of the project is to support teachers who correct and grade student writing.","og_url":"https:\/\/www.cjvt.si\/prop\/en\/","og_site_name":"Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti","article_modified_time":"2026-03-08T13:33:37+00:00","og_image":[{"width":800,"height":600,"url":"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804.jpeg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"13 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.cjvt.si\/prop\/en\/","url":"https:\/\/www.cjvt.si\/prop\/en\/","name":"About the project - Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti","isPartOf":{"@id":"https:\/\/www.cjvt.si\/prop\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.cjvt.si\/prop\/en\/#primaryimage"},"image":{"@id":"https:\/\/www.cjvt.si\/prop\/en\/#primaryimage"},"thumbnailUrl":"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804.jpeg","datePublished":"2020-03-31T20:42:40+00:00","dateModified":"2026-03-08T13:33:37+00:00","description":"The aim of the project is to support teachers who correct and grade student writing.","breadcrumb":{"@id":"https:\/\/www.cjvt.si\/prop\/en\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.cjvt.si\/prop\/en\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.cjvt.si\/prop\/en\/#primaryimage","url":"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804.jpeg","contentUrl":"https:\/\/www.cjvt.si\/prop\/wp-content\/uploads\/sites\/23\/2025\/10\/1758648860804.jpeg"},{"@type":"BreadcrumbList","@id":"https:\/\/www.cjvt.si\/prop\/en\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.cjvt.si\/prop\/en\/"},{"@type":"ListItem","position":2,"name":"About the project"}]},{"@type":"WebSite","@id":"https:\/\/www.cjvt.si\/prop\/en\/#website","url":"https:\/\/www.cjvt.si\/prop\/en\/","name":"Empiri\u010dna podlaga za digitalno podprt razvoj pisne jezikovne zmo\u017enosti","description":"Podprt razvoj pisanja","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.cjvt.si\/prop\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.cjvt.si\/prop\/en\/wp-json\/wp\/v2\/pages\/946","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cjvt.si\/prop\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.cjvt.si\/prop\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/prop\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/prop\/en\/wp-json\/wp\/v2\/comments?post=946"}],"version-history":[{"count":145,"href":"https:\/\/www.cjvt.si\/prop\/en\/wp-json\/wp\/v2\/pages\/946\/revisions"}],"predecessor-version":[{"id":2599,"href":"https:\/\/www.cjvt.si\/prop\/en\/wp-json\/wp\/v2\/pages\/946\/revisions\/2599"}],"wp:attachment":[{"href":"https:\/\/www.cjvt.si\/prop\/en\/wp-json\/wp\/v2\/media?parent=946"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}