{"id":959,"date":"2020-03-31T22:48:33","date_gmt":"2020-03-31T20:48:33","guid":{"rendered":"https:\/\/www.cjvt.starkmat.si\/template-projekt\/work-packages\/work-package-2\/"},"modified":"2024-01-25T11:37:24","modified_gmt":"2024-01-25T10:37:24","slug":"work-package-2","status":"publish","type":"page","link":"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/","title":{"rendered":"Work Package 2: SloLLaMai"},"content":{"rendered":"<div class=\"flex_column av_one_full  no_margin flex_column_div av-zero-column-padding first  avia-builder-el-0  el_before_av_one_full  avia-builder-el-first  \" style='margin-top:0px; margin-bottom:30px; border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><h2>Open-access computationally efficient models for Slovenian<\/h2>\n<\/div><\/section><br \/>\n<section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><h3>SloLLaMai<\/h3>\n<\/div><\/section><\/p><\/div>\n<div class=\"flex_column av_one_full  no_margin flex_column_div av-zero-column-padding first  avia-builder-el-3  el_after_av_one_full  el_before_av_one_full  column-top-margin\" style='margin-top:0px; margin-bottom:30px; border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><p>Very large language models such as ChatGPT and GPT-4 have recently shown remarkable progress on certain tasks, but using them poses numerous practical challenges: they are closed and lack transparency, they have high computational requirements, and they are costly to customize and deploy widely, which puts them out of reach of most research organizations and companies. 
Further scaling up these models is no longer practical. Smaller, open-access models such as LLaMA, Alpaca, GPT4All, and Koala have also emerged; they can be trained or adapted for specific tasks on ordinary GPU machines and achieve performance close to that of the largest models.<\/p>\n<p>In the SloLLaMai project, we will develop an open-access, computationally efficient generative language model for Slovenian. This model will be the first of its kind for a morphologically rich language with limited resources, which makes it a significant research challenge. This new large general-purpose model will serve as fundamental infrastructure for industrial projects and for new products that require natural language processing. Previously developed models (e.g., SloT5, SloBERTa, CroSloEn BERT) have already enabled technologies that, just a few years ago, could not be built for Slovenian with accuracy comparable to that achieved for larger languages (e.g., machine translation, summarization, question answering). These technologies would not have been possible without models for Slovenian.<\/p>\n<p>The results of the first project (RRP1) are necessary for the development of the next generation of large language models. 
They will make it possible to prepare general language models that can be specialized for specific natural language processing tasks.<\/p>\n<\/div><\/section><\/div>\n<div class=\"flex_column av_one_full  no_margin flex_column_div av-zero-column-padding first  avia-builder-el-5  el_after_av_one_full  el_before_av_one_full  column-top-margin\" style='margin-top:0px; margin-bottom:30px; border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><h3>Specific objectives:<\/h3>\n<ol>\n<li>Development of modern, computationally efficient large generative language models of the GPT type (variants such as LLaMA, Alpaca, and Koala) and their adaptation for instruction following, dialogue, and the Slovenian language. The resulting large models are the foundational infrastructure for the rest of the project and for applied projects.<\/li>\n<li>Improvement of the models by incorporating additional knowledge into the constructed large generative language models to improve logical reasoning, common-sense reasoning, handling of the linguistic and morphological peculiarities of the Slovenian language, and adherence to ethical and legal norms.<\/li>\n<li>Adaptation of large language models for devices with limited computing capacity and for industrial applications; research into compression and distillation of large language models, quantization, and approximate computing, taking into account the morphological richness of the target language.<\/li>\n<\/ol>\n<\/div><\/section><\/div>\n<div class=\"flex_column av_one_full  no_margin flex_column_div av-zero-column-padding first  avia-builder-el-7  el_after_av_one_full  el_before_av_one_full  column-top-margin\" style='margin-top:0px; margin-bottom:30px; border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div 
class='avia_textblock  '   itemprop=\"text\" ><h3>Results:<\/h3>\n<ul>\n<li><strong>D2.1: <\/strong>Open-access large generative language model with one billion parameters, adapted for dialogue and instruction following (August 2024).<\/li>\n<li><strong>D2.2:<\/strong> Open-access large generative language model with 10 billion parameters, adapted for dialogue and instruction following (February 2025).<\/li>\n<li><strong>D2.3:<\/strong> Open-access computationally lightweight generative language model adapted for dialogue and instruction following (August 2025).<\/li>\n<li><strong>D2.4:<\/strong> Open-access large language model with embedded additional knowledge (February 2026).<\/li>\n<\/ul>\n<\/div><\/section><\/div>\n<div class=\"flex_column av_one_full  no_margin flex_column_div av-zero-column-padding first  avia-builder-el-9  el_after_av_one_full  el_before_av_one_fourth  column-top-margin\" style='margin-top:0px; margin-bottom:30px; border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><h3>Project partners:<\/h3>\n<\/div><\/section><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding first  avia-builder-el-11  el_after_av_one_full  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><h5>Project leader:<\/h5>\n<\/div><\/section><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding   avia-builder-el-13  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><div  class='avia-button-wrap avia-button-center  avia-builder-el-14  avia-builder-el-no-sibling ' ><a href='https:\/\/www.fri.uni-lj.si\/en' class='avia-button avia-button-fullwidth   avia-icon_select-no 
avia-color-theme-color '  style='color:#ffffff; ' ><span class='avia_iconbox_title' >Faculty of Computer and Information Science UL<\/span><span class='avia_button_background avia-button avia-button-fullwidth avia-color-theme-color-highlight' ><\/span><\/a><\/div><\/div><div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding   avia-builder-el-15  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><\/div><\/p>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding   avia-builder-el-16  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding first  avia-builder-el-17  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><h5>Partners:<\/h5>\n<\/div><\/section><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding   avia-builder-el-19  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><div  class='avia-button-wrap avia-button-center  avia-builder-el-20  avia-builder-el-no-sibling ' ><a href='https:\/\/semantika.eu\/en-us\/' class='avia-button avia-button-fullwidth   avia-icon_select-no avia-color-theme-color '  style='color:#ffffff; ' ><span class='avia_iconbox_title' >Semantika d.o.o.<\/span><span class='avia_button_background avia-button avia-button-fullwidth avia-color-theme-color-highlight' ><\/span><\/a><\/div><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding   avia-builder-el-21  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><div  class='avia-button-wrap avia-button-center 
 avia-builder-el-22  avia-builder-el-no-sibling ' ><a href='https:\/\/xlab.si\/' class='avia-button avia-button-fullwidth   avia-icon_select-no avia-color-theme-color '  style='color:#ffffff; ' ><span class='avia_iconbox_title' >XLAB d.o.o.<\/span><span class='avia_button_background avia-button avia-button-fullwidth avia-color-theme-color-highlight' ><\/span><\/a><\/div><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding   avia-builder-el-23  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><div  class='avia-button-wrap avia-button-center  avia-builder-el-24  avia-builder-el-no-sibling ' ><a href='https:\/\/vitasis.si\/home' class='avia-button avia-button-fullwidth   avia-icon_select-no avia-color-theme-color '  style='color:#ffffff; ' ><span class='avia_iconbox_title' >VITASIS d.o.o.<\/span><span class='avia_button_background avia-button avia-button-fullwidth avia-color-theme-color-highlight' ><\/span><\/a><\/div><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding first  avia-builder-el-25  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding   avia-builder-el-26  el_after_av_one_fourth  el_before_av_one_fourth  column-top-margin\" style='border-radius:0px; '><div  class='avia-button-wrap avia-button-center  avia-builder-el-27  avia-builder-el-no-sibling ' ><a href='https:\/\/www.better.care\/' class='avia-button avia-button-fullwidth   avia-icon_select-no avia-color-theme-color '  style='color:#ffffff; ' ><span class='avia_iconbox_title' >BETTER, d.o.o.<\/span><span class='avia_button_background avia-button avia-button-fullwidth avia-color-theme-color-highlight' ><\/span><\/a><\/div><\/div>\n<div class=\"flex_column av_one_fourth  flex_column_div av-zero-column-padding   avia-builder-el-28  el_after_av_one_fourth  
avia-builder-el-last  column-top-margin\" style='border-radius:0px; '><div  class='avia-button-wrap avia-button-center  avia-builder-el-29  avia-builder-el-no-sibling ' ><a href='https:\/\/cutt.ly\/XwLLp01O' class='avia-button avia-button-fullwidth   avia-icon_select-no avia-color-theme-color '  style='color:#ffffff; ' ><span class='avia_iconbox_title' >\u0160PICA d.o.o.<\/span><span class='avia_button_background avia-button avia-button-fullwidth avia-color-theme-color-highlight' ><\/span><\/a><\/div><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":1,"featured_media":0,"parent":953,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_relevanssi_hide_post":"","_relevanssi_hide_content":"","_relevanssi_pin_for_all":"","_relevanssi_pin_keywords":"","_relevanssi_unpin_keywords":"","_relevanssi_related_keywords":"","_relevanssi_related_include_ids":"","_relevanssi_related_exclude_ids":"","_relevanssi_related_no_append":"","_relevanssi_related_not_related":"","_relevanssi_related_posts":"","_relevanssi_noindex_reason":"","inline_featured_image":false,"episode_type":"","audio_file":"","cover_image":"","cover_image_id":"","duration":"","filesize":"","filesize_raw":"","date_recorded":"","explicit":"","block":"","itunes_episode_number":"","itunes_title":"","itunes_season_number":"","itunes_episode_type":"","footnotes":""},"class_list":["post-959","page","type-page","status-publish","hentry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Work Package 2: SloLLaMai - PoVeJMo<\/title>\n<meta name=\"robots\" content=\"noindex, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Work Package 2: SloLLaMai - PoVeJMo\" \/>\n<meta 
property=\"og:url\" content=\"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/\" \/>\n<meta property=\"og:site_name\" content=\"PoVeJMo\" \/>\n<meta property=\"article:modified_time\" content=\"2024-01-25T10:37:24+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"15 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/\",\"url\":\"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/\",\"name\":\"Work Package 2: SloLLaMai - PoVeJMo\",\"isPartOf\":{\"@id\":\"https:\/\/www.cjvt.si\/povejmo\/en\/#website\"},\"datePublished\":\"2020-03-31T20:48:33+00:00\",\"dateModified\":\"2024-01-25T10:37:24+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.cjvt.si\/povejmo\/o-programu\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Work Packages\",\"item\":\"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Work Package 2\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.cjvt.si\/povejmo\/en\/#website\",\"url\":\"https:\/\/www.cjvt.si\/povejmo\/en\/\",\"name\":\"PoVeJMo\",\"description\":\"Work 
site\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.cjvt.si\/povejmo\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Work Package 2: SloLLaMai - PoVeJMo","robots":{"index":"noindex","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_US","og_type":"article","og_title":"Work Package 2: SloLLaMai - PoVeJMo","og_url":"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/","og_site_name":"PoVeJMo","article_modified_time":"2024-01-25T10:37:24+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"15 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/","url":"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/","name":"Work Package 2: SloLLaMai - PoVeJMo","isPartOf":{"@id":"https:\/\/www.cjvt.si\/povejmo\/en\/#website"},"datePublished":"2020-03-31T20:48:33+00:00","dateModified":"2024-01-25T10:37:24+00:00","breadcrumb":{"@id":"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/slollamai\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.cjvt.si\/povejmo\/o-programu\/"},{"@type":"ListItem","position":2,"name":"Work Packages","item":"https:\/\/www.cjvt.si\/povejmo\/en\/work-packages\/"},{"@type":"ListItem","position":3,"name":"Work Package 
2"}]},{"@type":"WebSite","@id":"https:\/\/www.cjvt.si\/povejmo\/en\/#website","url":"https:\/\/www.cjvt.si\/povejmo\/en\/","name":"PoVeJMo","description":"Work site","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.cjvt.si\/povejmo\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/pages\/959","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/comments?post=959"}],"version-history":[{"count":5,"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/pages\/959\/revisions"}],"predecessor-version":[{"id":1711,"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/pages\/959\/revisions\/1711"}],"up":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/pages\/953"}],"wp:attachment":[{"href":"https:\/\/www.cjvt.si\/povejmo\/en\/wp-json\/wp\/v2\/media?parent=959"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}