{"id":946,"date":"2020-03-31T22:42:40","date_gmt":"2020-03-31T20:42:40","guid":{"rendered":"https:\/\/www.cjvt.starkmat.si\/nadgradnja-korpusov\/project\/"},"modified":"2020-05-11T13:20:50","modified_gmt":"2020-05-11T11:20:50","slug":"project","status":"publish","type":"page","link":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/","title":{"rendered":"Upgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora"},"content":{"rendered":"

PROJECT DESCRIPTION<\/h1>\n<\/div><\/section>
\n
<\/span><\/span><\/div>
\n

The Gigafida, Kres, ccGigafida and ccKres corpora form the basis for the development of modern language handbooks and language technologies for Slovene.\u00a0Gigafida<\/a>\u00a0and\u00a0Kres\u00a0<\/a>have user friendly interfaces and are used often by linguists, translators, editors, proofreaders, teachers and other similar user groups. These corpora are essential for language research and development, however they can only serve their purpose if they are continually updated and upgraded.<\/p>\n

The project Upgrade of Gigafida, Kres, ccGigafida and ccKres is financed by the\u00a0Ministry of Culture<\/a>\u00a0under the contract nr. 33400-15-141007 between the Ministry and the\u00a0University of Ljubljana<\/a>\u00a0for the period 2015\u20132018. It is run by the\u00a0Centre for Language Resources and Technologies<\/a>\u00a0and has three objectives: targeted acquisition of new materials, machine processing of new and existing materials, public availability and dissemination of upgraded corpora.<\/p>\n

We will focus on types of texts which are currently underrepresented in Gigafida and Kres, i.e. mainly school reading materials and other popular literature. On the other hand we will add texts from selected news websites, which will ensure that the corpus data is more up-to-date. The new materials will enlarge the existing corpora by about a quarter, which in the case of Gigafida means it will grow from 1.2 to around 1.5 billion words. The technical aspects will be updated as well: we will develop tools for removing surplus copies of texts, improve the accuracy of linguistic annotation and divide standard language texts from texts which deviate from linguistic standards into subcorpora.<\/p>\n<\/div><\/section><\/p><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_relevanssi_hide_post":"","_relevanssi_hide_content":"","_relevanssi_pin_for_all":"","_relevanssi_pin_keywords":"","_relevanssi_unpin_keywords":"","_relevanssi_related_keywords":"","_relevanssi_related_include_ids":"","_relevanssi_related_exclude_ids":"","_relevanssi_related_no_append":"","_relevanssi_related_not_related":"","_relevanssi_related_posts":"","_relevanssi_noindex_reason":"","inline_featured_image":false,"episode_type":"","audio_file":"","cover_image":"","cover_image_id":"","duration":"","filesize":"","date_recorded":"","explicit":"","block":"","itunes_episode_number":"","itunes_title":"","itunes_season_number":"","itunes_episode_type":"","filesize_raw":"","footnotes":""},"class_list":["post-946","page","type-page","status-publish","hentry"],"acf":[],"yoast_head":"\nUpgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora - Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres<\/title>\n<meta name=\"robots\" content=\"noindex, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Upgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora - Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/\" \/>\n<meta property=\"og:site_name\" content=\"Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres\" \/>\n<meta property=\"article:modified_time\" content=\"2020-05-11T11:20:50+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/\",\"url\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/\",\"name\":\"Upgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora - Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres\",\"isPartOf\":{\"@id\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/#website\"},\"datePublished\":\"2020-03-31T20:42:40+00:00\",\"dateModified\":\"2020-05-11T11:20:50+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Upgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/#website\",\"url\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/\",\"name\":\"Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres\",\"description\":\"Posodobljeni viri za sloven\u0161\u010dino\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Upgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora - Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres","robots":{"index":"noindex","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_US","og_type":"article","og_title":"Upgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora - Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres","og_url":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/","og_site_name":"Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres","article_modified_time":"2020-05-11T11:20:50+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/","url":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/","name":"Upgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora - Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres","isPartOf":{"@id":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/#website"},"datePublished":"2020-03-31T20:42:40+00:00","dateModified":"2020-05-11T11:20:50+00:00","breadcrumb":{"@id":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/project\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/"},{"@type":"ListItem","position":2,"name":"Upgrade of the Gigafida, Kres, ccGigafida and ccKres Corpora"}]},{"@type":"WebSite","@id":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/#website","url":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/","name":"Nadgradnja korpusov Gigafida, Kres, ccGigafida in ccKres","description":"Posodobljeni viri za sloven\u0161\u010dino","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/wp-json\/wp\/v2\/pages\/946","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/wp-json\/wp\/v2\/comments?post=946"}],"version-history":[{"count":19,"href":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/wp-json\/wp\/v2\/pages\/946\/revisions"}],"predecessor-version":[{"id":1194,"href":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/wp-json\/wp\/v2\/pages\/946\/revisions\/1194"}],"wp:attachment":[{"href":"https:\/\/www.cjvt.si\/nadgradnja-korpusov\/en\/wp-json\/wp\/v2\/media?parent=946"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}