{"id":2346,"date":"2020-05-29T17:31:37","date_gmt":"2020-05-29T15:31:37","guid":{"rendered":"https:\/\/www.cjvt.starkmat.si\/?page_id=2346"},"modified":"2020-11-05T22:30:11","modified_gmt":"2020-11-05T21:30:11","slug":"access-to-historic-versions-of-the-gigafida-corpus","status":"publish","type":"page","link":"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/","title":{"rendered":"Access to historic versions of the Gigafida corpus"},"content":{"rendered":"<div class='flex_column_table av-equal-height-column-flextable -flextable' style='margin-top:0px; margin-bottom:0px; '><div class=\"flex_column av_two_third  flex_column_table_cell av-equal-height-column av-align-middle av-zero-column-padding first  avia-builder-el-0  el_before_av_one_third  avia-builder-el-first  \" style='border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><h2>ACCESS TO HISTORIC VERSIONS OF THE SLOVENE GIGAFIDA CORPUS<\/h2>\n<\/div><\/section><\/div>\n<div class='av-flex-placeholder'><\/div><div class=\"flex_column av_one_third  flex_column_table_cell av-equal-height-column av-align-middle av-zero-column-padding   avia-builder-el-2  el_after_av_two_third  el_before_av_two_third  \" style='border-radius:0px; '><p><div  class='avia-button-wrap avia-button-left  avia-builder-el-3  el_before_av_button  avia-builder-el-first  gumb-sodelavci-levo' title=\"STARK \u2013 Statistical analysis of dependency-parsed corpora\"><a href='https:\/\/www.cjvt.si\/en\/infrastructure-support\/stark\/'  class='avia-button  av-button-notext   avia-icon_select-yes-left-icon avia-color-theme-color avia-size-small avia-position-left '   ><span class='avia_button_icon avia_button_icon_left ' aria-hidden='true' data-av_icon='\ue87c' data-av_iconfont='entypo-fontello'><\/span><span class='avia_iconbox_title' ><\/span><\/a><\/div><br \/>\n<div  class='avia-button-wrap avia-button-left  avia-builder-el-4  el_after_av_button  avia-builder-el-last  gumb-sodelavci-desno' title=\"Sloleks lexicon accentuation\"><a href='https:\/\/www.cjvt.si\/en\/infrastructure-support\/sloleks-lexicon-accentuation\/'  class='avia-button  av-button-notext   avia-icon_select-yes-right-icon avia-color-theme-color avia-size-small avia-position-left '   ><span class='avia_iconbox_title' ><\/span><span class='avia_button_icon avia_button_icon_right' aria-hidden='true' data-av_icon='\ue87d' data-av_iconfont='entypo-fontello'><\/span><\/a><\/div><\/p><\/div><\/div><!--close column table wrapper. Autoclose: 1 --><div class=\"flex_column av_two_third  flex_column_div av-zero-column-padding first  avia-builder-el-5  el_after_av_one_third  el_before_av_one_third  column-top-margin\" style='margin-top:36px; margin-bottom:0px; border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><p>In the online concordancers <a href=\"http:\/\/www.clarin.si\/noske\">noSketch Engine<\/a> in <a href=\"http:\/\/www.clarin.si\/kontext\">KonText<\/a> on CLARIN.SI, only the latest version of the Gigafida 2.0 corpus was available. Although this edition contains texts from older versions, it has a few differences: duplicated texts and texts in non-standard Slovene were removed. Furthermore, the corpus contains updated linguistic tags.<\/p>\n<p>From time to time, the need to access older versions of the corpus arises, e.g. to analyse texts containing non-standard language features. This is especially important for researching Slovene spoken in neighbouring countries \u2013 such language is present in the bulletin <em>Novi Matajur<\/em>, which was removed from Gigafida 2.0. Furthermore, access to older versions enables previous research repeatability and reproducibility.<\/p>\n<p>In this project, access the previous versions of the Gigafida corpus was granted through the online concordancers noSketch Engine and KonText. More specifically, the corpora FidaPLUS, Gigafida 1.0 and Gigafida 1.1 are accessible. It was planned that the very first version of Gigafida (the corpus FIDA) would also be available \u2013 agreements with the owners of the corpus, the companies Amebis, d.o.o. and DZS, d.d. have been signed. Unfortunately, all project funds have been exhausted for copyright transfer from DZS, d.d. to the University of Ljubljana. Thus, no funding was left to actually transfer the corpus from physical CDs to a digital format which would be suitable for the concordancers.<\/p>\n<p>The following versions of the Gigafida corpus are available through noSketch Engine and KonText:<\/p>\n<ul>\n<li>Gigafida v2.0 proto (non-deduplicated): <a href=\"https:\/\/www.clarin.si\/noske\/run.cgi\/corp_info?corpname=gfida20&amp;struct_attr_stats=1\">noSketch Engine<\/a>, <a href=\"https:\/\/www.clarin.si\/kontext\/first_form?corpname=gfida20_dedup\">KonText,<\/a><\/li>\n<li>Gigafida v2.0 (deduplicated): <a href=\"https:\/\/www.clarin.si\/noske\/run.cgi\/corp_info?corpname=gfida20_dedup&amp;struct_attr_stats=1\">noSketch Engine<\/a>, <a href=\"https:\/\/www.clarin.si\/kontext\/first_form?corpname=gfida20_dedup\">KonText,<\/a><\/li>\n<li>Gigafida v1.1 (non-deduplicated): <a href=\"https:\/\/www.clarin.si\/noske\/run.cgi\/corp_info?corpname=gfida&amp;struct_attr_stats=1\">noSketch Engine<\/a>, <a href=\"https:\/\/www.clarin.si\/kontext\/first_form?corpname=gfida\">KonText,<\/a><\/li>\n<li>Gigafida v1.1 dedup (deduplicated): <a href=\"https:\/\/www.clarin.si\/noske\/run.cgi\/corp_info?corpname=gfida10&amp;struct_attr_stats=1\">noSketch Engine<\/a>, <a href=\"https:\/\/www.clarin.si\/kontext\/first_form?corpname=gfida_dedup\">KonText,<\/a><\/li>\n<li>Gigafida v1.0: <a href=\"https:\/\/www.clarin.si\/noske\/run.cgi\/corp_info?corpname=gfida10&amp;struct_attr_stats=1\">noSketch Engine<\/a>, <a href=\"https:\/\/www.clarin.si\/kontext\/first_form?corpname=gfida10\">KonText,<\/a><\/li>\n<li>FidaPLUS: <a href=\"https:\/\/www.clarin.si\/noske\/run.cgi\/corp_info?corpname=fidaplus&amp;struct_attr_stats=1\">noSketch Engine<\/a>, <a href=\"https:\/\/www.clarin.si\/kontext\/first_form?corpname=fidaplus\">KonText.<\/a><\/li>\n<\/ul>\n<\/div><\/section><\/div><\/p>\n<div class=\"flex_column av_one_third  flex_column_div   avia-builder-el-7  el_after_av_two_third  avia-builder-el-last  column-top-margin\" style='margin-top:36px; margin-bottom:0px; background: #f0f0f0; padding:30px; background-color:#f0f0f0; border-radius:0px; '><section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><h3 class=\"zn_text_box-title zn_text_box-title--style1 text-custom\">LINKS AND CONTACT<\/h3>\n<\/div><\/section><br \/>\n<div  style='height:20px' class='hr hr-invisible   avia-builder-el-9  el_after_av_textblock  el_before_av_textblock '><span class='hr-inner ' ><span class='hr-inner-style'><\/span><\/span><\/div><br \/>\n<section class=\"av_textblock_section \"  itemscope=\"itemscope\" itemtype=\"https:\/\/schema.org\/CreativeWork\" ><div class='avia_textblock  '   itemprop=\"text\" ><p>Andra\u017e Repar<\/p>\n<p>Centre for Language Resources and Technologies, University of Ljubljana<br \/>\nVe\u010dna pot 113,<\/p>\n<p>SI-1000 Ljubljana, Slovenia<\/p>\n<ul>\n<li>e-mail: <a href=\"mailto:andra&#122;&#46;&#114;&#101;&#112;&#97;&#114;&#64;&#99;&#106;&#118;&#116;&#46;&#115;&#105;\">&#97;&#x6e;&#x64;r&#97;&#x7a;&#x2e;r&#101;&#x70;a&#114;&#x40;&#x63;j&#118;&#x74;&#46;&#115;&#x69;<\/a><\/li>\n<\/ul>\n<\/div><\/section><\/p><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":3,"featured_media":0,"parent":985,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_relevanssi_hide_post":"","_relevanssi_hide_content":"","_relevanssi_pin_for_all":"","_relevanssi_pin_keywords":"","_relevanssi_unpin_keywords":"","_relevanssi_related_keywords":"","_relevanssi_related_include_ids":"","_relevanssi_related_exclude_ids":"","_relevanssi_related_no_append":"","_relevanssi_related_not_related":"","_relevanssi_related_posts":"","_relevanssi_noindex_reason":"","inline_featured_image":false,"episode_type":"","audio_file":"","podmotor_file_id":"","podmotor_episode_id":"","cover_image":"","cover_image_id":"","duration":"","filesize":"","filesize_raw":"","date_recorded":"","explicit":"","block":"","itunes_episode_number":"","itunes_title":"","itunes_season_number":"","itunes_episode_type":"","footnotes":""},"class_list":["post-2346","page","type-page","status-publish","hentry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Access to historic versions of the Gigafida corpus - CJVT<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Access to historic versions of the Gigafida corpus - CJVT\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/\" \/>\n<meta property=\"og:site_name\" content=\"CJVT\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/centerzajezikovnevireintehnologije\" \/>\n<meta property=\"article:modified_time\" content=\"2020-11-05T21:30:11+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/infrastructure-support\\\/access-to-historic-versions-of-the-gigafida-corpus\\\/\",\"url\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/infrastructure-support\\\/access-to-historic-versions-of-the-gigafida-corpus\\\/\",\"name\":\"Access to historic versions of the Gigafida corpus - CJVT\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/#website\"},\"datePublished\":\"2020-05-29T15:31:37+00:00\",\"dateModified\":\"2020-11-05T21:30:11+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/infrastructure-support\\\/access-to-historic-versions-of-the-gigafida-corpus\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.cjvt.si\\\/en\\\/infrastructure-support\\\/access-to-historic-versions-of-the-gigafida-corpus\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/infrastructure-support\\\/access-to-historic-versions-of-the-gigafida-corpus\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Infrastructure Support\",\"item\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/infrastructure-support\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Access to historic versions of the Gigafida corpus\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/#website\",\"url\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/\",\"name\":\"CJVT\",\"description\":\"Center za jezikovne vire in tehnologije\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/#organization\",\"name\":\"CJVT\",\"url\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.cjvt.si\\\/wp-content\\\/uploads\\\/2020\\\/06\\\/CJVT-logo-red.jpg\",\"contentUrl\":\"https:\\\/\\\/www.cjvt.si\\\/wp-content\\\/uploads\\\/2020\\\/06\\\/CJVT-logo-red.jpg\",\"width\":1300,\"height\":683,\"caption\":\"CJVT\"},\"image\":{\"@id\":\"https:\\\/\\\/www.cjvt.si\\\/en\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/centerzajezikovnevireintehnologije\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Access to historic versions of the Gigafida corpus - CJVT","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/","og_locale":"en_US","og_type":"article","og_title":"Access to historic versions of the Gigafida corpus - CJVT","og_url":"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/","og_site_name":"CJVT","article_publisher":"https:\/\/www.facebook.com\/centerzajezikovnevireintehnologije","article_modified_time":"2020-11-05T21:30:11+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/","url":"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/","name":"Access to historic versions of the Gigafida corpus - CJVT","isPartOf":{"@id":"https:\/\/www.cjvt.si\/en\/#website"},"datePublished":"2020-05-29T15:31:37+00:00","dateModified":"2020-11-05T21:30:11+00:00","breadcrumb":{"@id":"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.cjvt.si\/en\/infrastructure-support\/access-to-historic-versions-of-the-gigafida-corpus\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.cjvt.si\/en\/"},{"@type":"ListItem","position":2,"name":"Infrastructure Support","item":"https:\/\/www.cjvt.si\/en\/infrastructure-support\/"},{"@type":"ListItem","position":3,"name":"Access to historic versions of the Gigafida corpus"}]},{"@type":"WebSite","@id":"https:\/\/www.cjvt.si\/en\/#website","url":"https:\/\/www.cjvt.si\/en\/","name":"CJVT","description":"Center za jezikovne vire in tehnologije","publisher":{"@id":"https:\/\/www.cjvt.si\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.cjvt.si\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.cjvt.si\/en\/#organization","name":"CJVT","url":"https:\/\/www.cjvt.si\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.cjvt.si\/en\/#\/schema\/logo\/image\/","url":"https:\/\/www.cjvt.si\/wp-content\/uploads\/2020\/06\/CJVT-logo-red.jpg","contentUrl":"https:\/\/www.cjvt.si\/wp-content\/uploads\/2020\/06\/CJVT-logo-red.jpg","width":1300,"height":683,"caption":"CJVT"},"image":{"@id":"https:\/\/www.cjvt.si\/en\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/centerzajezikovnevireintehnologije"]}]}},"_links":{"self":[{"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/pages\/2346","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/comments?post=2346"}],"version-history":[{"count":5,"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/pages\/2346\/revisions"}],"predecessor-version":[{"id":3152,"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/pages\/2346\/revisions\/3152"}],"up":[{"embeddable":true,"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/pages\/985"}],"wp:attachment":[{"href":"https:\/\/www.cjvt.si\/en\/wp-json\/wp\/v2\/media?parent=2346"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}