Jaka Čibej

Research interests

  • Corpus linguistics
  • Natural language processing
  • Research on dictionary use
  • Crowdsourcing in linguistics
  • Computer-mediated communication

Selected publications

1. ČIBEJ, Jaka. Framework for an analysis of Slovene regional language variants on Twitter. In: FIŠER, Darja (ed.), BEIßWENGER, Michael (ed.). Proceedings of the 4th Conference on CMC and Social Media Corpora for the Humanities, 27-28 September 2016, Faculty of Arts, University of Ljubljana, Ljubljana, Slovenia. 1st ed. Ljubljana: Znanstvena založba Filozofske fakultete. 2016, pp. 17-21, illus. [COBISS.SI-ID 62121058]

2. ČIBEJ, Jaka, FIŠER, Darja, ERJAVEC, Tomaž. Normalisation, tokenisation and sentence segmentation of Slovene tweets. In: UTKA, Andrius (ed.). Normalisation and analysis of social media texts (NormSoMe) : [workshop proceedings]. [S. l.: s. n. 2016], pp. 5-10, illus. [COBISS.SI-ID 60917346]

3. ARHAR HOLDT, Špela, ČIBEJ, Jaka, ZWITTER VITEZ, Ana. Value of language-related questions and comments in digital media for lexicographical user research. International journal of lexicography, ISSN 1477-4577. [Online ed.], 20 April 2016, 24 pp., illus., doi: 10.1093/ijl/ecw017. [COBISS.SI-ID 60439138]

4. ČIBEJ, Jaka, ARHAR HOLDT, Špela, ERJAVEC, Tomaž, FIŠER, Darja. Razvoj učne množice za izboljšano označevanje spletnih besedil. In: ERJAVEC, Tomaž (ed.), FIŠER, Darja (ed.). Zbornik konference Jezikovne tehnologije in digitalna humanistika, 29. september - 1. oktober 2016, Filozofska fakulteta, Univerza v Ljubljani, Ljubljana, Slovenija = Proceedings of the Conference on Language Technologies & Digital Humanities, September 29th - October 1st, 2016, Faculty of Arts, University of Ljubljana, Ljubljana, Slovenia. 1st ed. Ljubljana: Znanstvena založba Filozofske fakultete = Ljubljana University Press, Faculty of Arts. 2016, pp. 40-46, illus. [COBISS.SI-ID 62529890]

5. ČIBEJ, Jaka, FIŠER, Darja, KOSEM, Iztok. The role of crowdsourcing in lexicography. In: KOSEM, Iztok (ed.), et al. Electronic lexicography in the 21st century : linking lexical data in the digital age : proceedings of eLex 2015 Conference, 11-13 August 2015, Herstmonceux Castle, United Kingdom. Ljubljana: Trojina, Institute for Applied Slovene Studies; Brighton: Lexical Computing. 2015, pp. 70-83, charts. [COBISS.SI-ID 38744621]

Contact & Links

