Jaka Čibej
RESEARCH INTERESTS
- corpus linguistics
- natural language processing
- research on dictionary use
- crowdsourcing in linguistics
- computer-mediated communication
PROJECTS
European projects
- eNetCollect – European Network for Combining Language Learning with Crowdsourcing Techniques
- ELEXIS – European Lexicographic Infrastructure
National Research Agency projects
- Resources, Tools and Methods for the Research of Nonstandard Internet Slovene (2014 – 2017)
- Slovene scientific texts: resources and description (2016 – 2018)
- Collocations as a Basis for Language Description: Semantic and Temporal Perspectives (2017 – 2020)
- New grammar of contemporary standard Slovene: sources and methods (2017 – 2020)
- Resources, methods, and tools for the understanding, identification, and classification of various forms of socially unacceptable discourse in the information society – FRENK (2017 – 2020)
Other projects
SELECTED PUBLICATIONS
- ČIBEJ, Jaka. Framework for an analysis of Slovene regional language variants on Twitter. In: FIŠER, Darja (ed.), BEIßWENGER, Michael (ed.). Proceedings of the 4th Conference on CMC and Social Media Corpora for the Humanities, 27-28 September 2016, Faculty of Arts, University of Ljubljana, Ljubljana, Slovenia. 1st ed. Ljubljana: Znanstvena založba Filozofske fakultete. 2016, pp. 17-21, illus. http://nl.ijs.si/janes/wp-content/uploads/2016/09/CMC-2016_Cibej_Framework-Analysis-of-Slovene-Regional-Language-Variants-Twitter.pdf. [COBISS.SI-ID 62121058]
- ČIBEJ, Jaka, FIŠER, Darja, ERJAVEC, Tomaž. Normalisation, tokenisation and sentence segmentation of Slovene tweets. In: UTKA, Andrius (ed.). Normalisation and analysis of social media texts (NormSoMe) : [workshop proceedings]. [S. l.: s. n. 2016], pp. 5-10, illus. http://www.lrec-conf.org/proceedings/lrec2016/index.html. [COBISS.SI-ID 60917346]
- ARHAR HOLDT, Špela, ČIBEJ, Jaka, ZWITTER VITEZ, Ana. Value of language-related questions and comments in digital media for lexicographical user research. International journal of lexicography, ISSN 1477-4577. [Online ed.], 20 April 2016, 24 pp., illus. http://ijl.oxfordjournals.org/content/early/2016/04/20/ijl.ecw017.full.pdf?keytype=ref&ijkey=SP5Yb4PHvfykRkk, doi: 10.1093/ijl/ecw017. [COBISS.SI-ID 60439138]
- ČIBEJ, Jaka, ARHAR HOLDT, Špela, ERJAVEC, Tomaž, FIŠER, Darja. Razvoj učne množice za izboljšano označevanje spletnih besedil [Developing a training set for improved annotation of web texts]. In: ERJAVEC, Tomaž (ed.), FIŠER, Darja (ed.). Zbornik konference Jezikovne tehnologije in digitalna humanistika, 29. september – 1. oktober 2016, Filozofska fakulteta, Univerza v Ljubljani, Ljubljana, Slovenija = Proceedings of the Conference on Language Technologies & Digital Humanities, September 29th – October 1st, 2016, Faculty of Arts, University of Ljubljana, Ljubljana, Slovenia. 1st ed. Ljubljana: Znanstvena založba Filozofske fakultete = Ljubljana University Press, Faculty of Arts. 2016, pp. 40-46, illus. http://www.sdjt.si/wp/wp-content/uploads/2016/09/JTDH-2016_Cibej-et-al_Razvoj-ucne-mnozice.pdf. [COBISS.SI-ID 62529890]
- ČIBEJ, Jaka, FIŠER, Darja, KOSEM, Iztok. The role of crowdsourcing in lexicography. In: KOSEM, Iztok (ed.), et al. Electronic lexicography in the 21st century : linking lexical data in the digital age : proceedings of eLex 2015 Conference, 11-13 August 2015, Herstmonceux Castle, United Kingdom. Ljubljana: Trojina, Institute for Applied Slovene Studies; Brighton: Lexical Computing. 2015, pp. 70-83, charts. https://elex.link/elex2015/proceedings/eLex_2015_05_Cibej+Fiser+Kosem.pdf. [COBISS.SI-ID 38744621]