Adaptation of Large Language Models for Use in Slovenian Medicine
VeMo-Med
In this project, we will extend the use of language models to the field of medicine. Better d.o.o. offers electronic prescribing; automatic speech transcription for discharge letters; identification of active ingredients and medications, dosages, time constraints, and stakeholders (doctors, patients, others); and automatic digitization of forms with recognition of text fields and inputs.
Specific objectives:
- Creation of command-based and dialogue-based training datasets containing dialogues and commands specific to medical applications.
- Development of a large generative Slovenian language model tailored for medicine.
- Creation of a precise Slovenian speech recognizer specialized for the medical field.
- Development of an advanced medical application that uses speech recognition and large language models to help healthcare personnel complete their tasks more quickly and efficiently.
Results:
- D4.1: Training datasets containing dialogues and commands specific to medical applications, comprising at least 10,000 examples (August 2024).
- D4.2: Large generative language model adapted for the field of medicine (February 2025).
- D4.3: Precise Slovenian speech recognizer specialized for the field of medicine (August 2025).
- D4.4: Medical application utilizing the speech recognizer and the large generative language model (February 2026).
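To make the D4.1 deliverable more concrete, the following is a minimal sketch of how command and dialogue examples might be represented, validated, and counted against the 10,000-example threshold. The project description does not specify a data format, so everything here is an assumption made for illustration: the `Turn` and `Example` classes, the role names "user" and "assistant", the rule that a command example has exactly two turns, and the JSON Lines serialization are all hypothetical.

```python
import json
from dataclasses import dataclass, asdict
from typing import List

MIN_EXAMPLES = 10_000  # size threshold stated for D4.1


@dataclass
class Turn:
    role: str  # "user" or "assistant" (hypothetical role names)
    text: str


@dataclass
class Example:
    kind: str  # "command" or "dialogue" (hypothetical labels)
    turns: List[Turn]


def is_valid(example: Example) -> bool:
    """Check the assumed shape: turns alternate user/assistant, starting with user."""
    if example.kind not in ("command", "dialogue"):
        return False
    # Every example needs at least one request and one response, none empty.
    if len(example.turns) < 2 or any(not t.text.strip() for t in example.turns):
        return False
    # Assumption: a command is a single instruction plus a single response.
    if example.kind == "command" and len(example.turns) != 2:
        return False
    expected = ("user", "assistant")
    return all(t.role == expected[i % 2] for i, t in enumerate(example.turns))


def to_jsonl(examples: List[Example]) -> str:
    """Serialize examples as JSON Lines, keeping Slovenian characters unescaped."""
    return "\n".join(json.dumps(asdict(e), ensure_ascii=False) for e in examples)


def meets_size_target(examples: List[Example], minimum: int = MIN_EXAMPLES) -> bool:
    """Return True if the number of valid examples reaches the minimum."""
    return sum(1 for e in examples if is_valid(e)) >= minimum
```

A usage example, with invented placeholder text rather than real project data:

```python
example = Example("command", [
    Turn("user", "Zapiši odpustno pismo."),
    Turn("assistant", "Odpustno pismo je pripravljeno za narek."),
])
print(is_valid(example))
print(to_jsonl([example]))
```

Counting only valid examples toward the threshold is one possible design choice; a real pipeline would likely add deduplication and domain-specific checks as well.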