Computer Science, Artificial Intelligence, Linguistics and Language
Scopus Publications: 5
Scholar Citations: 49
Scholar h-index: 5
Scholar i10-index: 1
SCOPUS PUBLICATIONS
Methods and Algorithms of POS-tagging of Adverbs and Pronouns in Uzbek Texts. Elov Botir Boltayevich, Dilrabo Bakhronova, Khudayberganov Nizomaddin Uktambay O‘g‘li, Svetlana Umirova, Karimova Zilola Dilmurod Kizi, Mansurova Shahinabonu Najmiddin Qizi. International Conference on Computer Science and Engineering (UBMK), 2025. This article analyzes the problem of automatically identifying adverbs and pronouns in Uzbek texts using linguistic and computational approaches. Because Uzbek has an agglutinative structure, POS tagging poses syntactic as well as morphological difficulties. In particular, the multifunctionality and contextual variability of adverbs and pronouns require precise approaches to their automatic classification. The article compares the performance of traditional rule-based algorithms with that of modern statistical and neural models (Conditional Random Fields and BiLSTM). Methodologically, it covers the formal classification of the models, their functionality, and their experimental foundations. Based on the results, the advantages and weaknesses of each approach are identified, and proposals for an optimal tagging model for Uzbek are put forward. The results are relevant to morphological analysis, machine translation, automatic text indexing, and the overall quality of Uzbek NLP systems.
Statistical POS Tagging Algorithms (HMM, CRF). Odinakhon Jamoldinova, Elov Botir Boltayevich, Maftunakhon Sharipova, Shakhzoda Miralimova, Zilola Yuldashevna Xusainova, Nizomaddin Uktambay O‘g‘li Khudayberganov, Kholmurod Karimov. International Conference on Computer Science and Engineering (UBMK), 2025. This paper presents a comprehensive study of two statistical approaches to part-of-speech (POS) tagging in Uzbek, the Hidden Markov Model (HMM) and the Conditional Random Field (CRF), from both mathematical and empirical perspectives. We first formalize each model: transition and emission probabilities for the HMM, and feature functions with weight parameters for the CRF. Both models were trained on a 205k-token (77821 sentences) CoNLL-U corpus annotated with 15 Uzbek-specific POS tags, using Laplace-smoothed Viterbi decoding for the HMM and an L-BFGS-optimized CRF with Viterbi inference. On the held-out test set, the HMM achieved 82% tagging accuracy and the CRF reached 88%, outperforming the HMM by six percentage points thanks to its richer contextual and linguistic features. The results confirm that statistical models remain robust for agglutinative, low-resource languages like Uzbek, yet are sensitive to feature engineering. We conclude with an error analysis, guidelines for model selection, and perspectives on migrating to neural architectures such as BiLSTM-CRF and BERT-based taggers.
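The HMM side of the abstract above (transition and emission probabilities, Laplace smoothing, Viterbi decoding) can be illustrated with a minimal sketch. This is not the authors' implementation: the three-sentence corpus, the tag names, the `<s>` start symbol, and the function names `train_hmm` and `viterbi` are all assumptions made for illustration, and the Uzbek apostrophe is written as ASCII `'`.

```python
import math
from collections import Counter, defaultdict

def train_hmm(tagged_sentences, alpha=1.0):
    """Estimate Laplace-smoothed log-probabilities from (word, tag) sentences."""
    trans = defaultdict(Counter)  # trans[prev_tag][tag] = count
    emit = defaultdict(Counter)   # emit[tag][word] = count
    vocab = set()
    for sentence in tagged_sentences:
        prev = "<s>"  # hypothetical sentence-start symbol
        for word, tag in sentence:
            trans[prev][tag] += 1
            emit[tag][word] += 1
            vocab.add(word)
            prev = tag
    tags = sorted(emit)
    n_tags = len(tags)
    n_words = len(vocab) + 1  # +1 reserves probability mass for unseen words

    def log_trans(prev, tag):
        total = sum(trans[prev].values())
        return math.log((trans[prev][tag] + alpha) / (total + alpha * n_tags))

    def log_emit(tag, word):
        total = sum(emit[tag].values())
        return math.log((emit[tag][word] + alpha) / (total + alpha * n_words))

    return tags, log_trans, log_emit

def viterbi(words, tags, log_trans, log_emit):
    """Return the highest-scoring tag sequence for a list of words."""
    if not words:
        return []
    # best[i][t] = (log-score of the best path ending in tag t at i, backpointer)
    best = [{t: (log_trans("<s>", t) + log_emit(t, words[0]), None) for t in tags}]
    for i in range(1, len(words)):
        column = {}
        for t in tags:
            score, prev = max((best[i - 1][p][0] + log_trans(p, t), p) for p in tags)
            column[t] = (score + log_emit(t, words[i]), prev)
        best.append(column)
    # Follow the backpointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for i in range(len(words) - 1, 0, -1):
        tag = best[i][tag][1]
        path.append(tag)
    return path[::-1]

# Toy training data (illustrative only, not the corpus described in the abstract).
TOY_CORPUS = [
    [("men", "PRON"), ("kitob", "NOUN"), ("o'qidim", "VERB")],
    [("u", "PRON"), ("tez", "ADV"), ("yugurdi", "VERB")],
    [("men", "PRON"), ("tez", "ADV"), ("keldim", "VERB")],
]
TAGS, LOG_TRANS, LOG_EMIT = train_hmm(TOY_CORPUS)
print(viterbi(["men", "tez", "yugurdi"], TAGS, LOG_TRANS, LOG_EMIT))
```

Working in log space avoids numerical underflow on long sentences, and the add-alpha smoothing lets the decoder assign a tag to a word it never saw in training.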
The Stages of Creation of LMS Model. Jamoldinova Odinaxon Rasulovna, Elov Botir Boltayevich, Primova Mastura Hakim qizi, Xusainova Zilola Yuldashevna, Aloyev Narzillo Raxmatilloyevich, Khudayberganov Nizomaddin Uktambay o’g’li. Lecture Notes in Networks and Systems, 2024.
Semantic Differentiation of Uzbek Homonyms Using the Lesk Algorithm. Elov Botir Boltayevich, Axmedova Xolisxon Ilxomovna, Primova Mastura Hakim Qizi, Khudayberganov Nizomaddin Uktambay O'g'li. UBMK 2023 Proceedings: 8th International Conference on Computer Science and Engineering, 2023. Developing a semantic analyzer for a natural language is considered one of the factors that advance the language. Homonymy is one of the main elements of semantic analysis. Several methods can be used to analyze homonyms semantically, and Lesk's algorithm is one of them. Lesk's algorithm relies on a WordNet for the language. For an input sentence, the weights of the word combinations that contain the homonym are determined through WordNet, and the meaning of the homonym is chosen according to the combinations with the highest weight.
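The idea behind the Lesk-based disambiguation described above can be sketched with the simplified Lesk variant, which scores each sense by the number of words its gloss shares with the context. This is an assumption-laden illustration rather than the paper's method: a small hand-written sense inventory stands in for WordNet, the glosses and context sentences for the Uzbek homonym "ot" are given in English for readability, and the names `simplified_lesk`, `OT_SENSES`, and `STOPWORDS` are hypothetical.

```python
def simplified_lesk(context_words, senses, stopwords=frozenset()):
    """Return (sense, overlap) for the sense whose gloss best overlaps the context."""
    context = {w.lower() for w in context_words} - stopwords
    best_sense, best_overlap = None, -1
    for sense, gloss in senses.items():
        gloss_words = {w.lower() for w in gloss.split()} - stopwords
        overlap = len(context & gloss_words)  # the overlap count acts as the weight
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense, best_overlap

# Hypothetical toy inventory standing in for WordNet glosses of "ot".
OT_SENSES = {
    "ot (horse)": "a large animal that people ride or use to pull a cart",
    "ot (name)": "the word by which a person or thing is called",
}
STOPWORDS = frozenset({"a", "the", "or", "to", "by", "is", "that", "which", "his", "in"})

horse_context = "the farmer will ride his animal to the field".split()
name_context = "every person is called by a name".split()
print(simplified_lesk(horse_context, OT_SENSES, STOPWORDS))
print(simplified_lesk(name_context, OT_SENSES, STOPWORDS))
```

Ties are resolved in favor of the sense listed first; a fuller system would typically also lemmatize the words before comparing them, which matters for a heavily suffixed language.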
The Problem of POS Tagging and Stemming for Agglutinative Languages (Turkish, Uyghur, Uzbek Languages). Elov Botir Boltayevich, Eşref Adalı, Khamroeva Shahlo Mirdjonovna, Abdullayeva Oqila Xolmo'minovna, Xusainova Zilola Yuldashevna, Xudayberganov Nizomaddin Uktamboy O'g'li. UBMK 2023 Proceedings: 8th International Conference on Computer Science and Engineering, 2023. The number of possible word forms in agglutinative languages is theoretically unlimited. This, in turn, creates the problem of part-of-speech (POS) tagging of out-of-vocabulary (OOV) words. In agglutinative languages, words are formed by adding suffixes to the stem. Because phonetic harmony and disharmony occur when suffixes are added to the stem, both phonetic and morphological changes must be analyzed. Many NLP tasks require reducing word forms to the stem. This process, called stemming, removes all inflectional affixes from a word and lemmatizes the remainder, and it is considered one of the important tasks of natural language processing (NLP). Stemming is especially important in information retrieval (IR) systems.
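The stemming step described above, removing inflectional suffixes to reach the stem, can be sketched as a greedy longest-suffix stripper. This is a generic illustration and not the stemmer from the paper: the suffix list covers only a few common Uzbek plural, possessive, and case endings, it ignores the phonetic changes the abstract mentions, and the names `SUFFIXES` and `stem` are hypothetical.

```python
# A small, deliberately incomplete list of Uzbek inflectional suffixes:
# plural -lar, possessive -im/-ing/-i/-imiz, and case endings -ni/-ning/-ga/-da/-dan.
SUFFIXES = sorted(
    ["lar", "imiz", "im", "ing", "i", "ning", "ni", "ga", "da", "dan"],
    key=len,
    reverse=True,  # try longer suffixes first
)

def stem(word, min_stem=2):
    """Repeatedly strip the longest matching suffix while the stem stays long enough."""
    word = word.lower()
    stripped = True
    while stripped:
        stripped = False
        for suffix in SUFFIXES:
            if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
                word = word[: -len(suffix)]
                stripped = True
                break  # restart the scan on the shortened word
    return word

for form in ["kitoblarimizdan", "maktabga", "bolani"]:
    print(form, "->", stem(form))
```

A purely rule-based stripper like this can overstem roots that happen to end in a suffix-like string, which is one reason such stemmers are usually paired with a root lexicon or a morphological analyzer.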
RECENT SCHOLAR PUBLICATIONS
Methods and Algorithms of POS-tagging of Adverbs and Pronouns in Uzbek Texts. EB Boltayevich, D Bakhronova, KNU O‘G‘Li, S Umirova, KZD Kizi, ... 2025 10th International Conference on Computer Science and Engineering (UBMK …, 2025.
Statistical POS Tagging Algorithms (HMM, CRF). O Jamoldinova, EB Boltayevich, M Sharipova, S Miralimova, ... 2025 10th International Conference on Computer Science and Engineering (UBMK …, 2025.
RAVISH SOʻZ TURKUMINI GRAMMATIK POS TEGLASH ALGORITMLARI: NAZARIY YONDASHUV, AMALIY TATBIQ VA KELGUSIDAGI YOʻNALISHLAR. N Xudayberganov, Z Karimova. COMPUTER LINGUISTICS: PROBLEMS, SOLUTIONS, PROSPECTS 1 (1), 2025.
O‘ZBEK TILIDAGI SO‘ZLARNI POS TEGLASHNING ZAMONAVIY YONDASHUVLARI. N Xudayberganov. COMPUTER LINGUISTICS: PROBLEMS, SOLUTIONS, PROSPECTS 1 (1), 2025.
O‘ZBEK TILI MATNLARINI NAIVE BAYES USULI ASOSIDA SENTIMENT TAHLIL QILISH. B Elov, A Abdullayev, N Xudayberganov. DIGITAL TRANSFORMATION AND ARTIFICIAL INTELLIGENCE 3 (2), 153-159, 2025. Citations: 1
O‘ZBEK TILI KORPUSI MATNLARI ASOSIDA TIL MODELLARINI YARATISH. E Botir, A Abdulla, X Nizomaddin. «CONTEMPORARY TECHNOLOGIES OF COMPUTATIONAL LINGUISTICS» 2 (22.04), 344-353, 2024.
O‘zbek tili korpusi matnlarini pos teglash usullari. B Elov, N Xudayberganov. Computer Linguistics: problems, solutions, prospects 1 (1), 2024. Citations: 9
O‘zbek tili korpusiga morfologik ishlov berish. N Xudayberganov. Computer linguistics: problems, solutions, prospects 1 (1), 2024. Citations: 1
The Stages of Creation of LMS Model. JO Rasulovna, EB Boltayevich, PM Hakim qizi, XZ Yuldashevna, ... International Congress on Information and Communication Technology, 433-443, 2024.
Fake News Classification Using Morphological Tag and N-Grams. B Elov, N Khudayberganov, Z Khusainova. EasyChair, 2023.
Agglutinativ tillar uchun pos teglash va stemming masalasi (turk, uyg‘ur, o‘zbek tillari misolida). B Elov, S Hamroyeva, O Abdullayeva, Z Husainova, N Xudayberganov. Uzbekistan: Language and Culture 2 (2), 2023. Citations: 3
Semantic Differentiation of Uzbek Homonyms Using the Lesk Algorithm. EB Boltayevich, AX Ilxomovna, PMH Qizi, KN Uktambay O'g'li. 2023 8th International Conference on Computer Science and Engineering (UBMK …, 2023. Citations: 3
The problem of POS tagging and stemming for agglutinative languages (Turkish, Uyghur, Uzbek languages). EB Boltayevich, E Adalı, KS Mirdjonovna, AO Xolmo'Minovna, ... 2023 8th International Conference on Computer Science and Engineering (UBMK …, 2023. Citations: 12
OʻZBEK TILI UCHUN STEMMERNI ISHLAB CHIQISH. N Xudayberganov, N Aloyev. COMPUTER LINGUISTICS: PROBLEMS, SOLUTIONS, PROSPECTS 1 (1), 2023.
O‘zbek, turk va uyg‘ur tillarida POS teglash va stemming. B Elov, S Hamroyeva, O Abdullayeva, Z Xusainova, N Xudayberganov. Uzbekistan: Language and Culture 1 (1), 2023. Citations: 5
Pos tagging of Uzbek texts using hidden Markov models (HMM) and Viterbi algorithm. B Elov, H Sh, N Xudayberganov, U Yodgorov, A Yuldashev. O‘zMU xabarlari. Mirzo Ulug‘bek nomidagi O‘zbekiston Milliy universiteti …, 2023. Citations: 4
A STATISTICAL INDEX CALCULATED USING THE TF-IDF FOR TEXTS IN THE UZBEK LANGUAGE CORPUS. B Elov, Z Xusainova, N Xudayberganov. Science and Innovation 1 (8), 1774-1785, 2022.
PYTHON DASTURLASH TILIDA IMLOVIY TAHRIR QILISH DASTURLARI HAQIDA. N Xudayberganov. COMPUTER LINGUISTICS: PROBLEMS, SOLUTIONS, PROSPECTS 1 (1), 2022.
Tabiiy tilni qayta ishlashda Bag of Words algoritmidan foydalanish. B Elov, Z Xusainova, N Xudayberganov. Ozbekiston: til va madaniyat (Amaliy filologiya) 5 (4), 2022. Citations: 5
Tabiiy tilni qayta ishlashda bag of words algoritmidan foydalanish. N Xudayberganov, S Hasanov. Uzbekistan language and culture 5 (2), 69-83, 2022.
MOST CITED SCHOLAR PUBLICATIONS
The problem of POS tagging and stemming for agglutinative languages (Turkish, Uyghur, Uzbek languages). EB Boltayevich, E Adalı, KS Mirdjonovna, AO Xolmo'Minovna, ... 2023 8th International Conference on Computer Science and Engineering (UBMK …, 2023. Citations: 12
O‘zbek tili korpusi matnlarini pos teglash usullari. B Elov, N Xudayberganov. Computer Linguistics: problems, solutions, prospects 1 (1), 2024. Citations: 9
O‘zbek tili korpusi matnlari uchun TF-IDF statistik ko‘rsatkichni hisoblash. B Elov, Z Xusainova, N Xudayberganov. Science and innovation 1 (B8), 1774-1785, 2022. Citations: 6
O‘zbek, turk va uyg‘ur tillarida POS teglash va stemming. B Elov, S Hamroyeva, O Abdullayeva, Z Xusainova, N Xudayberganov. Uzbekistan: Language and Culture 1 (1), 2023. Citations: 5
Tabiiy tilni qayta ishlashda Bag of Words algoritmidan foydalanish. B Elov, Z Xusainova, N Xudayberganov. Ozbekiston: til va madaniyat (Amaliy filologiya) 5 (4), 2022. Citations: 5
Pos tagging of Uzbek texts using hidden Markov models (HMM) and Viterbi algorithm. B Elov, H Sh, N Xudayberganov, U Yodgorov, A Yuldashev. O‘zMU xabarlari. Mirzo Ulug‘bek nomidagi O‘zbekiston Milliy universiteti …, 2023. Citations: 4
Agglutinativ tillar uchun pos teglash va stemming masalasi (turk, uyg‘ur, o‘zbek tillari misolida). B Elov, S Hamroyeva, O Abdullayeva, Z Husainova, N Xudayberganov. Uzbekistan: Language and Culture 2 (2), 2023. Citations: 3
Semantic Differentiation of Uzbek Homonyms Using the Lesk Algorithm. EB Boltayevich, AX Ilxomovna, PMH Qizi, KN Uktambay O'g'li. 2023 8th International Conference on Computer Science and Engineering (UBMK …, 2023. Citations: 3
O‘ZBEK TILI MATNLARINI NAIVE BAYES USULI ASOSIDA SENTIMENT TAHLIL QILISH. B Elov, A Abdullayev, N Xudayberganov. DIGITAL TRANSFORMATION AND ARTIFICIAL INTELLIGENCE 3 (2), 153-159, 2025. Citations: 1
O‘zbek tili korpusiga morfologik ishlov berish. N Xudayberganov. Computer linguistics: problems, solutions, prospects 1 (1), 2024. Citations: 1