Husni A. Al-Muhtseb

@admissions.kfupm.edu.sa

Information and Computer Science Department
King Fahd University of Petroleum and Minerals

https://researchid.co/muhtaseb

RESEARCH, TEACHING, or OTHER INTERESTS

Computer Science, Artificial Intelligence, Education, Computer Science Applications

Scopus Publications

1083

Scholar Citations

Scholar h-index

Scholar i10-index

Scopus Publications

BPTI: Bilingual Printed Text Images Dataset for Recognition Purposes
Mohammed Yahia and Husni Al-Muhtaseb
Zarqa University
Datasets of text images are important for optical text recognition systems. Such datasets can be used to enhance performance and recognition rates. In this research work, we present a bilingual dataset consists of Arabic/English text images to address the lack of availability of bilingual text databases. The presented dataset consists of 97812 text images, which are categorized into two groups; Scanned page and digitized line images. Images of the two forms are written with 10 fonts and four sizes, and prepared/scanned with four dpi resolutions. The dataset preparation process includes text collection, text editing, image construction, and image processing. The dataset can be used in optical text recognition, optical font recognition, language identification, and segmentation. Different text recognition and language identification experiments have been conducted using images of the dataset and Hidden Markov Model (HMM) classifier. For the digitized images recognition experiments, the best-achieved recognition correctness is 99.01% and the best accuracy is 99.01%. The font that has the highest recognition rates was Tahoma. For the scanned images recognition experiments, Tahoma has also shown the highest performance with 97.86% for correctness and 97.73% for accuracy. For the language identification experiments, Tahoma has shown the performance with 99.98% for word-language identification rate.

Sport-fanaticism lexicons for sentiment analysis in Arabic social text
Mohammed Alqmase and Husni Al-Muhtaseb
Springer Science and Business Media LLC

Arabic Keyphrase Extraction: Enhancing Deep Learning Models with Pre-trained Contextual Embedding and External Features

Sports-fanaticism formalism for sentiment analysis in Arabic text
Mohammed Alqmase, Husni Al-Muhtaseb, and Habib Rabaan
Springer Science and Business Media LLC

Arabic phonemes recognition using hybrid LVQ/HMM model for continuous speech recognition
Khalid M. O. Nahar, Mohammed Abu Shquier, Wasfi G. Al-Khatib, Husni Al-Muhtaseb, and Moustafa Elshafei
Springer Science and Business Media LLC

Arabic Phonemes Transcription Using Learning Vector Quantization: 'Towards the Development of Fast Quranic Text Transcription'
Khalid M.O. Nahar, Wasfi G. Al-Khatib, Moustafa Elshafei, Husni Al-Muhtaseb, and Mansour M. Alghamdi
IEEE
In this paper, we investigated the use of Learning Vector Quantization (LVQ) for phoneme transcription in Arabic speech recognition systems. We used Arabic speech corpus of TV news clips. Then, we employed feature vectors, which embed the frame neighboring correlation information between adjacent phonemes to replace the traditional trip hones models. Next, we generated the phonemes codebooks using the K-means splitting algorithm. After that, we trained the generated codebooks using the LVQ algorithm. When using the trained LVQ codebooks in utterance phoneme transcription of an open vocabulary test corpus, the phoneme recognition rate was 72% without the use of any added phoneme big rams or HMM models. The results of this research if improved could be used to serve the holy Quran text transcription without any phonemes big rams (phonemes language model). This would increase the speed of the Quranic speech to text transcription and creates the infrastructure of suitable high speed automatic identification system of Quranic sounds recognition and translation.

Arabic phonemes transcription using data driven approach

Data-driven Arabic phoneme recognition using varying number of HMM states
K. M. O. Nahar, W. G. Al-Khatib, M. Elshafei, H. Al-Muhtaseb, and M. M. Alghamdi
IEEE
Continuous Arabic Speech Recognition, appears in many real life applications. Its speed, accuracy and improvement are highly dependent on the accuracy of the language phonemes set. The main goal of this research is to recognize and transcribe the Arabic phonemes based on a data-driven approach. We built a phoneme recognizer based on a data driven approach using HTK tool. Different numbers of Gaussian mixtures with different numbers of HMM states were used in modeling the Arabic phonemes in order to reach the best configuration. The corpus used consists of about 4000 files, representing 5 recorded hours of modern standard Arabic of TV-News. The maximum phoneme recognition accuracy reached was 56.79%. This result is very encouraging and shows the viability of our approach as compared to using a fixed number of HMM states.

Statistical analysis of Arabic phonemes used in Arabic speech recognition
Khalid M. O Nahar, Mustafa Elshafei, Wasfi G. Al-Khatib, Husni Al-Muhtaseb, and Mansour M. Alghamdi
Springer Berlin Heidelberg

Within-word pronunciation variation modeling for Arabic ASRs: A direct data-driven approach
Dia AbuZeina, Wasfi Al-Khatib, Moustafa Elshafei, and Husni Al-Muhtaseb
Springer Science and Business Media LLC

Arabic optical character recognition: Recent trends and future directions
Husni Al-Muhtaseb and Rami Qahwaji
IGI Global
Arabic text recognition is receiving more attentions from both Arabic and non-Arabic-speaking researchers. This chapter provides a general overview of the state-of-the-art in Arabic Optical Character Recognition (OCR) and the associated text recognition technology. It also investigates the characteristics of the Arabic language with respect to OCR and discusses related research on the different phases of text recognition including: pre-processing and text segmentation, common feature extraction techniques, classification methods and post-processing techniques. Moreover, the chapter discusses the available databases for Arabic OCR research and lists the available commercial Software. Finally, it explores the challenges related to Arabic OCR and discusses possible future trends.

Toward enhanced Arabic speech recognition using part of speech tagging
Dia AbuZeina, Wasfi Al-Khatib, Moustafa Elshafei, and Husni Al-Muhtaseb
Springer Science and Business Media LLC

Cross-word Arabic pronunciation variation modeling for speech recognition
Dia AbuZeina, Wasfi Al-Khatib, Moustafa Elshafei, and Husni Al-Muhtaseb
Springer Science and Business Media LLC

Offline handwritten Arabic cursive text recognition using Hidden Markov Models and re-ranking
Jawad H AlKhateeb, Jinchang Ren, Jianmin Jiang, and Husni Al-Muhtaseb
Elsevier BV

Generation of arabic phonetic dictionaries for speech recognition
Mohamed Ali, Moustafa Elshafei, Mansour Al-Ghamdi, Husni Al-Muhtaseb, and Atef Al-Najjar
IEEE
Phonetic dictionaries are essential components of large-vocabulary natural language speaker-independent speech recognition systems. This paper presents a rule-based technique to generate Arabic phonetic dictionaries for a large vocabulary speech recognition system. The system used classic Arabic pronunciation rules, common pronunciation rules of Modern Standard Arabic, as well as morphologically driven rules. The paper gives in detail an explanation of these rules as well as their formal mathematical presentation. The rules were used to generate a dictionary for a 5.4 hours corpus of broadcast news. The phonetic dictionary contains 23,841 definitions corresponding to about 14232 words. The generated dictionary was evaluated on an actual Arabic speech recognition system. The pronunciation rules and the phone set were validated by test cases. The Arabic speech recognition system achieves word error rate of %11.71 for fully diacritized transcription of about 1.1 hours of Arabic broadcast news.

Recognition of off-line printed Arabic text using Hidden Markov Models
Husni A. Al-Muhtaseb, Sabri A. Mahmoud, and Rami S. Qahwaji
Elsevier BV

Arabic broadcast news transcription system
Mansour Alghamdi, Moustafa Elshafei, and Husni Al-Muhtaseb
Springer Science and Business Media LLC

Techniques for high quality Arabic speech synthesis
M Elshafei
Elsevier BV

New fault models and efficient BIST algorithms for dual-port memories
A.A. Amin, M.Y. Osman, R.E. Abdel-Aal, and H. Al-Muhtaseb
Institute of Electrical and Electronics Engineers (IEEE)
The testability problem of dual-port memories is investigated. A functional model is defined, and architectural modifications to enhance the testability of such chips are described. These modifications allow multiple access of memory cells for increased test speed with minimal overhead on both silicon area and device performance. New fault models are proposed, and efficient O(/spl radic/n) test algorithms are described for both the memory array and the address decoders. The new fault models account for the simultaneous dual-access property of the device. In addition to the classical static neighborhood pattern-sensitive faults, the array test algorithm covers a new class of pattern sensitive faults, duplex dynamic neighborhood pattern-sensitive faults (DDNPSF).

KHABEER: An object-oriented arabic expert system shell

RECENT SCHOLAR PUBLICATIONS

BPTI: Bilingual Printed Text Images Dataset for Recognition Purposes
M Yahia, H Al-Muhtaseb
The International Arab Journal of Information Technology 20 (4) 2023

Arabic Keyphrase Extraction: Enhancing Deep Learning Models with Pre-trained Contextual Embedding and External Features
R Alharbi, H Al-Muhtasab
Proceedings of the Seventh Arabic Natural Language Processing Workshop 2022

Sport-fanaticism lexicons for sentiment analysis in Arabic social text
M Alqmase, H Al-Muhtaseb
Social Network Analysis and Mining 12 (1), 56 2022

BPTI: Bilingual Printed Text Images Dataset for Recognition Purposes
M Yahia, H Al-Muhtaseb
Social Science Research Network (SSRN). https://papers.ssrn.com/sol3/papers 2022

Sports-fanaticism formalism for sentiment analysis in Arabic text
M Alqmase, H Al-Muhtaseb, H Rabaan
Social Network Analysis and Mining 11 (1), 52 2021

Recognition of Printed Arabic-English Text
MHN Yahia
PQDT-Global 2018

Arabic Dataset for Automatic Keyphrase Extraction
M Al Logmani, H Al Muhtaseb
Seventh International Conference on Computer Science and Information 2017

Arabic phonemes recognition using hybrid LVQ/HMM model for continuous speech recognition
KMO Nahar, M Abu Shquier, WG Al-Khatib, H Al-Muhtaseb, M Elshafei
International Journal of Speech Technology 19, 495-508 2016

An Arabic corpus to assist in the automatic extraction of key-phrases (in Arabic) مكنز عربي للمساعدة في الاستنباط الآلي للعبارات المفتاحية
M Al Logmani, H Al-Muhtaseb
The 5th international conference on Arabic language, Dubai. المؤتمر الدولي 2016

Modeling the phenomenon of changing word pronunciation resulting from intonation judgements (in Arabic) نمذجة ظاهرة تغير نطق الكلمات الناتج عن أحكام التجويد
M Amro, W Al-Khatib, Elshafei, Moustafa, H Al-Muhtaseb
The 5th international conference on Arabic language, Dubai. المؤتمر الدولي 2016

Post-processing optimization for Arabic optical character recognition (In Arabic) تحسين مرحلة "بعد المعالجة" في نظام التعرف الضوئي الآلي على الكتابة العربية
H Al-Muhtaseb, H Luqman
The 5th international conference on Arabic language, Dubai. المؤتمر الدولي 2016

Automatic Vocalization of Arabic Text
YMS Khraishi
PQDT-Global 2016

Towards A Minimal Phonetic Set for Quran Recitiation
HA Al-Muhtaseb, SA Bellegdi
International Journal on Islamic an Al-Muhtaseb, HA, & Bellegdi, SA (2016 2016

Arabic Phonemes Transcription using Data Driven Approach.
K Nahar, H Al-Muhtaseb, W Al-Khatib, M Elshafei, M Alghamdi
International Arab Journal of Information Technology (IAJIT) 12 (3) 2015

Automatic rule based phonetic transcription and syllabification for quranic text
SA Bellegdi, HA Al-Muhtaseb
International Journal on Islamic Applications in Computer Science And 2015

System and method for decoding speech
DEM Abuzeina, M Elshafei, H Al-Muhtaseb, WG Al-Khatib
US Patent App. 13/597,162 2014

Arabic Phonemes Transcription Using Learning Vector Quantization:" Towards the Development of Fast Quranic Text Transcription"
KMO Nahar, WG Al-Khatib, M Elshafei, H Al-Muhtaseb, MM Alghamdi
2013 Taibah University International Conference on Advances in Information 2013

Method of generating a transliteration font
S Awaida, H Al-Muhtaseb
US Patent 8,438,008 2013

Data-driven Arabic phoneme recognition using varying number of HMM states
KMO Nahar, WG Al-Khatib, M Elshafei, H Al-Muhtaseb, MM Alghamdi
2013 1st International Conference on Communications, Signal Processing, and 2013

Cross-Word Arabic Pronunciation Variation Modeling Using Part of Speech Tagging
D AbuZeina, H Al-Muhtaseb, M Elshafei
Modern Speech Recognition Approaches with Case Studies 2012

MOST CITED SCHOLAR PUBLICATIONS

Offline handwritten Arabic cursive text recognition using Hidden Markov Models and re-ranking
JH AlKhateeb, J Ren, J Jiang, H Al-Muhtaseb
Pattern Recognition Letters 32 (8), 1081-1088 2011
Citations: 170

Recognition of off-line printed Arabic text using Hidden Markov Models
HA Al-Muhtaseb, SA Mahmoud, RS Qahwaji
Signal processing 88 (12), 2902-2912 2008
Citations: 128

Statistical methods for automatic diacritization of Arabic text
M Elshafei, H Al-Muhtaseb, M Alghamdi
The Saudi 18th National Computer Conference. Riyadh 18, 301-306 2006
Citations: 93

Techniques for high quality Arabic speech synthesis
M Elshafei, H Al-Muhtaseb, M Al-Ghamdi
Information sciences 140 (3-4), 255-267 2002
Citations: 76

Arabic broadcast news transcription system
M Alghamdi, M Elshafei, H Al-Muhtaseb
International Journal of Speech Technology 10, 183-195 2007
Citations: 60

Machine Generation of Arabic Diacritical Marks.
M Elshafei, H Al-Muhtaseb, M Al-Ghamdi
MLMTA 2006, 128-133 2006
Citations: 42

Generation of Arabic phonetic dictionaries for speech recognition
M Ali, M Elshafei, M Al-Ghamdi, H Al-Muhtaseb, A Al-Najjar
2008 International conference on innovations in information technology, 59-63 2008
Citations: 40

System and method for decoding speech
DEM Abuzeina, M Elshafei, H Al-Muhtaseb, WG Al-Khatib
US Patent App. 13/597,162 2014
Citations: 39

Arabic phonetic dictionaries for speech recognition
M Ali, M Elshafei, M Al-Ghamdi, H Al-Muhtaseb
Journal of Information Technology Research (JITR) 2 (4), 67-80 2009
Citations: 36

Cross-word Arabic pronunciation variation modeling for speech recognition
D AbuZeina, W Al-Khatib, M Elshafei, H Al-Muhtaseb
International Journal of Speech Technology 14, 227-236 2011
Citations: 32

Automatic Arabic text image optical character recognition method
HA Al-Muhtaseb, SA Mahmoud, R Qahwaji
US Patent 8,150,160 2012
Citations: 31

Statistical analysis of Arabic phonemes used in Arabic speech recognition
KMO Nahar, M Elshafei, WG Al-Khatib, H Al-Muhtaseb, MM Alghamdi
Neural Information Processing: 19th International Conference, ICONIP 2012 2012
Citations: 26

Some Differences Between Arabic and English: A Step Towards an Arabic Upper Model
H Al-Muhtaseb, C Mellish
The 6th International Conference on Multilingual Computing, Cambridge, UK. 1998
Citations: 25

Techniques for high quality Arabic speech synthesis
H Al-Muhtaseb, M Elshafei, M Al-Ghamdi
Information sciences 140, 255-267 2002
Citations: 21

Speaker-independent natural Arabic speech recognition system
M Elshafei, H Al-Muhtaseb, M Al-Ghamdi
The International Conference on Intelligent Systems 2008
Citations: 20

Sports-fanaticism formalism for sentiment analysis in Arabic text
M Alqmase, H Al-Muhtaseb, H Rabaan
Social Network Analysis and Mining 11 (1), 52 2021
Citations: 19

Within-word pronunciation variation modeling for Arabic ASRs: a direct data-driven approach
D AbuZeina, W Al-Khatib, M Elshafei, H Al-Muhtaseb
International Journal of Speech Technology 15, 65-75 2012
Citations: 18

Arabic Phonemes Transcription using Data Driven Approach.
K Nahar, H Al-Muhtaseb, W Al-Khatib, M Elshafei, M Alghamdi
International Arab Journal of Information Technology (IAJIT) 12 (3) 2015
Citations: 15

Toward enhanced Arabic speech recognition using part of speech tagging
D AbuZeina, W Al-Khatib, M Elshafei, H Al-Muhtaseb
International Journal of Speech Technology 14, 419-426 2011
Citations: 15