1,2,4Department of Information Technology, Faculty of Computing & Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
3 Faculty of Education, University of Jeddah, Jeddah, Saudi Arabia
5,6Department of Computer Science, Faculty of Computing & Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Article Publishing History
Received: 09/05/2019
Accepted After Revision: 25/06/2019
This paper presents an intelligent model to read, understand and translate Arabic text content into Arabic signs. The proposed model includes four main phases, preprocessing, modeling language, translation, and sign generation phases. In the preprocessing phase, the corpora and the stop words will be employed. The language model includes morphological, lexical and syntax, and semantic analysis. This is in addition to stem, root extraction with ontological support, and number indication will be involved. Consequently, we have different features that represent the analyzed Arabic text (words’ meanings, words ordering, syntactic features, number features …). Therefore, the generation phase takes place to generate the equivalent Arabic signs using the signer model. Accordingly, the deaf and hearing-impaired people are showing the generated stream of Arabic signs using video or 3D Avatar. The proposed solution uses programming corpus written in Arabic language, so the generated dictionary/lexicon has limited set of Arabic words with their meaning. After getting the content data from the course, the language model analyzes and understands the content and store it into deep structure or internal representation, consequently, the system will generate the Arabic stream signs based on the signer model.
Arabic Nlp, Corpus, Deaf, Hearing-Impaired, Machine Translation
Al-Barhamtoshy H. M, Abuzinadah N. E, Nabawi A, Himdi T. F, Malibari A. A, Allinjawi A. A. Development of an Intelligent Arabic Text Translation Model for Deaf Students Using State of the Art Information Technology. Biosc.Biotech.Res.Comm. 2019;12(2).
Al-Barhamtoshy H. M, Abuzinadah N. E, Nabawi A, Himdi T. F, Malibari A. A, Allinjawi A. A. Development of an Intelligent Arabic Text Translation Model for Deaf Students Using State of the Art Information Technology. Biosc.Biotech.Res.Comm. 2019;12(2). Available from: https://bit.ly/2QMOMz1
Copyright © Al-Barhamtoshy et al., This is an open access article distributed under the terms of the Creative Commons Attribution License (CC-BY) https://creativecommns.org/licenses/by/4.0/, which permits unrestricted use distribution and reproduction in any medium, provide the original author and source are credited.
Introduction
Now a day, disability, deaf-blind, and hearing-impaired need a communication way to deal with teaching and communicating the community and society. The communication approach is to find the best way to speak, understand, visualize and deal what they are need. Sign language is the most effective way to communicate with those people. Sign language used for deaf and hearing-impaired people in order to facilitate the communication with community people (Abuzinadah, et al., 2017). A sign language recognition based on deep learning methodology from video sequences is proposed in (Konstantinidis, et al., 2018) and (Kong & Ranganath, 2008). However, face expression, two hands gestures and body language have been introduced to adopt the recognition and generation tasks with deaf and hearing-impaired (Lim, et al., 2016).
Other challenge point to work with the sign language is the dataset needed to work with sign recognition and sign generation. Some of this dataset uses only one hand (right hand), and others are signed with both hands, (Konstantinidis, et al., 2018) (Ronchetti, et al., 2016). Hand gestures provide important information to work with sign languages such as ASL, FSL, DSL, RSL and also Arabic Sign language (ArSL) (Patel & Ambekar, 2017) (Nikam & Ambekar, 2016) .
A more recent development in the academic, pedagogic and societal is a demand for legal recognition to emerge the hearing-impaired people into society (Harris, 2018). Therefore, sign languages are faced number of challenges related to socio-linguistic aspects for signing with the community (Quer & Steinbach, 2019).
Video Apprach
A bilingual corpus for Arabic sign language has been created in (ElMaazouzi, et al., 2016) concerned with developing a corpus for sign language that meets communication needs of the Arab deaf community. An alternative approach is presented to use video technology in Insign project to answer an online survey from 84 deaf over 22 diverse countries (Napier, et al., 2018). A deep learning method used to analyze and recognize video in sign language (Konstantinidis, et al., 2018) based on framework. In such work, video is examined and analyzed using images flow extracted features, and skeletal movement features (body, hand and face), in such case, each signed video corresponds separate word.
3D Avatar Approach
An assistant education model with 3D avatar is presented in (Ulisses, et al., 2018) to incorporate Virtual Sign as a translator between sign language and oral language.In last decade, an example-based approach is implemented as a translator in American Sign language (ASL) with a deaf people (Morrissey & Way, 2005). In addition, leap motion can be converted and therefore, translated into text using assistant device such as tablet machine (Escudeiro, et al., 2017).
Simov (Simoy, et al., 2016) presented semantic-based approach to translate from Bulgarian and English in information technology domain. This approach is used for answering questions after employing of morphological analysis for Bulgarian nature. Some troubles and difficulties such as understanding the concept and the meaning of programming terminologies and related definitions need to additional elaboration. The idea behind that is, lexical units that have similar meanings should appear in similar context. Therefore, official repository for semantic information for course contents and terminologies definitions will be involved.
Machine Translation Framework
A simple prototype framework for translate Arabic text content to Arabic sign language using video or 3D avatar will be described in this section. The framework of the proposed model accepts any Arabic text content and analyzes and understands such content and store such meaning into a deep structure or internal representation. The architecture of the proposed framework is illustrated in Fig 1. The proposed solution of ArSL signer based on rule and lexical-based approaches. It uses morphology phase and syntax phase in dependency trees, without any sophisticated methods (machine learning algorithms).
![]() |
Figure 1: High Level of the Arabic Sign Langauge (ArSL) Translator Framework |
Preprocessing Model
The proposed solution starts with retrieving Arabic text content, cleaning and preprocessing text data with analyzing, understanding with ways of language processing algorithms. Therefore, the “Arabic text content” must be aggressively filtered and corrected from obvious typos. So, syntactic analysis and semantic analysis will be explored. So, many NLP procedures will be involved, i.e.; word tokenization, stemming, POS tagging, word meaning, topic model or ontology model. Therefore, in this chapter various fundamentals in Arabic NLP coupled with other state of the art techniques will be covered.Consequently, the “Arabic text content” includes “Arabic stop words”, we need to remove these stop words before Arabic text analysis and understand. This in addition to make some corrections of the “Arabic text content”. After removing “stop words”, the language model takes place to tokenize the “Arabic text content” stream before the POS tagger model.
Language model
The objective the language model is to analyze and understand the content text in order to extract the relevant information and create the internal features’ representation of the analyzed text. This internal representation is very important to generate the equivalent of the signs of the Arabic sign language (ArSL). In this part, we are going to cover several procedures to process the Arabic text content along with examining content analysis. Therefore, ways of techniques to analyze and understand the Arabic content: text cleaning and text normalization, stop words removing, word tokenization, word stemming, and POS tagging. Consequently, the task of the language model includes several phases, tokenizer, morphological, syntactic and semantic analysis phases. Therefore, the language model uses a well-defined dictionary, lexicon and grammatical rules. The output results of the language model are used to achieve the internal deep structure (internal representation).
Phrase treatment and Arabic Grammar
The first step includes two entirely steps: sentence/phrase splitting and word derivations with affixes processing. Any Arabic text content will be segmented into separate sentences/ phrases (chunks). The Arabic phrases are classified into noun phrases (NP) or verb phrases (VP). We can extract the noun phrases from the Arabic text content, and consequently, identify noun, proper noun, and extra noun phrase from the text.
The Arabic grammar is flexible with phrase and sentence structure in word ordering. The simple Arabic phrase has three forms (read from right to left):
- (فهم الطالب البرنامج)
- (الطالب فهم البرنامج) < Verb: فهم > < Subject: الطالب>
- (البرنامج فهمه الطالب)
In English, the phrase might be (Student understood the program) =>