A compromise Arabic-Kazakh coded character processing method based on the OpenType font format
Dong, J (Dong, Jun); Jiang, TH (Jiang, Tonghai); Cheng, L (Cheng, Li); Anwar, A (Anwar, Azmat); Yang, Y (Yang, Yong)
刊名COMPUTER STANDARDS & INTERFACES
2018
卷号55期号:1页码:1-7
关键词Kazakh Coded Character Unicode Opentype
ISSN号0920-5489
DOI10.1016/j.csi.2017.02.005
英文摘要

Information systems for Arabic-Kazakh processing must handle the editing and display problems caused by four special vowels: (sic), (sic), (sic) and (sic) The current solution uses combinations of four alternative vowels ((sic), (sic), (sic), and (sic)) with the character (sic) to represent these four special vowels. However, this approach relies on deliberate spelling errors and can cause computer programs to be unable to semantically distinguish the alternative vowels from the original vowels. Moreover, this causes problems in Arabic-Kazakh text-processing applications such as text sorting, script conversion and speech synthesis. We propose a compromise method in which the four special vowels are represented by combinations of themselves with the character (sic) and the related editing and display problems are handled using an OpenType font. The relevant glyph layout features in the OpenType font format are compatible with the proposed compromise method. Results from the sorting and classification of 10,000 randomly selected common Arabic-Kazakh words demonstrate that the new method successfully avoids problems caused by letter replacement, including text sorting errors in 2843 of the tested words and ambiguities with the characters (sic), (sic), (sic), and (sic) in 3960 of the words.

WOS记录号WOS:000419411300001
内容类型期刊论文
源URL[http://ir.xjipc.cas.cn/handle/365002/5113]  
专题新疆理化技术研究所_多语种信息技术研究室
通讯作者Jiang, TH (Jiang, Tonghai)
作者单位1.Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
3.Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
4.Xinjiang Normal Univ, Coll Comp Sci, Urumqi 830054, Peoples R China
推荐引用方式
GB/T 7714
Dong, J ,Jiang, TH ,Cheng, L ,et al. A compromise Arabic-Kazakh coded character processing method based on the OpenType font format[J]. COMPUTER STANDARDS & INTERFACES,2018,55(1):1-7.
APA Dong, J ,Jiang, TH ,Cheng, L ,Anwar, A ,&Yang, Y .(2018).A compromise Arabic-Kazakh coded character processing method based on the OpenType font format.COMPUTER STANDARDS & INTERFACES,55(1),1-7.
MLA Dong, J ,et al."A compromise Arabic-Kazakh coded character processing method based on the OpenType font format".COMPUTER STANDARDS & INTERFACES 55.1(2018):1-7.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace