Morpheme Based Myanmar Word Segmenter - International Journal of Trend in Scientific Research and Development

IJTSRD is a leading open-access, peer-reviewed international journal that provides rapid publication of research articles and aims to promote theory and practice, along with knowledge sharing among researchers, developers, engineers, students, and practitioners working in many areas around the world. For further information, write to us at editor.ijtsrd@gmail.com

Monday, 15 July 2019

Morpheme Based Myanmar Word Segmenter


Myanmar script has no fixed delimiters between words or syllables, so segmenting text into meaningful, correct words is a challenging task. This paper proposes a morpheme-based Myanmar word tokenizer that combines rule-based syllable breaking with dictionary-lookup syllable merging, using a longest string matching approach. The proposed approach is tested on a monolingual dictionary that contains useful information for word segmentation. The dictionary holds more than 32,581 words, including headwords, stop words and essential words, in the Myanmar3 font. These words are collected from Myanmar and Essential Words dictionaries. According to the experimental results, the approach provides promising segmentation accuracy on Myanmar text.
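To make the merging step concrete, here is a minimal Python sketch of greedy longest-match merging over text that has already been broken into syllables. It is an illustration under stated assumptions, not the authors' implementation: the function name `merge_syllables`, the `max_len` parameter, the single-syllable fallback, and the romanized placeholder syllables and toy dictionary are all hypothetical, and the rule-based syllable breaking step is assumed to have run beforehand.

```python
def merge_syllables(syllables, dictionary, max_len=None):
    """Greedily merge syllables into the longest words found in `dictionary`.

    Assumption: `syllables` is the output of a separate syllable breaker.
    """
    if max_len is None:
        max_len = len(syllables)
    words = []
    i = 0
    while i < len(syllables):
        # Longest candidate to try from position i (always at least 1).
        upper = max(1, min(max_len, len(syllables) - i))
        for n in range(upper, 0, -1):
            candidate = "".join(syllables[i:i + n])
            # Fallback: a lone syllable is emitted even if it is not in
            # the dictionary, so the loop always makes progress.
            if n == 1 or candidate in dictionary:
                words.append(candidate)
                i += n
                break
    return words


# Placeholder romanized syllables and a toy dictionary (not real data).
toy_dictionary = {"mranma", "mranmaca", "ca"}
syllables = ["mran", "ma", "ca", "pe"]
print(merge_syllables(syllables, toy_dictionary))
```

Trying the longest candidate first means a compound entry wins over its shorter prefix whenever both are in the dictionary; the price is that a greedy choice is never revisited, so an early long match can block a better split later in the sentence.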


by Sin Thi Yar Myint | Hanni Htun | Myat Myo Nwe Wai "Morpheme Based Myanmar Word Segmenter"

Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5, August 2019,

URL: https://www.ijtsrd.com/papers/ijtsrd26520.pdf

Paper URL: https://www.ijtsrd.com/computer-science/other/26520/morpheme-based-myanmar-word-segmenter/sin-thi-yar-myint

