A Computational Implementation of Morphological Analysis and Generation of Verbs in Myanmar Language

Authors

  • Kaung Myat Thu Research Scholar Department of Computer Science, Manipur University
  • H. Mamata Devi Professor Department of Computer Science Manipur University
  • Th. Rupachandra Singh Assistant Professor MCA (Manipur University)

Keywords:

Morphology, Natural Language Processing, FST, MM-Morph, FSA, Morphological analysis and generation (MAG), finite-state morphological analysis and generation (FSMAG), Computational linguistics tool, Indian Language, Myanmar Language, Xerox's xfst, FOMA

Abstract

The field of morphological analysis and generation focuses on the study of word production, the recognition of grammatical components within words, and the creation of words that adhere to morphotactic standards. According to various research reports, finite-state techniques are fast, effective, and efficient in interpreting human language morphologies into the computational system. FOMA: a more elaborate version of Xerox's finite state toolset can be used to implement the finite state morphology. Using FOMA toolset and other programming languages, we have already created the MM-Morph tool: a computational linguistic tool for morphological analyzer and generator for Myanmar nouns. In this paper, we describe the linguistic phenomena of the morphology of verbs and the techniques used in the system's development process to integrate it into the existing MM-Morph tool. MM-Morph has been developed as a part of the research "Morphological Analysis and Generation for Myanmar Language using Finite State Techniques." We share the experimental evaluations conducted to assess this system's performance. Evaluation results show that the MAG system of Myanmar verbs can identify more than (78%) of the verbs in the language.

Downloads

Download data is not yet available.

References

Goldsmith, John. (2001). Unsupervised learning of the morphology of a natural language. Computational linguistics, 27(2):153–198, 2001.

Goldsmith, John. (2006). An Algorithm for the Unsupervised Learning of Morphology. Natural Language Engineering. 12. 353-371. 10.1017/S1351324905004055.

Welgama, Viraj & Weerasinghe, Ruvan & Niranjan, Mahesan. (2013). Evaluating a Machine Learning Approach to Sinhala Morphological Analysis. 10th International Conference on Natural Language Processing, At: Noida, India

John Lee. (2008). A Nearest-Neighbour Approach to the Automatic Analysis of Ancient Greek Morphology. CoNLL 2008: Proceedings of the 12th Conference on Computational Natural Language Learning, Manchester, August 2008: 127–134

Anand Kumar, M. & Dhanalakshmi, V. & Kp, Soman & Sankaravelayuthan, Rajendran. (2010). A Sequence Labeling Approach to Morphological Analyzer for Tamil Language. (IJCSE) International Journal on Computer Science and Engineering Volume 02, Issue No. 06, Page no (2201-2208),2010.

Canasai Kruengkrai, Virach Sornlertlamvanich, Hitoshi Isahara. (2006). A Conditional Random Field Framework for Thai Morphological Analysis. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), May 24-26, 2006. Genoa, Italy.)

P. J. Antony, Dr. M. Anand Kumar, and Dr. Soman K. P. (2006). Paradigm based morphological analyzer for Kannada language using machine learning approach. International journal on-Advances in Computer Science and Technology (ACST), ISSN 0973-6107, vol. 3, pp. 457–481.

Shah D.N., Bhadka H. (2020). Paradigm-Based Morphological Analyzer for the Gujarati Language. Advances in Intelligent Systems and Computing, vol 989. Springer, Singapore. https://doi.org/10.1007/978-981-13-8618-3_50

Jedrzejowicz, Piotr & Strychowski, Jakub. (2005). A Neural Network Based Morphological Analyser of the Natural Language. 199-208. 10.1007/3-540-32392-9_21.

Premjith, B., Soman, K. P., & Kumar, M. A. (2018). A deep learning approach for Malayalam morphological analysis at character level. Procedia computer science, 132, 47-54.

Kimmo Koskenniemi. (1983). Two-level morphology. Ph.D. thesis, University of Helsinki.

Lauri Karttunen and Kenneth R Beesley. (2012). A short history of two-level morphology. ESSLLI-2001 Special Event titled" Twenty Years of Finite-State Morphology.

Katushemererwe, F., & Hanneforth, T. (2010). Finite State Methods in Morphological Analysis of Runyakitara Verbs. Nordic Journal of African Studies, 19, 22-22.

Sarveswaran, K., Dias, G., & Butt, M. (2018). ThamizhiFST: A Morphological Analyser and Generator for Tamil Verbs. 2018 3rd International Conference on Information Technology Research (ICITR), 1-6.

Kengatharaiyer, Sarveswaran & Dias, Gihan & Butt, Miriam. (2019). Using Meta-Morph Rules to develop Morphological Analysers: A case study concerning Tamil. 10.18653/v1/W19-3111.

Kayabaş, A., Schmid, H., Topcu, A., & Kiliç, Ö. (2019). TRMOR: a finite-state-based morphological analyzer for Turkish. Turkish Journal of Electrical Engineering and Computer Sciences, 27, 3837-3851.

Kenneth R Beesley and Lauri Karttunen. (2003). Finite-state morphology: Xerox tools and techniques. CSLI Publications, Stanford.

Cyril Allauzen, Michael Riley, Johan Schalkwyk, Wojciech Skut, and Mehryar Mohri. (2007). OpenFst: A general and efficient weighted finite state transducer library. In International Conference on Implementation and Application of Automata, pages 11–23. Springer.

Krister Lindén, Miikka Silfverberg, and Tommi Pirinen. (2009). HFST tools for morphology–an efficient open-source package for construction of morphological analyzers. In International Workshop on Systems and Frameworks for Computational Morphology, pages 28–47. Springer.

Mans Hulden. (2009). Foma: a finite-state compiler and library. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: Demonstrations Session, pages 29–32. Association for Computational Linguistics.

Abebe, Tewodros & Washington, Jonathan & Gasser, Michael & Yimam, Baye. (2018). A Finite-State Morphological Analyzer for Wolaytta. Information and Communication Technology for Development for Africa.10.1007/978-3-319-95153-9_2.

Rahman, Mirzanur & Sarma, Shikhar. (2015). An Implementation of Apertium Based Assamese Morphological Analyzer. International Journal on Natural Language Computing. 4. 10.5121/ijnlc.2015.4102.

Zueva, A., Kuznetsova, A., & Tyers, F. (2020). A Finite-State Morphological Analyser for Evenki. Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2581–2589.

Ammari, Rachid & Zenkoua, Ahbib. (2021). APMorph: finite-state transducer for Amazigh pronominal morphology. International Journal of Electrical and Computer Engineering (IJECE). 11. 699. 10.11591/ijece.v11i1.pp699-706.

Keleg, A., Tyers, F.M., Howell, N., & Pirinen, T.A. (2020). An Unsupervised Method for Weighting Finite-state Morphological Analyzers. Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 3842–3850

Phyue, S. L., & Thida, A. (2012). Morphological Processor for Inflectional Case of Multipurpose Lexico-Conceptual KnowledgeResource. International Journal of Advanced Research in Computer Engineering & Technology (IJARCET), 1(7), 157-163.

Latt, T. M., & Thida, A. (2018, June). An Analysis of Myanmar Inflectional Morphology Using Finite-state Method. In 2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS) (pp. 297-302). IEEE.

Sgarbas. (2000). A Straight Forward Approach to Morphological Analysis and Synthesis, Sgarbas (et. all), Proceeding of COMLEX 2000, Greece.

Sarmah, J., Sarma, S.K., & Barman, A. (2019). Development of Assamese Rule-based Stemmer using WordNet. GWC.

Debbarma, K., Patra, B. G., Das, D., & Bandyopadhyay, S. (2012, December). Morphological Analyzer for Kokborok. In Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing (pp. 41-52).

Huxley, A. (2003). The Burmese Script. Journal of the Royal Asiatic Society, 133(2), 197-207.

Luce, G. H. (1985). Early Inscriptions of Burma. In Indic Scripts of Southeast Asia (pp. 45-63). Cornell University Press.

Okell, J. (2001). The Early Mon Tradition. In Myanmar: State, Society and Ethnicity (pp. 39-57). Institute of Southeast Asian Studies.

Juneja, V., Singh, S., Jain, V., Pandey, K.K., Dhabliya, D., Gupta, A., Pandey, D. Optimization-based data science for an IoT service applicable in smart cities (2023) Handbook of Research on Data-Driven Mathematical Modeling in Smart Cities, pp. 300-321.

Dhabliya, D. Security analysis of password schemes using virtual environment (2019) International Journal of Advanced Science and Technology, 28 (20), pp. 1334-1339

Downloads

Published

13.12.2023

How to Cite

Thu , K. M. ., Devi , H. M. ., & Singh , T. R. . (2023). A Computational Implementation of Morphological Analysis and Generation of Verbs in Myanmar Language. International Journal of Intelligent Systems and Applications in Engineering, 12(8s), 615–622. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4234

Issue

Section

Research Article