A Computational Implementation of Morphological Analysis and Generation of Verbs in Myanmar Language
Keywords:
Morphology, Natural Language Processing, FST, MM-Morph, FSA, Morphological analysis and generation (MAG), finite-state morphological analysis and generation (FSMAG), Computational linguistics tool, Indian Language, Myanmar Language, Xerox's xfst, FOMAAbstract
The field of morphological analysis and generation focuses on the study of word production, the recognition of grammatical components within words, and the creation of words that adhere to morphotactic standards. According to various research reports, finite-state techniques are fast, effective, and efficient in interpreting human language morphologies into the computational system. FOMA: a more elaborate version of Xerox's finite state toolset can be used to implement the finite state morphology. Using FOMA toolset and other programming languages, we have already created the MM-Morph tool: a computational linguistic tool for morphological analyzer and generator for Myanmar nouns. In this paper, we describe the linguistic phenomena of the morphology of verbs and the techniques used in the system's development process to integrate it into the existing MM-Morph tool. MM-Morph has been developed as a part of the research "Morphological Analysis and Generation for Myanmar Language using Finite State Techniques." We share the experimental evaluations conducted to assess this system's performance. Evaluation results show that the MAG system of Myanmar verbs can identify more than (78%) of the verbs in the language.
Downloads
References
Goldsmith, John. (2001). Unsupervised learning of the morphology of a natural language. Computational linguistics, 27(2):153–198, 2001.
Goldsmith, John. (2006). An Algorithm for the Unsupervised Learning of Morphology. Natural Language Engineering. 12. 353-371. 10.1017/S1351324905004055.
Welgama, Viraj & Weerasinghe, Ruvan & Niranjan, Mahesan. (2013). Evaluating a Machine Learning Approach to Sinhala Morphological Analysis. 10th International Conference on Natural Language Processing, At: Noida, India
John Lee. (2008). A Nearest-Neighbour Approach to the Automatic Analysis of Ancient Greek Morphology. CoNLL 2008: Proceedings of the 12th Conference on Computational Natural Language Learning, Manchester, August 2008: 127–134
Anand Kumar, M. & Dhanalakshmi, V. & Kp, Soman & Sankaravelayuthan, Rajendran. (2010). A Sequence Labeling Approach to Morphological Analyzer for Tamil Language. (IJCSE) International Journal on Computer Science and Engineering Volume 02, Issue No. 06, Page no (2201-2208),2010.
Canasai Kruengkrai, Virach Sornlertlamvanich, Hitoshi Isahara. (2006). A Conditional Random Field Framework for Thai Morphological Analysis. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), May 24-26, 2006. Genoa, Italy.)
P. J. Antony, Dr. M. Anand Kumar, and Dr. Soman K. P. (2006). Paradigm based morphological analyzer for Kannada language using machine learning approach. International journal on-Advances in Computer Science and Technology (ACST), ISSN 0973-6107, vol. 3, pp. 457–481.
Shah D.N., Bhadka H. (2020). Paradigm-Based Morphological Analyzer for the Gujarati Language. Advances in Intelligent Systems and Computing, vol 989. Springer, Singapore. https://doi.org/10.1007/978-981-13-8618-3_50
Jedrzejowicz, Piotr & Strychowski, Jakub. (2005). A Neural Network Based Morphological Analyser of the Natural Language. 199-208. 10.1007/3-540-32392-9_21.
Premjith, B., Soman, K. P., & Kumar, M. A. (2018). A deep learning approach for Malayalam morphological analysis at character level. Procedia computer science, 132, 47-54.
Kimmo Koskenniemi. (1983). Two-level morphology. Ph.D. thesis, University of Helsinki.
Lauri Karttunen and Kenneth R Beesley. (2012). A short history of two-level morphology. ESSLLI-2001 Special Event titled" Twenty Years of Finite-State Morphology.
Katushemererwe, F., & Hanneforth, T. (2010). Finite State Methods in Morphological Analysis of Runyakitara Verbs. Nordic Journal of African Studies, 19, 22-22.
Sarveswaran, K., Dias, G., & Butt, M. (2018). ThamizhiFST: A Morphological Analyser and Generator for Tamil Verbs. 2018 3rd International Conference on Information Technology Research (ICITR), 1-6.
Kengatharaiyer, Sarveswaran & Dias, Gihan & Butt, Miriam. (2019). Using Meta-Morph Rules to develop Morphological Analysers: A case study concerning Tamil. 10.18653/v1/W19-3111.
Kayabaş, A., Schmid, H., Topcu, A., & Kiliç, Ö. (2019). TRMOR: a finite-state-based morphological analyzer for Turkish. Turkish Journal of Electrical Engineering and Computer Sciences, 27, 3837-3851.
Kenneth R Beesley and Lauri Karttunen. (2003). Finite-state morphology: Xerox tools and techniques. CSLI Publications, Stanford.
Cyril Allauzen, Michael Riley, Johan Schalkwyk, Wojciech Skut, and Mehryar Mohri. (2007). OpenFst: A general and efficient weighted finite state transducer library. In International Conference on Implementation and Application of Automata, pages 11–23. Springer.
Krister Lindén, Miikka Silfverberg, and Tommi Pirinen. (2009). HFST tools for morphology–an efficient open-source package for construction of morphological analyzers. In International Workshop on Systems and Frameworks for Computational Morphology, pages 28–47. Springer.
Mans Hulden. (2009). Foma: a finite-state compiler and library. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: Demonstrations Session, pages 29–32. Association for Computational Linguistics.
Abebe, Tewodros & Washington, Jonathan & Gasser, Michael & Yimam, Baye. (2018). A Finite-State Morphological Analyzer for Wolaytta. Information and Communication Technology for Development for Africa.10.1007/978-3-319-95153-9_2.
Rahman, Mirzanur & Sarma, Shikhar. (2015). An Implementation of Apertium Based Assamese Morphological Analyzer. International Journal on Natural Language Computing. 4. 10.5121/ijnlc.2015.4102.
Zueva, A., Kuznetsova, A., & Tyers, F. (2020). A Finite-State Morphological Analyser for Evenki. Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2581–2589.
Ammari, Rachid & Zenkoua, Ahbib. (2021). APMorph: finite-state transducer for Amazigh pronominal morphology. International Journal of Electrical and Computer Engineering (IJECE). 11. 699. 10.11591/ijece.v11i1.pp699-706.
Keleg, A., Tyers, F.M., Howell, N., & Pirinen, T.A. (2020). An Unsupervised Method for Weighting Finite-state Morphological Analyzers. Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 3842–3850
Phyue, S. L., & Thida, A. (2012). Morphological Processor for Inflectional Case of Multipurpose Lexico-Conceptual KnowledgeResource. International Journal of Advanced Research in Computer Engineering & Technology (IJARCET), 1(7), 157-163.
Latt, T. M., & Thida, A. (2018, June). An Analysis of Myanmar Inflectional Morphology Using Finite-state Method. In 2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS) (pp. 297-302). IEEE.
Sgarbas. (2000). A Straight Forward Approach to Morphological Analysis and Synthesis, Sgarbas (et. all), Proceeding of COMLEX 2000, Greece.
Sarmah, J., Sarma, S.K., & Barman, A. (2019). Development of Assamese Rule-based Stemmer using WordNet. GWC.
Debbarma, K., Patra, B. G., Das, D., & Bandyopadhyay, S. (2012, December). Morphological Analyzer for Kokborok. In Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing (pp. 41-52).
Huxley, A. (2003). The Burmese Script. Journal of the Royal Asiatic Society, 133(2), 197-207.
Luce, G. H. (1985). Early Inscriptions of Burma. In Indic Scripts of Southeast Asia (pp. 45-63). Cornell University Press.
Okell, J. (2001). The Early Mon Tradition. In Myanmar: State, Society and Ethnicity (pp. 39-57). Institute of Southeast Asian Studies.
Juneja, V., Singh, S., Jain, V., Pandey, K.K., Dhabliya, D., Gupta, A., Pandey, D. Optimization-based data science for an IoT service applicable in smart cities (2023) Handbook of Research on Data-Driven Mathematical Modeling in Smart Cities, pp. 300-321.
Dhabliya, D. Security analysis of password schemes using virtual environment (2019) International Journal of Advanced Science and Technology, 28 (20), pp. 1334-1339
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.