Stemming Implementation in Preprocessing Phase for Evaluating of Exams Using Data Mining Approach

  • Mehmet BALCI
  • Sakir TASDEMIR
  • Ridvan SARACOGLU
Keywords: Preprocessing, Stemming, Data Mining, Exam Assessment


In educational activities, examinations are sometimes carried out in the form of multiple-choice tests or sometimes as open-ended long texts. When multiple-choice tests are performed, evaluating process is carried out either manual or computer-assisted. Exam questions prepared in the form of multiple choice tests are not suitable for every course. It may be necessary to use open-ended questionnaires in order for pupils to accurately measure their achievement in relation to the course. It can take a long time to evaluate examinations made with such questions. However, this process can create problems in terms of objective evaluation. Data mining, defined as the extraction of useful information from large quantities of data, can be used to process all kinds of data. The data mining method used in the processing of textual data is called text mining. In text processing studies, data is subject to preprocessing in order to obtain a high quality data set. The most important stage of preprocessing is stemming. In this study, stemming process is implemented to questions and correct answers taken from students. The results obtained in 2 different samples and 4 sentences are 71%, 69%, 86% and 78% correct. In order to be able to distinguish what the textual data written in the natural language really is, it is necessary to use the states of the words which are made up of construction and free from the suffixes. Therefore, in the pre-processing phase, stemming process is applied to the textual data in accordance with the grammar rules of the language they are written on, and stems of every word are found. Text processing is used in many areas of the natural language. Computer-aided solutions will be inevitable so that problems can be eliminated and open-ended questions can be quickly assessed. Despite the desirability of a computer aided solution for this measurement technique, studies of this solution are not included in the literature very much.


Download data is not yet available.


Akdağ, H. and Çoklar, A.N., İlköğretim 6. ve 7. Sınıf Öğrencilerinin Sosyal Bilgiler Dersi Proje ve Performans Görevlerini Hazırlarken Yararlandıkları Kaynaklar, Internet’in Yeri ve Karşılaştıkları Güçlükler. Adiyaman University Journal of the Institute of Social Sciences, Year 2, Issue 2, 2009, pp. 1-16.

Nazlıçiçek, N. and Akarsu, F., Fizik, Kimya ve Matematik Öğretmenlerinin Değerlendirme Araçlarıyla İlgili Yaklaşımları ve Uygulamaları. Journal of Education and Science, Vol. 33, Issue 149, 2008, pp. 18-29.

Gelbal, S. and Kelecioğlu, H., Öğretmenlerin ölçme ve değerlendirme yöntemleri hakkındaki yeterlik algıları ve karşılaştıkları sorunlar. Hacettepe University Journal of Education Faculty, 33, 2007, pp. 135-147.

Mintzes, J. J., Wandersee, J. H. and Novak, J. D., Assessing Understanding in Biology, Journal of Biological Education, 35, 3, 2001, pp. 118-125.

Wikipedia The Free Encyclopedia, Veri Madenciliği, Available link:, (Aug 09, 2016)

Balci, M., Comparative Analysis Of The Long Match Algorithm In Computer Based Text Processing. Master Thesis, The Graduate School of Natural And Applied Science, Selcuk University, Konya, 2010

Türkeş M.K., Phrase Based Indexing In Information Retrieval, Master Thesis, Graduate School of Natural and Applied Sciences, Istanbul Technical University, Istanbul, 2007

Kesgin F., Topic Detection System For Turkish Texts, Master Thesis, Graduate School of Natural and Applied Sciences, Istanbul Technical University, Istanbul, 2007

Merabet, H., Bahi, T., Drici, D., Halem, N., & Bedoud, K. (2017). DIAGNOSIS OF ROTOR FAULT USING NEURO-FUZZY INFERENCE SYSTEM. Journal Of Fundamental And Applied Sciences, 9(1), 170-182. doi:

How to Cite
M. BALCI, S. TASDEMIR, and R. SARACOGLU, “Stemming Implementation in Preprocessing Phase for Evaluating of Exams Using Data Mining Approach”, IJISAE, vol. 5, no. 2, pp. 76-80, Jun. 2017.
Research Article