Enhancing Question Answering through Augmented Term Extraction on Generated Ontology in Closed Domain

Authors

  • Vikas Bali Department of Computer Science, Panjabi University, Patiala, India
  • Amandeep Verma Department of Computer Science, Panjabi University, Patiala, India

Keywords:

Question answering system, natural language processing, ontology, ontology-based question answering system, term extraction

Abstract

The ever-increasing use of smartphones and computers has led to a culture of instant gratification, shaping the way information is sought and shared in today's digital age. Natural language understanding (NLU) is the linchpin that allows machines to access, process, and provide answers from the vast amount of human knowledge stored in natural language (NL) text. The ongoing development of NLU technologies continues to drive advances in question answering and a wide range of other natural language processing (NLP) applications. Ontology plays a pivotal role in question answering by enhancing the system's ability to understand, contextualize, and retrieve relevant information from a structured knowledge base or unstructured textual data. In this work, we designed an Ontology-based Question Answering (QA) using a self-created dataset is an interesting and valuable endeavour. The system can leverage augmented term extraction on automatically created ontology to understand the domain context and relationships between concepts and is well integrated into the QA system. The performance of QA prototype is accessed using appropriate evaluation metrics This involve using a portion of self-created dataset as a test set and comparing the system's answers to the ground truth.

Downloads

Download data is not yet available.

References

L. A. Zadeh, "Chapter 9 From search engines to question answering systems - The problems of world knowledge, relevance, deduction and precisiation," in Fuzzy Logic and the Semantic Web, vol. 1, E. Sanchez, Ed. Elsevier, 2006, pp. 163-210.

L. Zadeh, "From search engines to question-answering systems - The role of fuzzy logic," Progress in Informatics, vol. 1, p. 1, 2005.

C. W. Lee, M. Y. Day, C. L. Sung, Y. H. Lee, T. J. Jiang, C. W. Wu, D. W. Shih, Y. R. Chen and W. L. Hsu, "Boosting Chinese Question Answering with Two Lightweight Methods: ABSPs and SCO-QAT," ACM Transactions on Asian Language Information Processing, vol. 7, p. 12, 2008.

S. Kalaivani and K. Duraiswamy, "Comparison of Question Answering Systems Based on Ontology and Semantic Web in Different Environment," Journal of Computer Science, vol. 8, no. 9, pp. 1407-1413, 2012.

P. M. Athira, M. Sreeja and P. C. Reghuraj, "Architecture of an Ontology-Based Domain-Specific Natural Language Question Answering System," International Journal of Web & Semantic Technology, vol. 4, no. 4, pp. 31-39, 2013.

J. Letkowski, "Doing database design with MySQL," Journal of Technology Research, vol. 6, 2015.

L. Hirschman and R. Gaizauskas, "Natural Language Question Answering: The View from Here," Natural Language Engineering, vol. 7, pp. 275-300, 2001.

"SPARQL Query Language for RDF," W3C, [Online]. Available: https://www.w3.org/TR/rdf-sparql-query/. [Accessed June 2023].

T. Lv, P. Yan and W. He, "Survey on JSON Data Modelling," Journal of Physics: Conference Series, vol. 1069, p. 012101, 2018.

L. Papaleo, "Introduction to XML and its applications," in Handbook of Metadata, Semantics and Ontologies, World Scientific, 2013.

"Java SE at a Glance," Oracle, [Online]. Available: https://www.oracle.com/java/technologies/java-se-glance.html. [Accessed September 2023].

"Java Downloads," Oracle, [Online]. Available: https://www.oracle.com/java/technologies/downloads /#java8-windows. [Accessed September 2023].

"Apache NetBeans," The Apache Software Foundation, [Online]. Available: https://jena.apache.org. [Accessed September 2023].

"Stanford CoreNLP – Natural language software," The Stanford NLP Group, [Online]. Available: https://stanfordnlp.github.io/CoreNLP/. [Accessed September 2023].

R. Barskar, G. Ahmed and N. Barskar, "An Approach for Extracting Exact Answers to Question Answering (QA) System for English Sentences," Procedia Engineering, vol. 30, pp. 1187-1194, 2012.

D. Mollá and J. Vicedo, "Question Answering in Restricted Domains: An Overview," Computational Linguistics, vol. 33(1), pp. 41-61, 2007.

E. M. Voorhees and D. M. Tice, "The TREC-8 Question Answering Track," in Proceedings of the Second International Conference on Language Resources and Evaluation (LREC '00), 2000.

T. Dodiya and S. Jain, "Comparison of Question Answering Systems," Advances in Intelligent Systems and Computing, 1st ed., Berlin, Heidelberg: Springer Berlin Heidelberg, pp. 99-107, 2013.

D. Jin, E. Pan, N. Oufattole, W. H. Weng, H. Fang and P. Szolovits, "What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams," Computation and Language, 2000.

M. Lee, J. Cimino, H. R. Zhu, C. Sable, V. Shanker, J. Ely and H. Yu, "Beyond Information Retrieval - Medical Question Answering," AMIA Annu Symp Proc, p. 469–473, 02 2006.

"HonQA," Health On the Net Foundation, [Online]. Available: https://hon.ch/qa. [Accessed Feb 2017].

S. Dongre and S. Lodhi, "A survey of different semantic and ontology based question answering system," International Journal of Advanced Computational Engineering and Networking, vol. 2, no. 9, pp. 69-74, 2014.

Y. Liu, X. Yi, R. Chen and Y. Song, "A Survey on Frameworks and Methods of Question Answering," in 3rd International Conference on Information Science and Control Engineering (ICISCE), 2016.

L. Mei, "Intelligent Question Answering System of Research Based Ontology on Excellent Courses," in Fourth International Conference on Computational and Information Sciences, 2012.

A. Bouziane, D. Bouchiha, N. Doumi and M. Malki, "Question Answering Systems: Survey and Trends," vol. 73, pp. 366-375, 2015.

P. Bhatia, R. Madaan, A. Sharma and A. Dixit, "A Comparison Study of Question Answering Systems," Journal of Network Communications and Emerging Technologies (JNCET), vol. 5, no. 2, pp. 192-198, 2015.

V. Bali and A. Verma, "A Study on Components, Benchmark Criteria and Techniques used in Ontology-based Question Answering Systems," International Journal of Intelligent Systems and Applications in Engineering, vol. 10, no. 1s, p. 09–17, 2022.

A. Mishra and S. Jain, "A survey on question answering systems with classification," Journal of King Saud University - Computer and Information Sciences, vol. 28, no. 3, pp. 345-361, 2016.

"Admission and eCounselling Services for Session 2023," Joint Admission Committee Chandigarh, [Online]. Available: https://jacchd.admissions.nic.in/. [Accessed September 2023].

"SQuAD2.0 - The Stanford Question Answering Dataset," The Stanford NLP Group, [Online]. Available: https://rajpurkar.github.io/SQuAD-explorer/explore/1.1/dev/. [Accessed September 2023].

A. Fader, S. Soderland and O. Etzioni, "Identifying relations for Open Information Extraction," Proceedings of the conference on empirical methods in natural language processing (EMNLP), pp. 1535-1545, 2011.

"Software > Stanford RegexNER," Stanford NLP Group, [Online]. Available: https://nlp.stanford.edu/ software/regexner.html. [Accessed September 2023].

"Protege," Stanford University, [Online]. Available: https://protege.stanford.edu/. [Accessed June 2023].

"ProtégéVOWL: VOWL Plugin for Protégé," [Online]. Available: http://vowl.visualdataweb.org/ protegevowl.html. [Accessed June 2023].

S. Lohmann, S. Negru and D. Bold, "The ProtégéVOWL Plugin: Ontology Visualization for Everyone," Proceedings of ESWC 2014 Satellite Events, vol. 8798, p. 395–400, 2014.

"TextRazor - The Natural Language Processing API," TextRazor Ltd., [Online]. Available: https://www.textrazor.com/. [Accessed September 2023].

"spaCy Named Entity Recognizer (NER)," Text Analysis Online, [Online]. Available: https://textanalysisonline.com/spacy-named-entity-recognition-ner. [Accessed September 2023].

P. Ochieng, "PAROT: Translating natural language to SPARQL," Expert Systems with Applications: X, vol. 5, p. 100024, 2020.

"Stanford Dependencies," Stanford NLP Group, [Online]. Available: https://nlp.stanford.edu/software /stanford-dependencies.html. [Accessed June 2023].

M. C. Marnee and C. D. Manning, "Stanford typed dependencies manual," 2010.

"Apache Jena," The Apache Software Foundation, [Online]. Available: https://jena.apache.org/download /index.cgi. [Accessed September 2023].

D. M. Powers, "Evaluation: from precision, recall and F-measure to ROC, informedness, markedness & correlation," Journal of Machine Learning Technologies, vol. 2(1), p. 37–63, 2011.

H. Dalianis, "Evaluation Metrics and Evaluation," in Clinical Text Mining: Secondary Use of Electronic Patient Records, Cham: Springer International Publishing, 2018, p. 45–53.

D. Diefenbach, V. Lopez, K. Singh and P. Maret, "Core Techniques of Question Answering Systems over Knowledge Bases: a Survey," vol. 55, 2018.

C. Unger, C. Forascu, V. Lopez, . A. -C. Ngomo and E. Cabrio, "Question Answering over Linked Data (QALD-5)," In: Working notes for CLEF 2015 conference, 2015.

C. Unger, C. Forascu, V. Lopez, A. -C. Ngomo and E. Cabrio, "Question answering over linked data (QALD-4)," In: Working notes for CLEF 2014 conference, vol. 1180, p. 1172–1180, 2014.

L. Zou, R. Huang, H. Wang, J. Yu, W. He and D. Zhao, "Natural language question answering over RDF - A graph data driven approach," Proceedings of the ACM SIGMOD International Conference on Management of Data, 2014.

C. Zhu, K. Ren, X. Liu, H. Wang, Y. Tian and Y. Yu, "A Graph Traversal Based Approach to Answer Non-Aggregation Questions over DBpedia," in Semantic Technology, Springer International Publishing, p. 219–234, 2016.

P. Cimiano, V. Lopez, C. Unger, E. Cabrio, A. -C. Ngonga Ngomo and S. Walter, "Multilingual Question Answering over Linked Data (QALD-3): Lab Overview," in Information Access Evaluation. Multilinguality, Multimodality, and Visualization, p. 321–332, 2013.

V. Lopez, C. Unger, P. Cimiano and E. Motta, "Evaluating Question Answering over Linked Data," Journal of Web Semantics, vol. 21, 2013.

S. Walter, C. Unger, P. Cimiano and D. Bar, "Evaluation of a Layered Approach to Question Answering over Linked Data," in The Semantic Web, p. 362–374, 2012.

Downloads

Published

13.12.2023

How to Cite

Bali, V. ., & Verma, A. . (2023). Enhancing Question Answering through Augmented Term Extraction on Generated Ontology in Closed Domain. International Journal of Intelligent Systems and Applications in Engineering, 12(8s), 54–68. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4087

Issue

Section

Research Article