Enhancing Big Data Retrieval: A Comparative Analysis of Standard and Three-Level Indexing Techniques Using Dictionary Words Dataset
Keywords:
Big Data Retrieval, Indexing Techniques, Three-Level Indexing, Standard Indexing, Comparative Analysis, Dataset, Dictionary Words, Retrieval Performance, Efficiency, Scalability, Information Extraction, Linguistic DatasetsAbstract
Utilizing a large dataset of dictionary words, this study examines the way three-level indexing approaches perform when compared to traditional indexing techniques for improving big data retrieval. The research is concerned with assessing the retrieval effectiveness, efficiency, and scalability of these indexing systems in the context of managing huge datasets. The dictionary terms dataset will be subjected to standard indexing and three-level indexing as part of the experimental framework, and the retrieval accuracy and efficiency metrics will be subjected to a thorough comparison study. Particularly in the context of linguistic datasets, the findings provide helpful information on optimizing big data retrieval strategies. This study emphasises the need of sophisticated indexing techniques for organizing and gleaning useful data from huge databases.
Downloads
References
Ali Selamat, Nicholas Akosu (2014). Word-length algorithm for language identification of under-resourced languages, Journal of King Saud University - Computer and Information Sciences Volume 28, Issue 4, October 2016, Pages 457-469 http://dx.doi.org/10.1016/j.jksuci.2014.12.004.
Julia Fomina, Denis Safikanov et.al (2020). Parametric and semantic analytical search indexes in hieroglyphic languages Procedia Computer Science, Volume 169, 2020, Pages 507-512 https://doi.org/10.1016/j.procs.2020.02.218.
Kamran Kowsari, Kiana Jafari meimandi et.al (2019) Text Classification Algorithms: A Survey, doi:10.3390/info10040150, Information 2019, www.mdpi.com/journal/inform.
Antonio Ferrandez, Jesus Peral (2019), MergedTrie: Efficient textual indexing PLoS ONE 14(4): e0215288. https://doi.org/10.1371/journal. pone.0215288.
Vimal P Parmar, C K Kumbharana (2017), Implementation of Trie Structure for Storing and Searching of English Spelled Homophone Words, International Journal of Scientific and Research Publications, Volume 7, Issue 1, January 2017 ISSN 2250-3153.
Rahat Yeasin Emon, Sharmistha Chanda Tista (2019), An Efficient Word Lookup System by using Improved Trie Algorithm P. N Singh, Tara G P (2021), Searching String in Bigdata- A better Approach by applied machine Learning, SN Computer Science (2021) 2:192.
A Gani, A Siddiqa et.al, 2015, A survey on indexing techniques for big data: taxonomy and performance evaluation. DOI 10.1007/s10115-015-0830-y.
Fatima Binta Adamu, Adid Habbal et.al (2015), A survey on big data indexing strategies, The 4th International Conference on Internet Applications, Protocols and Services (NETAPPS2015 DOI:10.13140/RG.2.1.1844.0721).
Dalia A. Abd Al lattif, Murtadha M. Hamad, A comparative Study of Indexing Techniques effect in big data system storage Optimization, 2nd Al-Noor International Conference for Science and Technology (2NICST2020), Baghdad, Iraq, https://doi.org/10.1109/NICST50904.2020.9280309
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.