Reservoir Sampling Based Streaming Method for Large Scale Collaborative Filtering
AbstractCollaborative filtering algorithms work on user feedback data (such as purchases, clicks, or ratings.) in order to build models of users and items. User feedback data in real life e-commerce sites can be very large which incurs high costs on maintenance and model building. Parallelization of computation might help but it results in additional costs for extra computing power and maintenance problems of very large datasets still persist. Sampling at this point can be an effective approach for reducing the amount of data. In this work we propose a novel sampling technique for collaborative filtering which can be used to reduce the amount of data considerably. Experimental results on three real life datasets show that the proposed method leads to a significant reduction in the amount of data with little harm to the accuracy of the models. The method works in a streaming fashion which makes it suitable for being used in real time at large-scale e-commerce applications where there is a large flow of continuous user feedback.
Copyright (c) 2018 International Journal of Intelligent Systems and Applications in Engineering
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.