Exploration of Cognition Impact: An Experiment with Cartoon Retrieval through Indexing
Keywords:
CNN, MFCC, Deep LearningAbstract
Large-scale Cartoon Image retrieval systems are able to calculate Image-to-Image similarity and accommodate differences in timing, key and tempo. Simple vector distance measure is not adequately powerful to perform Cartoon Image recognition, and expensive solutions such as dynamic time warping do not scale to millions of instances, making Cartoon Image retrieval inappropriate for commercial-scale applications. In this work, the content-based music features of Images are used as input and transformed them into vectors by using the 2D Fourier transform approach. By projecting the Images into a fusion vector of PCA and LDA, the efficient KD Tree and R Tree Indexing algorithm is used to compare the similarity of Images and retrieve the most similar Images from the large-scale database. The proposed system is not only efficient enough to perform scalable content-based music retrieval but can also develop the potential of making similar music recognition applications faster and more accurate.
Downloads
References
M. Haseyama and A. Matsumura, “A trainable retrieval system for cartoon character images,” in Proc. ICME, Jul. 2003, pp. 393–396.
Y. Yang, Y. Zhuang, D. Xu, Y. Pan, D. Tao, and S. Maybank, “Retrieval based interactive cartoon synthesis via unsupervised bi-distance metric learning,” in Proc. ACM Multimedia, 2009, pp. 311–320.
D. Sykora, J. Burianek, J. Zara, Sketching cartoons by examples, In Proceedings of The 2nd Eurographics Workshop on Sketch-Based Interfaces and Modeling, 27–34, 2005
Tiejun Zhang, D. Tao, D. Xu, J. Yu, and J. Luo,“ Recognizing cartoon image gestures for retrieval and interactive cartoon clip synthesis,” IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 12, pp. 1745–1756, Dec. 2010.
Jun Yu, Dongquan Liu, Dacheng Tao and Hock Soon Seah “On Combining Multiple Features for Cartoon Character Retrieval and Clip Synthesis”, IEEE Trans. Cybernetics, Vol. 42, no. 5, pp.1413-1427, Oct. 2012
H. Kang, S. Lee, C. K. Chui, Coherent line drawing, In Proceedings of the 5th International Symposium on Non-photorealistic Animation and Rendering, 43–50, 2007
S. Belongie, J. Puzicha and J. Malik Shape Matching and Object Recognition Using Shape Contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24 (24): 509-521
K bozas, Qi Han, Handan Hou, Xiamu Niu “ Local Invariant Shape Feature for Cartoon Image Retrieval”, IEEE Trans. 2013 Second International Conference on Robot, Vision and Signal Processing
R Zhou ,Y. Yang, S. Xiang, and C. Zhang, “Neighborhood MinMax projections,” in Proc. IJCAI, 2007, pp. 993–998
D. A. Forsyth, J. Ponce. Computer Vision: A Modern Approach, Prentice Hall, 2002
C. D. Juan and B. Bodenheimer, “Cartoon textures,” in Proc. Symp.Comput. Animation, 2004, pp. 267–276.
R Hu and S. Lee, “Human action recognition using shape and CLG-motion flow from multi-view image sequences,” Patt. Recog., vol. 41, no. 7, pp. 2237–2252, 2008.
Y. Houssem, D. Zhang, G. Lu, and W. Ma, “A survey of content-based image retrieval with high-level semantics,” Patt. Recog., vol. 40, no. 1, pp. 262–282, 2007.
T.Zhang, D. Tao, X. Li, and J. Yang, “Patch alignment for dimensionality reduction,” IEEE Trans. Known. Data Eng., vol. 21, no. 9, pp. 1299–1313, Sep. 2009.
Dalal, N., Triggs B., 2005. “Histogram of Oriented Gradients for human detection” in : CVPR. pp. 886-893
Lowe, David G. (1999). “ Object recognition from local Scale invariant features”. Proceedings of the international conference on computer vision 2. pp .1150-1157
E. Oja. Subspace methods of pattern recognition, volume 6 of Pattern recognition and image processing series. John Wiley & Sons, 1993.
D.Gleich[Online].Available:http://www.mathworks.com/matlabcentral/fileexchange/loadFile.do?objectId=12422&objectType=FILE
S. Balakrishnama, A. Ganapathiraju, Linear Discriminant Analysis.
Deng Liqiong, Chen Danwen, Yuan Zhimin, Wu Lingda. Attributebased Cartoon Scene Image Search System, Advanced Materials Research, v268-270, p1030~1035, 201
SongHai Zhang, Tao Chen, YiFei Zhang, ShiMin Hu and Ralph R. Martin. Vectorizing Cartoon Animations. IEEE Transactions on Visualization and Computer Graphics, 2009,15 (4).
Yuxiang Xie, Xidao Luan, Xin Zhang, Chen Li, Liang Bai. A Cartoon Image Annotation and Retrieval System Supporting Fast Cartoon Making. 2014 IEEE 17th International Conference on Computational Science and Engineering.
Mehendale, N. Facial emotion recognition using convolutional neural networks (FERC). SN Appl. Sci. 2, 446 (2020).
Mishra, A., Rai, S.N., Mishra, A., Jawahar, C.V. (2016). IIIT-CFW: A Benchmark Database of Cartoon Faces in the Wild. In: Hua, G., Jégou, H. (eds) Computer Vision – ECCV 2016 Workshops. ECCV 2016. Lecture Notes in Computer Science(), vol 9913. Springer, Cham.
Shukla, P., Gupta, T., Singh, P., Raman, B. (2020). CARTOONNET: Caricature Recognition of Public Figures. In: Chaudhuri, B., Nakagawa, M., Khanna, P., Kumar, S. (eds) Proceedings of 3rd International Conference on Computer Vision and Image Processing. Advances in Intelligent Systems and Computing, vol 1022. Springer, Singapore.
Ming, Z., Burie, JC. & Luqman, M.M. Cross-modal photo-caricature face recognition based on dynamic multi-task learning. IJDAR 24, 33–48 (2021).
W. Zheng, L. Yan, C. Gou, W. Zhang and F. -Y. Wang, "A Relation Network Embedded with Prior Features for Few-Shot Caricature Recognition," (2019) IEEE International Conference on Multimedia and Expo (ICME), Shanghai, China, 2019, pp. 1510-1515, doi: 10.1109/ICME.2019.00261.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.