Detection of In-Perceptible Fruits Bearing on Trees from Inter-Spaced Images

Authors

  • Chitra Bhole K. J. Somaiya Institute of Technology, Mumbai , Reseacrh Scholar at Sir Padampat Singhania University,Udaipur, India
  • Chandani Joshi Sir Padampat Singhania University, Udaipur,India
  • Mukesh Kalla Sir Padampat Singhania University, Udaipur,India
  • Kamal Hiren Symbiosis University of Applied Science ,Indore, India
  • Anand Darpan Sir Padampat Singhania University, Udaipur,India

Keywords:

Fine-Tuning, Fruit Detection, Mask-RCNN Object Detection, Region Based Detectors, Single Shot Detectors, YOLO

Abstract

The detection of fruits is a mainstream application taken into account for Single Shot Detectors as well as Region Based Detectors. The problem can be taken on other dimension by making a shift inclined towards the real world application where the detection of fruits when bearing on the trees are done. The dataset for such a problem needs to be gathered in a custom fashion, which we were able to do successfully. Apart from this the annotations for data were also made and famous state of-the-art Object Detection models were used for performance mapping using YOLO.v3, YOLO.v4 and Mask-RCNN. These models are fine-tuned to meet the needs of the dataset and metric of mean average precision comparison is shown. The loss metrics for these models in form of their comparisons is also provided in the results section. The result section also highlights a detailed prediction yielded by the different models on different testing based images respectively

Downloads

Download data is not yet available.

References

Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” nature, vol. 521, no.7553, pp. 436–444, 2015.

H. Sarker, “Machine learning: Algorithms, real-world applications and research directions,” SN Computer Science, vol. 2, no. 3, pp. 1–21, 2021.

V. Wiley and T. Lucas, “Computer vision and image processing: A paper review,” International Journal of Artificial Intelligence Research, 2018.

Y. A. Solangi, Z. A. Solangi, S. Aarain, A. Abro, G. A. Mallah, and A. Shah, “Review on natural language processing (nlp) and its toolkits for opinion mining and sentiment analysis,” in 2018 IEEE 5th International Conference on Engineering Technologies and Applied Sciences (ICETAS), 2018, pp. 1–4.

F. Dama and C. Sinoquet, “Time series analysis and modeling to forecast: a survey,” arXiv preprint arXiv:2104.00164, 2021.

H. Purwins, B. Li, T. Virtanen, J. Schlüter, S.-Y. Chang, and T. Sainath, “Deep learning for audio signal processing,” IEEE Journal of Selected Topics in Signal Processing, vol. 13, no. 2, pp. 206–219, 2019.

J. S. Cramer, “The origins of logistic regression,” Econometrics eJournal, 2002.

M. Hearst, S. Dumais, E. Osuna, J. Platt, and B. Scholkopf, “Support vector machines,” IEEE Intelligent Systems and their Applications, vol. 13, no. 4, pp. 18–28, 1998.

P. Cunningham and S. J. Delany, “K-nearest neighbour classifiers-a tutorial,” ACM Computing Surveys (CSUR), vol. 54, no. 6, pp. 1–25, 2021.

J. R. Quinlan, “Induction of decision trees,” Machine learning, vol. 1, no. 1, pp. 81–106, 1986.

N. Kanvinde, A. Gupta, and R. Joshi, “Binary classification for high dimensional data using supervised non-parametric ensemble method,” arXiv preprint arXiv:2202.07779, 2022.

M. Gupta, S. S. Shetty, R. M. Joshi, and R. M. Laban, “Succinct differentiation of disparate boosting ensemble learning methods for prognostication of polycystic ovary syndrome diagnosis,” in 2021 International Conference on Advances in Computing, Communication, and Control (ICAC3). IEEE, 2021, pp. 1–5.

S. Nair, A. Gupta, R. Joshi, and V. Chitre, “Combining varied learners for binary classification using stacked generalization,” arXiv preprint arXiv:2202.08910, 2022.

Gupta, H. Soni, R. Joshi, and R. M. Laban, “Discriminant analysis in contrasting dimensions for polycystic ovary syndrome prognostication,”arXiv preprint arXiv:2201.03029, 2022.

D. Xu and Y. Tian, “A comprehensive survey of clustering algorithms,” Annals of Data Science, vol. 2, no. 2, pp. 165–193, 2015.

D. Maulud and A. M. Abdulazeez, “A review on linear regression comprehensive in machine learning,” Journal of Applied Science and Technology Trends, vol. 1, no. 4, pp. 140–147, Dec. 2020. [Online].

J. Singh and R. Banerjee, “A study on single and multi-layer perceptron neural network,” in 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), 2019, pp. 35–40.

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning internal representations by error propagation,” California Univ San Diego La Jolla Inst for Cognitive Science, Tech. Rep., 1985.

J. S. Bridle, “Training stochastic model recognition algorithms as networks can lead to maximum mutual information estimation of parameters,” in Proceedings of the 2nd International Conference on Neural Information Processing Systems, ser. NIPS’89. Cambridge, MA, USA: MIT Press, 1989, p. 211–217.

F. Agarap, “Deep learning using rectified linear units (relu),” arXiv preprint arXiv:1803.08375, 2018.

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.

J. Duchi, E. Hazan, and Y. Singer, “Adaptive subgradient methods for online learning and stochastic optimization,” Journal of Machine Learning Research, vol. 12, no. 61, pp. 2121–2159,2011.

Y. LeCun, Y. Bengio et al., “Convolutional networks for images, speech, and time series.”

L. Alzubaidi, J. Zhang, A. J. Humaidi, A. Al-Dujaili, Y. Duan, O. AlShamma, J. Santamaría, M. A. Fadhel, M. Al-Amidie, and L. Farhan, “Review of deep learning: Concepts, cnn architectures, challenges, applications, future directions,” Journal of big Data, vol. 8, no. 1, pp. 1–74, 2021.

Gupta, R. Joshi, and R. Laban, “Detection of tool based edited images from error level analysis and convolutional neural network,” arXiv preprint arXiv:2204.09075, 2022.

Gupta, S. Nair, R. Joshi, and V. Chitre, “Residual-concatenate neural network with deep regularization layers for binary classification,” in 2022

6th International Conference on Intelligent Computing and Control Systems (ICICCS). IEEE, 2022, pp. 1018–1022.

Li, J. Feng, L. Hu, J. Li, and H. Ma, “Review of image classification method based on deep transfer learning,” in 2020 16th International Conference on Computational Intelligence and Security (CIS), 2020, pp. 104–108.

Z.-Q. Zhao, P. Zheng, S.-t. Xu, and X. Wu, “Object detection with deep learning: A review,” IEEE transactions on neural networks and learning systems, vol. 30, no. 11, pp. 3212–3232, 2019.

N. R. Pal and S. K. Pal, “A review on image segmentation techniques,” Pattern Recognition, vol. 26, no. 9, pp. 1277–1294,1993.

S. Minaee, Y. Y. Boykov, F. Porikli, A. J. Plaza, N. Kehtarnavaz, and D. Terzopoulos, “Image segmentation using deep learning: A survey,” IEEE transactions on pattern analysis and machine intelligence, 2021.

R. M. Joshi and D. Shah, “Refactoring faces under bounding box using instance segmentation algorithms in deep learning for replacement of editing tools,” in Intelligent Computing and Networking. Springer, 2022, pp. 236–247.

M. Wang and W. Deng, “Deep face recognition: A survey,” Neurocomputing, vol. 429, pp. 215–244, 2021.

Elhagry and K. Kadaoui, “A thorough review on recent deep learning methodologies for image captioning,” arXiv preprint arXiv:2107.13114, 2021.

K. Cho, B. Van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio, “Learning phrase representations using rnn encoder-decoder for statistical machine translation,” arXiv preprint arXiv:1406.1078, 2014.

Y. Zhang, R. Jin, and Z.-H. Zhou, “Understanding bag-of-words model: a statistical framework,” International journal of machine learning and cybernetics, vol. 1, no. 1, pp. 43–52, 2010.

S. Tambe, R. Joshi, A. Gupta, N. Kanvinde, and V. Chitre, “Effects of parametric and non-parametric methods on high dimensional sparse matrix representations,” arXiv preprint arXiv:2202.02894, 2022.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, “Efficient estimation of word representations in vector space,” arXiv preprint arXiv:1301.3781, 2013.

J. Pennington, R. Socher, and C. D. Manning, “Glove: Global vectors for word representation,” in Empirical Methods in Natural Language Processing (EMNLP), 2014, pp. 1532–1543.

R. C. Staudemeyer and E. R. Morris, “Understanding lstm – a tutorial into long short-term memory recurrent neural networks,”2019.

M. Schuster and K. Paliwal, “Bidirectional recurrent neural networks,” IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2673–2681, 1997.

X. Zhang, J. Zhao, and Y. LeCun, “Character-level convolutional networks for text classification,” Advances in neural information processing systems, vol. 28, 2015.

R. Joshi, A. Gupta, and N. Kanvinde, “Res-cnn-bilstm network for overcoming mental health disturbances caused due to cyberbullying through social media,” arXiv preprint arXiv:2204.09738, 2022.

Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” Advances in neural information processing systems, vol. 30, 2017.

R. Joshi and A. Gupta, “Performance comparison of simple transformer and res-cnn-bilstm for cyberbullying classification,” arXiv preprint arXiv:2206.02206, 2022.

H. Lin, J. Si, and G. P. Abousleman, “Region-of-interest detection and its application to image segmentation and compression,” in 2007 International Conference on Integration of Knowledge Intensive Multi-Agent Systems, 2007, pp. 306–311.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2014, pp. 580–587.

K. He, X. Zhang, S. Ren, and J. Sun, “Spatial pyramid pooling in deep convolutional networks for visual recognition,” IEEE transactions on pattern analysis and machine intelligence, vol. 37, no. 9, pp. 1904–1916, 2015.

R. Girshick, “Fast r-cnn,” in Proceedings of the IEEE international conference on computer vision, 2015, pp. 1440–1448.

S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards realtime object detection with region proposal networks,” Advances in neural information processing systems, vol. 28, 2015.

J. Dai, Y. Li, K. He, and J. Sun, “R-fcn: Object detection via region-based fully convolutional networks,” Advances in neural information processing systems, vol. 29, 2016.

T.-Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, and S. Belongie, “Feature pyramid networks for object detection,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2117–2125.

K. He, G. Gkioxari, P. Dollár, and R. Girshick, “Mask r-cnn,” in Proceedings of the IEEE international conference on computer vision, 2017, pp. 2961–2969.

Erhan, C. Szegedy, A. Toshev, and D. Anguelov, “Scalable object detection using deep neural networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2014, pp. 2147–2154.

Yoo, S. Park, J.-Y. Lee, A. S. Paek, and I. So Kweon, “Attentionnet: Aggregating weak directions for accurate object detection,” in Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 2659–2667.

M. Najibi, M. Rastegari, and L. S. Davis, “G-cnn: an iterative grid based object detector,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 2369–2377.

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: Unified, real-time object detection,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 779– 788.

W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg, “Ssd: Single shot multibox detector,” in European conference on computer vision. Springer, 2016, pp. 21–37.

C.-Y. Fu, W. Liu, A. Ranga, A. Tyagi, and A. C. Berg, “Dssd: Deconvolutional single shot detector,” arXiv preprint arXiv:1701.06659, 2017.

J. Redmon and A. Farhadi, “Yolo9000: better, faster, stronger,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 7263–7271.

Z. Shen, Z. Liu, J. Li, Y.-G. Jiang, Y. Chen, and X. Xue, “Dsod: Learning deeply supervised object detectors from scratch,” in Proceedings of the IEEE international conference on computer vision, 2017, pp. 1919–1927.

J. Redmon and A. Farhadi, “Yolov3: An incremental improvement,” arXiv preprint arXiv:1804.02767, 2018.

Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, “Yolov4: Optimal speed and accuracy of object detection,” arXiv preprint arXiv:2004.10934, 2020.

C. Foong, G. K. Meng, and L. L. Tze, “Convolutional neural network based rotten fruit detection using resnet50,” in 2021 IEEE 12th Control and System Graduate Research Colloquium (ICSGRC), 2021, pp. 75–80.

T. P. Chung and D. V. Tai, “A fruits recognition system based on a modern deep learning technique,” Journal of Physics: Conference Series, vol. 1327, no. 1, p. 012050, oct 2019. [Online]. Available: https://doi.org/10.1088/1742-6596/1327/1/012050

Chaudhari, S. S. More, S. Khane, H. Mane, and P. Kamble, “Object detection using convolutional neural network in the application of supplementary nutrition value of fruits,” International Journal of Innovative Technology and Exploring Engineering, 2019.

M. Everingham, L. Van Gool, C. K. Williams, J. Winn, and A. Zisserman, “The pascal visual object classes (voc) challenge,” International journal of computer vision, vol. 88, no. 2, pp. 303–338, 2010.

T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick, “Microsoft coco: Common objects in context,” in European conference on computer vision. Springer, 2014, pp. 740–755.

M. Abadi, A. Agarwal, P. Barham, E. Brevdo, Z. Chen, C. Citro, G. S. Corrado, A. Davis, J. Dean, M. Devin, S. Ghemawat, I. Goodfellow, A. Harp, G. Irving, M. Isard, Y. Jia, R. Jozefowicz, L. Kaiser, “TensorFlow: Large-scale machine learning on heterogeneous systems,” 2015, software available from tensorflow.org.

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “Imagenet: A large-scale hierarchical image database,” in 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, pp. 248–255.

K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778.

G. Huang, Z. Liu, L. Van Der Maaten, and K. Q. Weinberger, “Densely connected convolutional networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 4700–4708.

Z. Wei, C. Duan, X. Song, Y. Tian, and H. Wang, “Amrnet: Chips augmentation in aerial images object detection,” arXiv preprint arXiv:2009.07168, 2020.

G. Ghiasi, T.-Y. Lin, and Q. V. Le, “Dropblock: A regularization method for convolutional networks,” Advances in neural information processing systems, vol. 31, 2018.

N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: a simple way to prevent neural networks from overfitting,” The journal of machine learning research, vol. 15, no. 1, pp. 1929–1958, 2014.

Misra, “Mish: A self regularized non-monotonic activation function,” arXiv preprint arXiv:1908.08681, 2019.

Massa and R. Girshick, “maskrcnn-benchmark: Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch,” https://github.com/facebookresearch/maskrcnn-benchmark, 2018, accessed: [13-06-2022].

Banerjee, S. ., & Mondal, A. C. . (2023). An Intelligent Approach to Reducing Plant Disease and Enhancing Productivity Using Machine Learning. International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 250–262. https://doi.org/10.17762/ijritcc.v11i3.6344

Prof. Sharayu Waghmare. (2012). Vedic Multiplier Implementation for High Speed Factorial Computation. International Journal of New Practices in Management and Engineering, 1(04), 01 - 06. Retrieved from http://ijnpme.org/index.php/IJNPME/article/view/8

Downloads

Published

22.07.2023

How to Cite

Bhole, C. ., Joshi, C. ., Kalla, M. ., Hiren, K. ., & Darpan, A. . . (2023). Detection of In-Perceptible Fruits Bearing on Trees from Inter-Spaced Images. International Journal of Intelligent Systems and Applications in Engineering, 11(3), 259–267. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3166

Issue

Section

Research Article