Key Pointers for Developing Pre-Trained Convolutional Neural Networks for Remote Sensing Image Classification

Authors

  • Nisha Gupta, Ajay Mittal, Satvir Singh

Keywords:

Image Classification, Convolution Neural Networks, Pre-Trained Models

Abstract

The process of correctly identifying objects in an image is known as image classification. Effectively classifying high-resolution spatial images for huge remote sensing archives is known as remote sensing image classification. Better classification performance is directly correlated with effective feature extraction from images. The majority of feature extraction steps employed manually created low-level features that concentrated on basic components like color, form, and texture before deep learning was widely used in remote sensing image classification. However, because of their poor performance, these conventional handmade methods were swiftly superseded by Convolution Neural Networks (CNN’s), that successfully recovered abstract information. However, while creating deep Convolution Neural Networks, it is important to carefully consider the significant training constraints of CNNs. This research aims to investigate the primary training challenges encountered while training deep learning pre-trained models utilizing a transfer learning approach.

Downloads

Download data is not yet available.

References

I. Dimitrovski, I. Kitanovski, D. Kocev, and N. Simidjievski, “Current trends in deep learning for earth observa- tion: An open-source benchmark arena for image classification,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 197, pp. 18–35, 2023.

K. O’shea and R. Nash, “An introduction to convolutional neural networks,” arXiv preprint arXiv:1511.08458, 2015.

J. Gu, Z. Wang, J. Kuen, L. Ma, A. Shahroudy, B. Shuai, T. Liu, X. Wang, G. Wang, J. Cai, et al., “Recent advances in convolutional neural networks,” Pattern recognition, vol. 77, pp. 354–377, 2018.

N. Ketkar, J. Moolayil, N. Ketkar, and J. Moolayil, “Convolutional neural networks,” Deep Learning with Python: Learn Best Practices of Deep Learning Models with PyTorch, pp. 197–242, 2021.

J. S. Ren, L. Xu, Q. Yan, and W. Sun, “Shepard convolutional neural networks,” Advances in neural information processing systems, vol. 28, 2015.

W. Zhou, H. Guan, Z. Li, Z. Shao, and M. R. Delavar, “Remote sensing image retrieval in the past decade: Achievements, challenges, and future directions,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2023.

H. Habibi Aghdam, E. Jahani Heravi, H. Habibi Aghdam, and E. Jahani Heravi, “Convolutional neural net- works,” 2017.

U. Markowska-Kaczmar and H. Kwaśnicka, “Deep learning—a new era in bridging the semantic gap,” Bridging the Semantic Gap in Image and Video Analysis, pp. 123–159, 2018.

Y. Liu, L. Ding, C. Chen, and Y. Liu, “Similarity-based unsupervised deep transfer learning for remote sensing image retrieval,” IEEE Transactions on Geoscience and Remote Sensing, vol. 58, no. 11, pp. 7872–7889, 2020.

M. E. Paoletti, J. M. Haut, J. Plaza, and A. Plaza, “A new deep convolutional neural network for fast hy- perspectral image classification,” ISPRS journal of photogrammetry and remote sensing, vol. 145, pp. 120–147, 2018.

G. Huang, Z. Liu, L. Van Der Maaten, and K. Q. Weinberger, “Densely connected convolutional networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700–4708, 2017.

J. Zhang, C. Lu, X. Li, H.-J. Kim, and J. Wang, “A full convolutional network based on densenet for remote sensing scene classification,” Mathematical Biosciences and Engineering, vol. 16, no. 5, pp. 3345–3367, 2019.

P. S. Tan, K. M. Lim, C. H. Tan, and C. P. Lee, “Pre-trained densenet-121 with multilayer perceptron for acoustic event classification.,” IAENG International Journal of Computer Science, vol. 50, no. 1, 2023.

G. Li, M. Zhang, J. Li, F. Lv, and G. Tong, “Efficient densely connected convolutional neural networks,” Pattern Recognition, vol. 109, p. 107610, 2021.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,”

Advances in neural information processing systems, vol. 25, 2012.

L. Zhen, P. Hu, X. Peng, R. S. M. Goh, and J. T. Zhou, “Deep multimodal transfer learning for cross-modal retrieval,” IEEE Transactions on Neural Networks and Learning Systems, vol. 33, no. 2, pp. 798–810, 2020.

Y. Wang, R. Xiao, J. Qi, and C. Tao, “Cross-sensor remote-sensing images scene understanding based on transfer learning between heterogeneous networks,” IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1–5, 2021.

E. d. S. Puls, M. V. Todescato, and J. L. Carbonera, “An evaluation of pre-trained models for feature extraction in image classification,” arXiv preprint arXiv:2310.02037, 2023.

C. Desai, “Image classification using transfer learning and deep learning,” International Journal of Engineering and Computer Science, vol. 10, no. 9, pp. 25394–25398, 2021.

A. Alem and S. Kumar, “Deep learning models performance evaluations for remote sensed image classification,”

IEEE Access, vol. 10, pp. 111784–111793, 2022.

C. Wang, D. Chen, L. Hao, X. Liu, Y. Zeng, J. Chen, and G. Zhang, “Pulmonary image classification based on inception-v3 transfer learning model,” IEEE Access, vol. 7, pp. 146533–146541, 2019.

S. Zhou, W. Liang, J. Li, and J.-U. Kim, “Improved vgg model for road traffic sign recognition,” Computers, Materials & Continua, vol. 57, no. 1, pp. 11–24, 2018.

U. Muhammad, W. Wang, S. P. Chattha, and S. Ali, “Pre-trained vggnet architecture for remote-sensing image scene classification,” in 2018 24th International Conference on Pattern Recognition (ICPR), pp. 1622–1627, IEEE, 2018.

V. Risojević and V. Stojnić, “Do we still need imagenet pre-training in remote sensing scene classification?,” arXiv preprint arXiv:2111.03690, 2021.

S. Jiang, H. Zhao, W. Wu, and Q. Tan, “A novel framework for remote sensing image scene lassification,” The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. 42, pp. 657–663, 2018.

Y. Dong and Q. Zhang, “A combined deep learning model for the scene classification of high-resolution remote sensing image,” IEEE Geoscience and Remote Sensing Letters, vol. 16, no. 10, pp. 1540–1544, 2019.

G.-S. Xia, J. Hu, F. Hu, B. Shi, X. Bai, Y. Zhong, L. Zhang, and X. Lu, “Aid: A benchmark data set for performance evaluation of aerial scene classification,” IEEE Transactions on Geoscience and Remote Sensing, vol. 55, no. 7, pp. 3965–3981, 2017.

W. Zhang, P. Tang, and L. Zhao, “Remote sensing image scene classification using cnn-capsnet,” Remote Sensing, vol. 11, no. 5, p. 494, 2019.

W. Zhou, S. Newsam, C. Li, and Z. Shao, “Patternnet: A benchmark dataset for performance evaluation of remote sensing image retrieval,” ISPRS journal of photogrammetry and remote sensing, vol. 145, pp. 197–209, 2018.

Y. Liu, Y. Liu, C. Chen, and L. Ding, “Remote-sensing image retrieval with tree-triplet-classification networks,” Neurocomputing, vol. 405, pp. 48–61, 2020.

B. B. Traore, B. Kamsu-Foguem, and F. Tangara, “Deep convolution neural network for image recognition,” Ecological informatics, vol. 48, pp. 257–268, 2018.

A. Khan, A. Sohail, U. Zahoora, and A. S. Qureshi, “A survey of the recent architectures of deep convolutional neural networks,” Artificial intelligence review, vol. 53, pp. 5455–5516, 2020.

W. Zhou, S. Newsam, C. Li, and Z. Shao, “Learning low dimensional convolutional neural networks for high- resolution remote sensing image retrieval,” Remote Sensing, vol. 9, no. 5, p. 489, 2017.

J. Wang, Y. Zheng, M. Wang, Q. Shen, and J. Huang, “Object-scale adaptive convolutional neural networks for high-spatial resolution remote sensing image classification,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 14, pp. 283–299, 2020.

S. Salman and X. Liu, “Overfitting mechanism and avoidance in deep neural networks,” arXiv preprint arXiv:1901.06566, 2019.

J. Li, C. Xu, W. Yang, C. Sun, and D. Tao, “Discriminative multi-view interactive image re-ranking,” IEEE Transactions on Image Processing, vol. 26, no. 7, pp. 3113–3127, 2017.

H. Alhichri, A. S. Alswayed, Y. Bazi, N. Ammour, and N. A. Alajlan, “Classification of remote sensing images using efficientnet-b3 cnn model with attention,” IEEE access, vol. 9, pp. 14078–14094, 2021.

J. Xin, F. Ye, Y. Xia, Y. Luo, and X. Chen, “A new remote sensing image retrieval method based on cnn and yolo,” Journal of Internet Technology, vol. 24, no. 2, pp. 233–242, 2023.

S. Hochreiter, “The vanishing gradient problem during learning recurrent neural nets and problem solutions,” International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, vol. 6, no. 02, pp. 107–116, 1998.

B. Hanin, “Which neural net architectures give rise to exploding and vanishing gradients?,” Advances in neural information processing systems, vol. 31, 2018.

Y. Hu, A. Huber, J. Anumula, and S.-C. Liu, “Overcoming the vanishing gradient problem in plain recurrent networks,” arXiv preprint arXiv:1801.06105, 2018.

N. Bjorck, C. P. Gomes, B. Selman, and K. Q. Weinberger, “Understanding batch normalization,” Advances in neural information processing systems, vol. 31, 2018.

P. Luo, X. Wang, W. Shao, and Z. Peng, “Towards understanding regularization in batch normalization,” arXiv preprint arXiv:1809.00846, 2018.

S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” in International conference on machine learning, pp. 448–456, pmlr, 2015.

K. Kawaguchi, “Deep learning without poor local minima,” Advances in neural information processing systems, vol. 29, 2016.

H. Li and A. Yezzi, “Local or global minima: Flexible dual-front active contours,” IEEE transactions on pattern analysis and machine intelligence, vol. 29, no. 1, pp. 1–14, 2006.

C. A. Floudas and H. T. Jongen, “Global optimization: local minima and transition points,” Journal of Global Optimization, vol. 32, pp. 409–415, 2005.

F. Boselie, “Local versus global minima in visual pattern completion,” Perception & Psychophysics, vol. 43,

pp. 431–445, 1988.

S.-i. Amari, “Backpropagation and stochastic gradient descent method,” Neurocomputing, vol. 5, no. 4-5, pp. 185– 196, 1993.

M. Andrychowicz, M. Denil, S. Gomez, M. W. Hoffman, D. Pfau, T. Schaul, B. Shillingford, and N. De Freitas, “Learning to learn by gradient descent by gradient descent,” Advances in neural information processing systems, vol. 29, 2016.

L. Mason, J. Baxter, P. Bartlett, and M. Frean, “Boosting algorithms as gradient descent,” Advances in neural information processing systems, vol. 12, 1999.

W. A. Gardner, “Learning characteristics of stochastic-gradient-descent algorithms: A general study, analysis, and critique,” Signal processing, vol. 6, no. 2, pp. 113–133, 1984.

N. Ketkar and N. Ketkar, “Stochastic gradient descent,” Deep learning with Python: A hands-on introduction,

pp. 113–132, 2017.

J. Konečny`, J. Liu, P. Richtárik, and M. Takáč, “Mini-batch semi-stochastic gradient descent in the proximal setting,” IEEE Journal of Selected Topics in Signal Processing, vol. 10, no. 2, pp. 242–255, 2015.

X. Qian and D. Klabjan, “The impact of the mini-batch size on the variance of gradients in stochastic gradient descent,” arXiv preprint arXiv:2004.13146, 2020.

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.

D. Yi, J. Ahn, and S. Ji, “An effective optimization method for machine learning based on adam,” Applied Sciences, vol. 10, no. 3, p. 1073, 2020.

Y. Yan, T. Yang, Z. Li, Q. Lin, and Y. Yang, “A unified analysis of stochastic momentum methods for deep learning,” arXiv preprint arXiv:1808.10396, 2018.

N. B. Kovachki and A. M. Stuart, “Analysis of momentum methods,” arXiv preprint arXiv:1906.04285, 2019.

T. O. Hodson, “Root mean square error (rmse) or mean absolute error (mae): When to use them or not,”

Geoscientific Model Development Discussions, vol. 2022, pp. 1–10, 2022.

P. Li, X. He, X. Cheng, M. Qiao, D. Song, M. Chen, T. Zhou, J. Li, X. Guo, S. Hu, et al., “An improved categorical cross entropy for remote sensing image classification based on noisy labels,” Expert Systems with Applications, vol. 205, p. 117296, 2022.

P. Murugan, “Implementation of deep convolutional neural network in multi-class categorical image classifica- tion,” arXiv preprint arXiv:1801.01397, 2018.

S. L. Smith, P.-J. Kindermans, C. Ying, and Q. V. Le, “Don’t decay the learning rate, increase the batch size,”

arXiv preprint arXiv:1711.00489, 2017.

R. A. Jacobs, “Increased rates of convergence through learning rate adaptation,” Neural networks, vol. 1, no. 4,

pp. 295–307, 1988.

Z. Zhang, W. Lu, X. Feng, J. Cao, and G. Xie, “A discriminative feature learning approach with distinguishable distance metrics for remote sensing image classification and retrieval,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 16, pp. 889–901, 2022.

Y. Yao, L. Rosasco, and A. Caponnetto, “On early stopping in gradient descent learning,” Constructive Approx- imation, vol. 26, no. 2, pp. 289–315, 2007.

L. Prechelt, “Automatic early stopping using cross validation: quantifying the criteria,” Neural networks, vol. 11, no. 4, pp. 761–767, 1998.

T. Anthony, Z. Tian, and D. Barber, “Thinking fast and slow with deep learning and tree search,” Advances in neural information processing systems, vol. 30, 2017.

B. Asadi and H. Jiang, “On approximation capabilities of relu activation and softmax output layer in neural networks,” arXiv preprint arXiv:2002.04060, 2020.

B. Wang, X. Luo, Z. Li, W. Zhu, Z. Shi, and S. Osher, “Deep neural nets with interpolating function as output activation,” Advances in neural information processing systems, vol. 31, 2018

Downloads

Published

26.03.2024

How to Cite

Nisha Gupta. (2024). Key Pointers for Developing Pre-Trained Convolutional Neural Networks for Remote Sensing Image Classification. International Journal of Intelligent Systems and Applications in Engineering, 12(21s), 5076 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/7721

Issue

Section

Research Article