Performance Analysis of Different Optimization Algorithms for Multi-Class Object Detection

Jay Laxman Borade
Akkalakshmi Muddana

Abstract

Object recognition is an important approach for identifying objects of interest in an image. Advances in computer vision, particularly local feature detection methods, make it possible to tackle highly difficult recognition tasks. Detecting multi-class objects remains challenging, and many existing studies have worked to improve overall accuracy. These approaches still suffer from limitations such as high network loss, degraded training ability, improper consideration of features, and slow convergence. The proposed research introduces a hybrid convolutional neural network (H-CNN) approach to overcome these drawbacks. The collected input images are first pre-processed with Gaussian filtering to remove noise and enhance image quality. After pre-processing, the objects present in the images are localized using Grid Guided Localization (GGL). Effective features are extracted from the localized objects using the AlexNet model. Objects are classified by replacing the final softmax layer of AlexNet with a Support Vector Regression (SVR) model. The losses in the network model are optimized using the Improved Grey Wolf (IGW) optimization procedure. The performance of the proposed model is analyzed in Python on several datasets, including MIT-67, PASCAL VOC2010, Microsoft (MS) COCO, and MSRC. To choose the best algorithm, performance is also analyzed by varying the loss optimization algorithm, using improved Particle Swarm Optimization (IPSO), the improved Genetic Algorithm (IGA), the improved dragonfly algorithm (IDFA), the improved simulated annealing algorithm (ISAA), and the improved bacterial foraging algorithm (IBFA). The proposed model attains accuracies of 95.04% on PASCAL VOC2010, 96.02% on MIT-67, 97.37% on MSRC, and 94.53% on MS COCO.
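
The abstract does not specify how the Improved Grey Wolf procedure differs from the standard algorithm, so the following is only a minimal sketch of the standard Grey Wolf Optimizer on which IGW is presumably based. The function name `grey_wolf_optimize`, its parameters, and the toy `sphere` objective are illustrative assumptions, not the authors' implementation; in the proposed model the objective would be the network loss rather than a toy function.

```python
import random


def sphere(x):
    """Toy objective (assumption): stands in for the network loss."""
    return sum(v * v for v in x)


def grey_wolf_optimize(objective, dim, bounds, n_wolves=20, n_iters=200, seed=0):
    """Minimise `objective` over a box with the standard Grey Wolf Optimizer.

    This is the textbook algorithm, not the paper's improved variant.
    """
    rng = random.Random(seed)
    lo, hi = bounds
    # Random initial pack inside the search box.
    wolves = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_wolves)]
    best_pos, best_val = None, float("inf")

    for it in range(n_iters):
        # The three fittest wolves (alpha, beta, delta) lead the pack.
        ranked = sorted(wolves, key=objective)
        alpha, beta, delta = (list(w) for w in ranked[:3])
        alpha_val = objective(alpha)
        if alpha_val < best_val:
            best_pos, best_val = alpha, alpha_val

        # Exploration coefficient decreases linearly from 2 towards 0.
        a = 2.0 * (1.0 - it / n_iters)

        new_wolves = []
        for w in wolves:
            pos = []
            for d in range(dim):
                estimate = 0.0
                for leader in (alpha, beta, delta):
                    A = 2.0 * a * rng.random() - a
                    C = 2.0 * rng.random()
                    dist = abs(C * leader[d] - w[d])
                    estimate += leader[d] - A * dist
                # Average the three leader-guided moves, then clip to the box.
                pos.append(min(hi, max(lo, estimate / 3.0)))
            new_wolves.append(pos)
        wolves = new_wolves

    # Account for the pack produced by the final update.
    final = min(wolves, key=objective)
    if objective(final) < best_val:
        best_pos, best_val = list(final), objective(final)
    return best_pos, best_val


if __name__ == "__main__":
    position, value = grey_wolf_optimize(sphere, dim=3, bounds=(-5.0, 5.0))
    print("best position:", position)
    print("best value:", value)
```

The alternatives named in the abstract (IPSO, IGA, IDFA, ISAA, IBFA) could be compared by giving each the same objective and evaluation budget behind this kind of interface.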

How to Cite
Borade, J. L., & Muddana, A. (2023). Performance Analysis of Different Optimization Algorithms for Multi-Class Object Detection. International Journal on Recent and Innovation Trends in Computing and Communication, 11(4), 175–191. https://doi.org/10.17762/ijritcc.v11i4.6400
