Slime Mold Optimization with Relational Graph Convolutional Network for Big Data Classification on Apache Spark Environment

Main Article Content

K. Manivannan
T. Suresh

Abstract

Lately, Big Data (BD) classification has become an active research area in different fields namely finance, healthcare, e-commerce, and so on. Feature Selection (FS) is a crucial task for text classification challenges. Text FS aims to characterize documents using the most relevant feature. This method might reduce the dataset size and maximize the efficiency of the machine learning method. Various researcher workers focus on elaborating effective FS techniques. But most of the presented techniques are assessed for smaller datasets and validated by a single machine. As textual data dimensionality becomes high, conventional FS methodologies should be parallelized and improved to manage textual big datasets. This article develops a Slime Mold Optimization based FS with Optimal Relational Graph Convolutional Network (SMOFS-ORGCN) for BD Classification in Apache Spark Environment. The presented SMOFS-ORGCN model mainly focuses on the classification of BD accurately and rapidly. To handle BD, the SMOFS-ORGCN model uses an Apache Spark environment. In the SMOFS-ORGCN model, the SMOFS technique gets executed for reducing the profanity of dimensionality and to improve classification accuracy. In this article, the RGCN technique is employed for BD classification. In addition, Grey Wolf Optimizer (GWO) technique is utilized as a hyperparameter optimizer of the RGCN technique to enhance the classification achievement. To exhibit the better achievement of the SMOFS-ORGCN technique, a far-reaching experiments were conducted. The comparison results reported enhanced outputs of the SMOFS-ORGCN technique over current models.

Article Details

How to Cite
Manivannan, K. ., & Suresh, T. . (2023). Slime Mold Optimization with Relational Graph Convolutional Network for Big Data Classification on Apache Spark Environment. International Journal on Recent and Innovation Trends in Computing and Communication, 11(11s), 230–237. https://doi.org/10.17762/ijritcc.v11i11s.8095
Section
Articles

References

BenSaid, F. and Alimi, A.M., 2021. Online feature selection system for big data classification based on multi-objective automated negotiation. Pattern Recognition, 110, p.107629.

Rong, M., Gong, D. and Gao, X., 2019. Feature selection and its use in big data: challenges, methods, and trends. IEEE Access, 7, pp.19709-19725.

Abdulwahab, H.M., Ajitha, S. and Saif, M.A.N., 2022. Feature selection techniques in the context of big data: taxonomy and analysis. Applied Intelligence, pp.1-46.

AlNuaimi, N., Masud, M.M., Serhani, M.A. and Zaki, N., 2020. Streaming feature selection algorithms for big data: A survey. Applied Computing and Informatics.

Joseph Manoj, R., Praveena, A. and Vijayakumar, K., 2019. An ACO–ANN-based feature selection algorithm for big data. Cluster Computing, 22(2), pp.3953-3960.

Ghaddar, B. and Naoum-Sawaya, J., 2018. High dimensional data classification and feature selection using support vector machines. European Journal of Operational Research, 265(3), pp.993-1004.

Lakshmanaprabu, S.K., Shankar, K., Ilayaraja, M., Nasir, A.W., Vijayakumar, V. and Chilamkurti, N., 2019. Random forest for big data classification in the internet of things using optimal features. International journal of machine learning and cybernetics, 10(10), pp.2609-2618.

Soheili, M. and Eftekhari-Moghadam, A.M., 2020. DQPFS: Distributed quadratic programming based feature selection for big data. Journal of Parallel and Distributed Computing, 138, pp.1-14.

Rashid, A.N.M., Ahmed, M., Sikos, L.F. and Haskell-Dowland, P., 2020. Cooperative co-evolution for feature selection in Big Data with random feature grouping. Journal of Big Data, 7(1), pp.1-42.

Shehab, N., Badawy, M. and Ali, H.A., 2022. Toward feature selection in big data preprocessing based on hybrid cloud-based model. The Journal of Supercomputing, 78(3), pp.3226-3265.

Thenmozhi, L. and Chandrakala, N., 2022. Developed Modified Particle Swarm Optimization For Feature Selection On Learning Based Big Data In Cloud Computing. JOURNAL OF ALGEBRAIC STATISTICS, 13(1), pp.310-320.

Hira, S. and Bai, A., 2022. A Novel Map Reduced Based Parallel Feature Selection and Extreme Learning for Micro Array Cancer Data Classification. Wireless Personal Communications, 123(2), pp.1483-1505.

Hassib, E., El-Desouky, A., Labib, L. and El-Kenawy, E.S.M., 2020. WOA+ BRNN: An imbalanced big data classification framework using Whale optimization and deep neural network. Soft Computing, 24(8), pp.5573-5592.

Alarifi, A., Tolba, A., Al-Makhadmeh, Z. and Said, W., 2020. A big data approach to sentiment analysis using greedy feature selection with cat swarm optimization-based long short-term memory neural networks. The Journal of Supercomputing, 76(6), pp.4414-4429.

El-Hasnony, I.M., Barakat, S.I., Elhoseny, M. and Mostafa, R.R., 2020. Improved feature selection model for big data analytics. IEEE Access, 8, pp.66989-67004.

Esmaeilzadeh, A., Heidari, M., Abdolazimi, R., Hajibabaee, P. and Malekzadeh, M., 2022, January. Efficient large scale nlp feature engineering with apache spark. In 2022 IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC) (pp. 0274-0280). IEEE.

Zubaidi, S.L., Abdulkareem, I.H., Hashim, K.S., Al-Bugharbee, H., Ridha, H.M., Gharghan, S.K., Al-Qaim, F.F., Muradov, M., Kot, P. and Al-Khaddar, R., 2020. Hybridised artificial neural network model with slime mould algorithm: a novel methodology for prediction of urban stochastic water demand. Water, 12(10), p.2692.

Chen, Z., Huang, K., Wu, L., Zhong, Z. and Jiao, Z., 2022. Relational Graph Convolutional Network for Text-Mining-Based Accident Causal Classification. Applied Sciences, 12(5), p.2482.

Mirjalili, S., Mirjalili, S.M. and Lewis, A., 2014. Grey wolf optimizer. Advances in engineering software, 69, pp.46-61.

Lakshmanaprabu, S.K., Shankar, K., Khanna, A., Gupta, D., Rodrigues, J.J., Pinheiro, P.R. and De Albuquerque, V.H.C., 2018. Effective features to classify big data using social internet of things. IEEE access, 6, pp.24196-24204.

Kaur, I., Lydia, E.L., Nassa, V.K., Shrestha, B., Nebhen, J., Malebary, S. and Joshi, G.P., 2021. Generative Adversarial Networks with Quantum Optimization Model for Mobile Edge Computing in IoT Big Data. Wireless Personal Communications, pp.1-21.