Embedding-based Method for the Supervised Link Prediction in Social Networks
Main Article Content
Abstract
In recent years, social network analysis has received a lot of interest. Link prediction is an important area of research in this field that uses information from current networks to predict the likely links that will emerge in the future. It has attracted considerable attention from interdisciplinary research communities due to its ubiquitous applications in biological networks, computer science, transportation networks, bioinformatics, telecommunication networks, and so on. Currently, supervised machine learning is one of the critical techniques in the link prediction task. Several algorithms have been developed by many authors to predict the future link in the network, but there is still scope to improve the previous approaches. In the supervised link prediction process, feature selection is a crucial step. Most existing algorithms use one type of similarity-based feature to represent data, which is not well described due to the large scale and heterogeneity of social networks. One of the newest techniques for link prediction is embedding methods, which are used to preparing the feature vector for each the nonexisting links in the network. In this paper, we introduce a novel approach to supervised link prediction based on feature embedding methods in order to achieve better performance. Our contribution considers a set of embedding methods as the feature vector for training the machine learning classifiers. The main focus of this work is to investigate the relevance of different feature embedding methods to improve the performance of the supervised link prediction models. The experimental results on some real-world temporal networks revealed satisfactory results, which encourage us for further analysis. Moreover, the use of feature embedding methods will provide better performance in this regard.
Article Details
References
Haghani, Sogol, and Mohammad Reza Keyvanpour. "A systemic analysis of link prediction in social network.", Artificial Intelligence Review, vol. 52, 2019, pp. 1961-1995.
Luo, Hongsheng, et al. "Link prediction in multiplex networks using a novel multiple-attribute decision-making approach», Knowledge-Based Systems, vol. 219, 2021, pp. 106904.
Tofighy, Sajjad, Nasrollah Moghadam Charkari, and Foad Ghaderi. "Link prediction in multiplex networks using intralayer probabilistic distance and interlayer co-evolving factors." Physica A: Statistical Mechanics and its Applications, vol. 606, 2022, pp. 128043.
Nasiri, Elahe, Kamal Berahmand, and Yuefeng Li. "A new link prediction in multiplex networks using topologically biased random walks." Chaos, Solitons & Fractals, vol. 151, 2021, pp.111230.
Berahmand, Kamal, et al. "A preference random walk algorithm for link prediction through mutual influence nodes in complex networks." Journal of king saud university-computer and information sciences, vol. 34, 2022, 5375-5387.
Xie, Feng, et al. "A link prediction approach for item recommendation with complex number." Knowledge-Based Systems, vo. 81, 2015, pp.148-158.
Assouli, Nora, Khelifa Benahmed, and Brahim Gasbaoui. "How to predict crime—informatics-inspired approach from link prediction." Physica A: Statistical Mechanics and its Applications, 2021, vol. 570, pp.125795.
Chuan, Pham Minh, et al. "Link prediction in co-authorship networks based on hybrid content similarity metric." Applied Intelligence, vol. 48, 2018, pp. 2470-2486.
Lin, Chih-Hsu, et al. "Multimodal network diffusion predicts future disease–gene–chemical associations." Bioinformatics, vol. 35, 2019, pp.1536-1543.
Nasiri, Elahe, et al. "A novel link prediction algorithm for protein-protein interaction networks by attributed graph embedding." Computers in Biology and Medicine, vol.137, 2021, pp. 104772.
Daud, Nur Nasuha, et al. "Applications of link prediction in social networks: A review." Journal of Network and Computer Applications, vol. 166, 2020, pp.102716.
Kumar, Ajay, et al. "Link prediction techniques, applications, and performance: A survey." Physica A: Statistical Mechanics and its Applications, vol. 553, 2020, pp. 124289.
Mutlu, Ece C., et al. "Review on learning and extracting graph features for link prediction." Machine Learning and Knowledge Extraction, vol. 2, 2020, pp. 672-704.
Islam, Md Kamrul, Sabeur Aridhi, and Malika Smail-Tabbone. "Appraisal study of similarity-based and embedding-based link prediction methods on graphs." Proceedings of the 10th International Conference on Data Mining & Knowledge Management Process, 2021, pp. 81-92.
Cui, Peng, et al. "A survey on network embedding." IEEE transactions on knowledge and data engineering, vol.31, 2018, pp. 833-852.
Cai, Hongyun, Vincent W. Zheng, and Kevin Chen-Chuan Chang. "A comprehensive survey of graph embedding: Problems, techniques, and applications." IEEE transactions on knowledge and data engineering, vol.30, 2018, pp.1616-1637.
Wang, Daixin, Peng Cui, and Wenwu Zhu. "Structural deep network embedding." Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining. 2016, pp. 1225-1234.
Ou, Mingdong, et al. "Asymmetric transitivity preserving graph embedding." Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining. 2016, pp. 1105-1114.
Perozzi, Bryan, Rami Al-Rfou, and Steven Skiena. "Deepwalk: Online learning of social representations." Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. 2014, pp. 701-710.
Grover, Aditya, and Jure Leskovec. "node2vec: Scalable feature learning for networks." Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining. 2016, pp. 855-864.
Tang, Jian, et al. "Line: Large-scale information network embedding." Proceedings of the 24th international conference on world wide web. 2015, pp. 1067-1077.
Ribeiro, Leonardo FR, Pedro HP Saverese, and Daniel R. Figueiredo. "struc2vec: Learning node representations from structural identity." Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining. 2017, pp. 385-394.
Goyal, Palash, and Emilio Ferrara. "Graph embedding techniques, applications, and performance: A survey." Knowledge-Based Systems, vol.151, 2018, pp. 78-94.
Zeng, Shan. "Link prediction based on local information considering preferential attachment." Physica A: Statistical Mechanics and its Applications, vol. 443, 2016, pp. 537-542.
Zhou, Kai, et al. "Attacking similarity-based link prediction in social networks." arXiv preprint arXiv:1809.08368, 2018.
Mumin, Diyawu, Lei?Lei Shi, and Lu Liu. "An efficient algorithm for link prediction based on local information: Considering the effect of node degree." Concurrency and Computation: Practice and Experience, vol. 34, 2022, pp. e6289.
Cao, Ren-Meng, Si-Yuan Liu, and Xiao-Ke Xu. "Network embedding for link prediction: The pitfall and improvement." Chaos: An Interdisciplinary Journal of Nonlinear Science, vol. 29, 2019, pp. 103102.
Wu, Chencheng, et al. "Link prediction based on graph embedding method in unweighted networks." 2020 39th Chinese Control Conference (CCC). IEEE, 2020, pp. 736-741.
Tripathi, Shashi Prakash, Rahul Kumar Yadav, and Abhay Kumar Rai. "Network embedding based link prediction in dynamic networks." Future Generation Computer Systems, vol. 127, 2022, pp.409-420.
Pecli, Antonio, Maria Claudia Cavalcanti, and Ronaldo Goldschmidt. "Automatic feature selection for supervised learning in link prediction applications: a comparative study." Knowledge and Information Systems, vol. 56, 2018, pp. 85-121.
Kumari, Anisha, Behera, Ranjan Kumar, Sahoo, Kshira Sagar, et al. "Supervised link prediction using structured-based feature extraction in social network. " Concurrency and Computation: practice and Experience, vol. 34, 2020, pp. e5839.
Malhotra, Deepanshu, and Rinkaj Goyal. "Supervised-learning link prediction in single layer and multiplex networks." Machine Learning with Applications, vol. 6, 2021, pp.100086.
Badiy, Mohamed, Fatima Amounas, and Moha Hajar. "A Novel Hybrid Approach for Improving the Accuracy of the Supervised Link Prediction Based on Graph Structure Features in Social Networks." International Conference on Business Intelligence. Springer, Cham, 2022, pp. 231-242.
Pisner, Derek A., and David M. Schnyer. "Support vector machine." Machine learning. Academic Press, 2020, pp.101-121.
Taunk, Kashvi, et al. "A brief review of nearest neighbor algorithm for learning and classification." 2019 International Conference on Intelligent Computing and Control Systems (ICCS). IEEE, 2019, pp. 1255-1260.
Parmar, Aakash, Rakesh Katariya, and Vatsal Patel. "A review on random forest: An ensemble classifier." International Conference on Intelligent Data Communication Technologies and Internet of Things (ICICI) 2018. Springer International Publishing, 2019, pp. 758-763.
Bahad, Pritika, and Preeti Saxena. "Study of adaboost and gradient boosting algorithms for predictive analytics." International Conference on Intelligent Computing and Smart Communication 2019: Proceedings of ICSC 2019. Springer Singapore, 2020, pp. 235-244.
Bentéjac, Candice, Anna Csörg?, and Gonzalo Martínez-Muñoz. "A comparative analysis of gradient boosting algorithms” Artificial Intelligence Review, vol.54, 2021, pp. 1937-1967.
Chen, Tianqi, and Carlos Guestrin. "Xgboost: A scalable tree boosting system." Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016, pp. 785-794.
Zachary, Wayne W. "An information flow model for conflict and fission in small groups." Journal of anthropological research, vol. 33, 1977, pp. 452-473.
Lusseau, D., Schneider, K., Boisseau, O.J. et al. The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations. BehavEcol Sociobiol, vol. 54, 2003, pp. 396-405.
Rossi, Ryan, and Nesreen Ahmed. "The network data repository with interactive graph analytics and visualization." Proceedings of the AAAI conference on artificial intelligence. vol. 29, 2015.
Rozemberczki, Benedek, Carl Allen, and Rik Sarkar. "Multi-scale attributed node embedding." Journal of Complex Networks, vol. 9, 2021, pp. cnab014.