Real-Time Streaming Analytics using Big Data Paradigm and Predictive Modelling based on Deep Learning

Main Article Content

J. Ruby Dinakar
Vagdevi S.

Abstract

With the evolution of distributed streaming platforms analysing humongous time series data, which is streamed continuously from IoT devices become lot easier. In most of the IoT networks the data are in motion or in data centre/cloud. It is possible to process this data in real time similar to edge devices using the big data framework.  In data intensive applications predictive analytics require more resources to perform complex computations. Apache Flink framework is capable of performing real time streaming of schema less data and scales very high in distributed environment with low latency, it is used to collect and store the data in the cloud. This work suggests a suitable environment to collect, transport, preprocess and aggregate the data stream to perform predictive analytics using deep learning models. Deep learning automatically extracts features and builds models after training, it has the potential to solve problems that can't be solved by conventional machine learning models. Therefore, the use of algorithms based on deep learning is recommended for forecasting temporal data. Also, we discuss a number of different deep learning forecasting models and analyse the performance of different deep learning forecasting models in order to determine which one is the effective model for single step, multi step and multi variant methods based on error functions with respect to streamed sensor data.

Article Details

How to Cite
Dinakar, J. R. ., & S., V. . (2023). Real-Time Streaming Analytics using Big Data Paradigm and Predictive Modelling based on Deep Learning . International Journal on Recent and Innovation Trends in Computing and Communication, 11(4s), 161–165. https://doi.org/10.17762/ijritcc.v11i4s.6323
Section
Articles

References

Kolajo, T., Daramola, O. & Adebiyi, A. Big data stream analysis: a systematic literature review. J Big Data 6, 47 (2019). https://doi.org/10.1186/s40537-019-0210-7

Namiot, Dmitry. (2015). On Big Data Stream Processing. International Journal of Open Information Technologies. 3. pp 48-51.

Fernandes, Eliana & Salgado, Ana Carolina & Bernardino, Jorge. (2020). Big Data Streaming Platforms to Support Real-time Analytics. 426-433. 10.5220/0009817304260433.

David J. Hill, Barbara S. Minsker,” Anomaly detection in streaming environmental sensor data: A data-driven modeling approach”, Environmental Modelling & Software, Volume 25, Issue 9,2010, pp 1014-1022, ISSN 1364-8152, https://doi.org/10.1016/j.envsoft.2009.08.010.

Huang C-J, Kuo P-H. A Deep CNN-LSTM Model for Particulate Matter (PM2.5) Forecasting in Smart Cities. Sensors. 2018; 18(7):2220. https://doi.org/10.3390/s18072220

Lim, B, Zohren S. 2021 Time-series forecasting with deep learning: a survey. Phil. Trans. R. Soc. A379: 20200209. https://doi.org/10.1098/rsta.2020.0209

Mahmud, Amal & Mohammed, Ammar. (2021). A Survey on Deep Learning for Time-Series Forecasting. 10.1007/978-3-030-59338-4_19.

Kang, Gaganjot et al. “Air Quality Prediction: Big Data and Machine Learning Approaches.” International journal of environmental science and development 9 (2018): 8-16.

Heydari, A., Majidi Nezhad, M., Astiaso Garcia, D. et al. Air pollution forecasting application based on deep learning model and optimization algorithm. Clean Techn Environ Policy (2021).

https://doi.org/10.1007/s10098-021-02080-5

Jeya, S., & Sankari, L. (2020). Air Pollution Prediction by Deep Learning Model. 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS). doi:10.1109/iciccs48265.2020.9120

Torres JF, Hadjout D, Sebaa A, Martínez-Álvarez F, Troncoso A. Deep Learning for Time Series Forecasting: A Survey. Big Data. 2021 Feb;9(1):3-21. doi: 10.1089/big.2020.0159.

Wenjing Mao, Weilin Wang, Limin Jiao, Suli Zhao, Anbao Liu, Modeling air quality prediction using a deep learning approach: Method optimization and evaluation, Sustainable Cities and Society,Volume65,2021,102567,ISSN2210-6707, https://doi.org/10.1016/j.scs.2020.102567.

Apache Flink. https://flink.apache.org/.

Apache Kafka https://kafka.apache.org/