(Publisher of Peer Reviewed Open Access Journals)

International Journal of Advanced Technology and Engineering Exploration (IJATEE)

ISSN (Print):2394-5443    ISSN (Online):2394-7454
Volume-9 Issue-90 May-2022
Full-Text PDF
Paper Title : Exponential kernelized feature map Theil-Sen regression-based deep belief neural learning classifier for drift detection with data stream
Author Name : Thangam M and A. Bhuvaneswari
Abstract :

Data streams are potentially large and thus data stream classification tasks are not strictly stationary. In the process of data analysis, the fundamental structure may vary over time and the changes in the primary distribution of the data are known as drift. Early drift detection achieves better detection results in the evolving data stream analysis. In order to perform accurate drift detection with minimum time, a novel deep learning technique called exponential kernelized feature map Theil-Sen regression-based deep belief neural learning classifier (EKFMTR-DBNLC) was introduced. The main aim of the proposed EKFMTR-DBNLC technique is to perform multiple drift detection from the data stream using multiple layers. The proposed deep belief network comprises of various layers such as an input layer, an output layer and two hidden layers. The input layer receives the number of features and data from the dataset. The hidden layers perform the significant feature selection to reduce the drift detection time. The exponential kernelized semantic feature mapping technique is applied for identifying the significant feature for data classifications. Then, using Theil-Sen regression (TSR) function, the drifts in the data stream are detected and classified from the selected relevant features in the next hidden layer. The regression function analyzes the distribution of the data between the two-time intervals. Based on regression analysis, multiple drifts such as incremental drift, gradual drift, sudden drift and recurring drift are identified. Experimental estimation of the proposed EKFMTR-DBNLC technique and conventional methods are performed with different factors such as classification accuracy, precision, recall, F-score and drift detection time using real-world and synthetic datasets. The analyzed numerical result confirms that the proposed technique EKFMTR-DBNLC achieves 10% higher classification accuracy and also minimizes the time consumption by 13.5% than the conventional methods.

Keywords : Data stream classification, Drift detection, Feature mapping, Neural learning classifier, Regression function.
Cite this article : Thangam M, Bhuvaneswari A. Exponential kernelized feature map Theil-Sen regression-based deep belief neural learning classifier for drift detection with data stream. International Journal of Advanced Technology and Engineering Exploration. 2022; 9(90):663-675. DOI:10.19101/IJATEE.2021.874851.
References :
[1]Khamassi I, Sayed-mouchaweh M, Hammami M, Ghédira K. Discussion and review on evolving data streams and concept drift adapting. Evolving Systems. 2018; 9(1):1-23.
[Crossref] [Google Scholar]
[2]Krawczyk B, Minku LL, Gama J, Stefanowski J, Woźniak M. Ensemble learning for data stream analysis: a survey. Information Fusion. 2017; 37:132-56.
[Crossref] [Google Scholar]
[3]Wares S, Isaacs J, Elyan E. Data stream mining: methods and challenges for handling concept drift. SN Applied Sciences. 2019; 1(11):1-19.
[Crossref] [Google Scholar]
[4]Gama J, Medas P, Castillo G, Rodrigues P. Learning with drift detection. In Brazilian symposium on artificial intelligence 2004 (pp. 286-95). Springer, Berlin, Heidelberg.
[Crossref] [Google Scholar]
[5]Wang X, Chen W, Xia J, Chen Z, Xu D, Wu X, et al. ConceptExplorer: visual analysis of concept drifts in multi-source time-series data. In conference on visual analytics science and technology 2020 (pp. 1-11). IEEE.
[Crossref] [Google Scholar]
[6]Hatamikhah N, Barari M, Kangavari MR, Keyvanrad MA. Concept drift detection via improved deep belief network. In electrical engineering Iranian conference on 2018 (pp. 1703-7). IEEE.
[Crossref] [Google Scholar]
[7]Shah SH, Rehman A, Rashid T, Karim J, Shah S. A comparative study of ordinary least squares regression and Theil-Sen regression through simulation in the presence of outliers. Journal of Science and Technology. 2016; 137-42.
[Google Scholar]
[8]Hua Y, Guo J, Zhao H. Deep belief networks and deep learning. In proceedings of 2015 international conference on intelligent computing and internet of things 2015 (pp. 1-4). IEEE.
[Crossref] [Google Scholar]
[9]Zheng X, Li P, Hu X, Yu K. Semi-supervised classification on data streams with recurring concept drift and concept evolution. Knowledge-Based Systems. 2021.
[Crossref] [Google Scholar]
[10]Pratama M, Pedrycz W, Webb GI. An incremental construction of deep neuro fuzzy system for continual learning of nonstationary data streams. IEEE Transactions on Fuzzy Systems. 2019; 28(7):1315-28.
[Crossref] [Google Scholar]
[11]Yan MM. Accurate detecting concept drift in evolving data streams. ICT Express. 2020; 6(4):332-8.
[Crossref] [Google Scholar]
[12]Prasad KS, Rao AS, Ramana AV. Ensemble framework for concept-drift detection in multidimensional streaming data. International Journal of Computers and Applications. 2020:1-8.
[Crossref] [Google Scholar]
[13]Mahdi OA, Pardede E, Ali N. KAPPA as drift detector in data stream mining. Procedia Computer Science. 2021; 184:314-21.
[Crossref] [Google Scholar]
[14]Namitha K, Kumar GS. Learning in the presence of concept recurrence in data stream clustering. Journal of Big Data. 2020; 7(1):1-28.
[Crossref] [Google Scholar]
[15]Liu A, Lu J, Zhang G. Concept drift detection via equal intensity k-means space partitioning. IEEE Transactions on Cybernetics. 2020; 51(6):3198-211.
[Crossref] [Google Scholar]
[16]Bi X, Zhang C, Zhao X, Li D, Sun Y, Ma Y. CODES: efficient incremental semi-supervised classification over drifting and evolving social streams. IEEE Access. 2020; 8:14024-35.
[Crossref] [Google Scholar]
[17]Chen D, Yang Q, Liu J, Zeng Z. Selective prototype-based learning on concept-drifting data streams. Information Sciences. 2020; 516:20-32.
[Crossref] [Google Scholar]
[18]Mahdi OA, Pardede E, Ali N, Cao J. Diversity measure as a new drift detection method in data streaming. Knowledge-Based Systems. 2020.
[Crossref] [Google Scholar]
[19]Singh VK, Verma S, Kumar M. Stream processing with concept drift for event identification in sensors enabled IoT environment. IEEE Sensors Journal. 2019; 19(24):12187-95.
[Crossref] [Google Scholar]
[20]Ancy S, Paulraj D. Handling imbalanced data with concept drift by applying dynamic sampling and ensemble classification model. Computer Communications. 2020; 153:553-60.
[Crossref] [Google Scholar]
[21]Altendeitering M, Dübler S. Scalable detection of concept drift: a learning technique based on support vector machines. Procedia Manufacturing. 2020; 51:400-7.
[Crossref] [Google Scholar]
[22]Liu A, Lu J, Zhang G. Concept drift detection: dealing with missing values via fuzzy distance estimations. IEEE Transactions on Fuzzy Systems. 2020; 29(11):3219-33.
[Crossref] [Google Scholar]
[23]Yang Z, Al-Dahidi S, Baraldi P, Zio E, Montelatici L. A novel concept drift detection method for incremental learning in nonstationary environments. IEEE Transactions on Neural Networks and Learning Systems. 2019; 31(1):309-20.
[Crossref] [Google Scholar]
[24]Jedrzejowicz J, Jedrzejowicz P. GEP-based classifier with drift detection for mining imbalanced data streams. Procedia Computer Science. 2020; 176:41-9.
[Crossref] [Google Scholar]
[25]Mehmood H, Kostakos P, Cortes M, Anagnostopoulos T, Pirttikangas S, Gilman E. Concept drift adaptation techniques in distributed environment for real-world data streams. Smart Cities. 2021; 4(1):349-71.
[Crossref] [Google Scholar]
[26]Yuan Y, Wang Z, Wang W. Unsupervised concept drift detection based on multi-scale slide windows. Ad Hoc Networks. 2021.
[Crossref] [Google Scholar]
[27]Priya S, Uthra RA. Deep learning framework for handling concept drift and class imbalanced complex decision-making on streaming data. Complex & Intelligent Systems. 2021:1-17.
[Crossref] [Google Scholar]
[28]Oikarinen E, Tiittanen H, Henelius A, Puolamäki K. Detecting virtual concept drift of regressors without ground truth values. Data Mining and Knowledge Discovery. 2021; 35(3):726-47.
[Crossref] [Google Scholar]
[29]Mayaki MZ, Riveill M. Autoregressive based drift detection method. arXiv preprint arXiv:2203.04769. 2022.
[Google Scholar]
[30]Nikpour S, Asadi S. A dynamic hierarchical incremental learning-based supervised clustering for data stream with considering concept drift. Journal of Ambient Intelligence and Humanized Computing. 2022; 13(6):2983-300.
[Crossref] [Google Scholar]