Concept Drift Detection in Data Stream Mining: The Review of Contemporary Literature

1
B. Ramakrishna
B. Ramakrishna
2
Dr. S Krishna Mohan Rao
Dr. S Krishna Mohan Rao
1 JNTU

Send Message

To: Author

GJCST Volume 17 Issue C2

Article Fingerprint

ReserarchID

CSTSDE7FFA9

Concept Drift Detection in Data Stream Mining: The Review of Contemporary Literature Banner
  • English
  • Afrikaans
  • Albanian
  • Amharic
  • Arabic
  • Armenian
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan
  • Cebuano
  • Chichewa
  • Chinese (Simplified)
  • Chinese (Traditional)
  • Corsican
  • Croatian
  • Czech
  • Danish
  • Dutch
  • Esperanto
  • Estonian
  • Filipino
  • Finnish
  • French
  • Frisian
  • Galician
  • Georgian
  • German
  • Greek
  • Gujarati
  • Haitian Creole
  • Hausa
  • Hawaiian
  • Hebrew
  • Hindi
  • Hmong
  • Hungarian
  • Icelandic
  • Igbo
  • Indonesian
  • Irish
  • Italian
  • Japanese
  • Javanese
  • Kannada
  • Kazakh
  • Khmer
  • Korean
  • Kurdish (Kurmanji)
  • Kyrgyz
  • Lao
  • Latin
  • Latvian
  • Lithuanian
  • Luxembourgish
  • Macedonian
  • Malagasy
  • Malay
  • Malayalam
  • Maltese
  • Maori
  • Marathi
  • Mongolian
  • Myanmar (Burmese)
  • Nepali
  • Norwegian
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Punjabi
  • Romanian
  • Russian
  • Samoan
  • Scots Gaelic
  • Serbian
  • Sesotho
  • Shona
  • Sindhi
  • Sinhala
  • Slovak
  • Slovenian
  • Somali
  • Spanish
  • Sundanese
  • Swahili
  • Swedish
  • Tajik
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Uzbek
  • Vietnamese
  • Welsh
  • Xhosa
  • Yiddish
  • Yoruba
  • Zulu

Mining process such as classification, clustering of progressive or dynamic data is a critical objective of the information retrieval and knowledge discovery; in particular, it is more sensitive in data stream mining models due to the possibility of significant change in the type and dimensionality of the data over a period. The influence of these changes over the mining process termed as concept drift. The concept drift that depict often in streaming data causes unbalanced performance of the mining models adapted. Hence, it is obvious to boost the mining models to predict and analyse the concept drift to achieve the performance at par best. The contemporary literature evinced significant contributions to handle the concept drift, which fall in to supervised, unsupervised learning, and statistical assessment approaches. This manuscript contributes the detailed review of the contemporary concept-drift detection models depicted in recent literature. The contribution of the manuscript includes the nomenclature of the concept drift models and their impact of imbalanced data tuples.

67 Cites in Articles

References

  1. John Gantz,David Reinsel (2012). The digital universe in 2020: Big data, bigger digital shadows, and biggest growth in the far east.
  2. Jeffrey Schlimmer,Richard Granger (1986). Incremental Learning from Noisy Data.
  3. Gerhard Widmer,Miroslav Kubat (1996). Learning in the Presence of Concept Drift and Hidden Contexts.
  4. Miroslav Kubat,Robert Holte,Stan Matwin (1998). Machine Learning for the Detection of Oil Spills in Satellite Radar Images.
  5. Nathalie Japkowicz,Shaju Stephen (2002). The class imbalance problem: A systematic study1.
  6. Miroslav Kubat,Stan Matwin (1997). Addressing the curse of imbalanced training sets: one-sided selection.
  7. Gustavo Batista,C Ronaldo,Maria Prati,Monard (2004). A study of the behavior of several methods for balancing machine learning training data.
  8. Nhlakanipho Mqadi (2003). Machine learning : a data-point approach to solving misclassifications in the imbalanced Credit Card Datasets.
  9. Sofia Visa,Ancaralescu (2005). Issues in mining imbalanced data sets-a review paper.
  10. Rong Yan (2003). On predicting rare classes with SVM ensembles in scene classification.
  11. Gang Wu,Edward Chang (2003). Class-boundary alignment for imbalanced dataset learning.
  12. Jason Van Hulse,Taghi Khoshgoftaar,Amri Napolitano (2007). Experimental perspectives on learning from imbalanced data.
  13. Nitesh Chawla,K Bowyer,L Hall,W Kegelmeyer (2002). SMOTE: Synthetic Minority Over-sampling Technique.
  14. Victoria López (2013). An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics.
  15. Thomas Hoens,Ryan (2012). Living in an imbalanced world.
  16. Leandro Minku,Xin Yao (2012). DDD: A new ensemble approach for dealing with concept drift.
  17. Leandro Minku,A White,Xin Yao (2010). The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift.
  18. Mark Kelly,David Hand,Niall Adams (1999). The impact of changing populations on classifier performance.
  19. Ryan Elwell,R Polikar (2011). Incremental Learning of Concept Drift in Nonstationary Environments.
  20. T Hoens,Robipolikar Ryan,Nitesh Chawla (2012). Learning from streaming data with concept drift and imbalance: an overview.
  21. Pan,Shirui (2015). Graph ensemble boosting for imbalanced noisy graph stream classification.
  22. Maria Sousa,João Gama,Elísio Brandão (2016). A new dynamic modeling framework for credit risk assessment.
  23. Jing Gao (2008). Classifying data streams with skewed class distributions and concept drifts.
  24. Jing Gao,Wei Fan,Jiawei Han,Philip Yu (2007). A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions.
  25. Ke Wu,Andrea Edwards,Wei Fan,Jing Gao,Kun Zhang (2014). Classifying Imbalanced Data Streams via Dynamic Feature Group Weighting with Importance Sampling.
  26. Sheng Chen,Haibo He (2011). Towards incremental learning of non stationary imbalanced data stream: a multiple selectively recursive approach.
  27. Sheng Chen,Haibo He (2009). Sera: selectively recursive approach towards non stationary imbalanced stream data mining.
  28. Gregory Ditzler,Robipolikar (2013). Incremental learning of concept drift from streaming imbalanced data.
  29. Thomas Hoens,Nitesh Ryan,Chawla (2012). Learning in non-stationary environments with class imbalance.
  30. T Hoens,Nitesh Chawla,Robi Polikar (2011). Heuristic Updatable Weighted Random Subspaces for Non-stationary Environments.
  31. Gregory Ditzler,Manuel Roveri,Cesare Alippi,Robi Polikar (2015). Learning in Nonstationary Environments: A Survey.
  32. Shuo Wang (2013). Concept drift detection for online class imbalance learning.
  33. Heng Wang,Zubin Abraham (2015). Concept drift detection for streaming data.
  34. Dariusz Brzezinski,Jerzy Stefano Wski (2014). Prequential AUC for classifier evaluation and drift detection in evolving data streams.
  35. Dariusz Brzezinski,Jerzy Stefano Wski Prequential AUC: Properties of the Area Under the ROC Curve for Data Streams with Concept Drift.
  36. Bilal Mirza,Zhiping Lin,Nan Liu (2015). Ensemble of subset online sequential extreme learning machine for class imbalance and concept drift.
  37. Adel Ghazikhani,Reza Monsefi,Hadi Sadoghi Yazdi (2013). Recursive least square perceptron model for non-stationary and imbalanced data stream classification.
  38. Ghazi Khani,Reza Adel,Hadi Sadoghi Monsefi,Yazdi (2014). Online neural network model for nonstationary and imbalanced data stream classification.
  39. Wang,Leandro Shuo,Xin Minku,Yao (2013). A learning framework for online class imbalance learning.
  40. W Street,Yongseog Kim (2001). A streaming ensemble algorithm (SEA) for large-scale classification.
  41. Alexey Tsymbal,Mykola Pechenizkiy,Pádraig Cunningham,Seppo Puuronen (2008). Dynamic integration of classifiers for handling concept drift.
  42. Dariusz Brzezinski,Jerzy Stefanowski (2014). Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm.
  43. Robi Polikar,L Upda,S Upda,V Honavar (2001). Learn++: an incremental learning algorithm for supervised neural networks.
  44. George Forman (2006). Tackling concept drift by temporal inductive transfer.
  45. Gavin Brown,Jeremy Wyatt,Rachel Harris,Xin Yao (2005). Diversity creation methods: a survey and categorisation.
  46. E Tang,P Suganthan,X Yao (2006). An analysis of diversity measures.
  47. Gordon Ross (2012). Exponentially weighted moving average charts for detecting concept drift.
  48. Parinaz Sobhani,Hamid Beigy (2011). New drift detection method for data streams.
  49. Stephen Bach,Marcus Malo (2008). Paired learners for concept drift.
  50. Maayan Harel (2014). Concept Drift Detection Through Re sampling.
  51. Ron Kohavi (1995). Table 2: Model selection cross-validation accuracy metrics..
  52. Eduardo Spinosa,André De Leon F. De Carvalho,João Gama (2007). OLINDDA.
  53. Elaine Faria,João Gama,André Cplf Carvalho (2013). Novelty detection algorithm for data streams multi-class problems.
  54. Joung Ryu,Mehmed Kantardzic,Myung-Won Kim,A Ra Khil (2012). An Efficient Method of Building an Ensemble of Classifiers in Streaming Data.
  55. Mohammad Masud (2011). Classification and novel class detection in concept-drifting data streams under time constraints.
  56. Mortezazi Hayat,Mahmoud Hashemi (2010). A dct based approach for detecting novelty and concept drift in data streams.
  57. Tegjyot Sethi,Mehmed Kantardzic,Hanquing Hu (2016). A grid density based framework for classifying streaming data in the presence of concept drift.
  58. Jeonghoon Lee,Frederic Magoules (2012). Detection of Concept Drift for Learning from Stream Data.
  59. Gregory Ditzler,Robipolikar (2011). Hellinger distance based drift detection for nonstationary environments.
  60. Ludmila Kuncheva,William Faithfull (2014). PCA feature extraction for change detection in multidimensional unlabeled data.
  61. Abdulhakim Qahtan (2015). A pca-based change detection framework for multidimensional data streams: Change detection in multidimensional data streams.
  62. Anton Dries,Ulrich Rückert (2009). Adaptive concept drift detection.
  63. Patrick Lindstrom,Brian Mac Namee,Sarah Delany (2013). Drift detection using uncertainty distribution divergence.
  64. Mark Dredze,Koby Crammer (2010). Online methods for multi-domain learning and adaptation.
  65. Indre Žliobaite (2010). Change with Delayed Labeling: When is it Detectable?.
  66. João Gama,Indrė Žliobaitė,Albert Bifet,Mykola Pechenizkiy,Abdelhamid Bouchachia (2014). A survey on concept drift adaptation.
  67. Indre Zliobaite,Albert Bifet,Mohamed Gaber,Bogdan Gabrys,Joao Gama,Leandro Minku,Katarzyna Musial (2012). Next challenges for adaptive learning systems.

Funding

No external funding was declared for this work.

Conflict of Interest

The authors declare no conflict of interest.

Ethical Approval

No ethics committee approval was required for this article type.

Data Availability

Not applicable for this article.

B. Ramakrishna. 2017. \u201cConcept Drift Detection in Data Stream Mining: The Review of Contemporary Literature\u201d. Global Journal of Computer Science and Technology - C: Software & Data Engineering GJCST-C Volume 17 (GJCST Volume 17 Issue C2): .

Download Citation

Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Keywords
Classification
GJCST-C Classification: C.1.1, H.2.8
Version of record

v1.2

Issue date

August 5, 2017

Language

English

Experiance in AR

The methods for personal identification and authentication are no exception.

Read in 3D

The methods for personal identification and authentication are no exception.

Article Matrices
Total Views: 6550
Total Downloads: 1804
2026 Trends
Research Identity (RIN)
Related Research

Published Article

Mining process such as classification, clustering of progressive or dynamic data is a critical objective of the information retrieval and knowledge discovery; in particular, it is more sensitive in data stream mining models due to the possibility of significant change in the type and dimensionality of the data over a period. The influence of these changes over the mining process termed as concept drift. The concept drift that depict often in streaming data causes unbalanced performance of the mining models adapted. Hence, it is obvious to boost the mining models to predict and analyse the concept drift to achieve the performance at par best. The contemporary literature evinced significant contributions to handle the concept drift, which fall in to supervised, unsupervised learning, and statistical assessment approaches. This manuscript contributes the detailed review of the contemporary concept-drift detection models depicted in recent literature. The contribution of the manuscript includes the nomenclature of the concept drift models and their impact of imbalanced data tuples.

Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]
×

This Page is Under Development

We are currently updating this article page for a better experience.

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

Concept Drift Detection in Data Stream Mining: The Review of Contemporary Literature

B. Ramakrishna
B. Ramakrishna JNTU
Dr. S Krishna Mohan Rao
Dr. S Krishna Mohan Rao

Research Journals