Leveraging the Christoffel function for outlier detection in data streams - Equipe diagnostic, supervision et COnduite
Article Dans Une Revue International Journal of Data Science and Analytics Année : 2024

Leveraging the Christoffel function for outlier detection in data streams

Résumé

Outlier detection holds significant importance in the realm of data mining, particularly with the growing pervasiveness of data acquisition methods. The ability to identify outliers in data streams is essential for maintaining data quality and detecting faults. However, dealing with data streams presents challenges due to the non-stationary nature of distributions and the ever-increasing data volume. While numerous methods have been proposed to tackle this challenge, a common drawback is the lack of straightforward parameterization in many of them. This article introduces two novel methods: DyCF and DyCG. DyCF leverages the Christoffel function from the theory of approximation and orthogonal polynomials. Conversely, DyCG capitalizes on the growth properties of the Christoffel function, eliminating the need for tuning parameters. Both approaches are firmly rooted in a well- defined algebraic framework, meeting crucial demands for data stream processing, with a specific focus on addressing low-dimensional aspects and maintaining data history without memory cost. A comprehensive comparison between DyCF, DyCG, and state-of-the-art methods is presented, using both synthetic and real industrial data streams. The results show that DyCF outperforms fine-tuning methods, offering superior performance in terms of execution time and memory usage. DyCG performs less well, but has the considerable advantage of requiring no tuning at all.
Fichier principal
Vignette du fichier
Int_J_Data_Science_and_Analytics_premium(version_auteur).pdf (9.48 Mo) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04630422 , version 1 (01-07-2024)

Identifiants

Citer

Kévin Ducharlet, Louise Travé-Massuyès, Jean-Bernard Lasserre, Marie-Véronique Le Lann, Youssef Miloudi. Leveraging the Christoffel function for outlier detection in data streams. International Journal of Data Science and Analytics, inPress, pp.doi.org/10.1007/s41060-024-00581-2. ⟨10.1007/s41060-024-00581-2⟩. ⟨hal-04630422⟩
291 Consultations
15 Téléchargements

Altmetric

Partager

More