Real-Time Outlier Detection in Fast-Moving Data Streams

Main Article Content

  Eka Puji Agustini
  Mohd Zaki Zakaria

Abstract

Anomaly detection is a critical task in various fields such as finance, healthcare, network monitoring, and sensor data analysis, where identifying unusual patterns or outliers in data streams is essential for timely decision-making. Two commonly used techniques for anomaly detection are the Moving Average (MA) and Exponential Moving Average (EMA) methods. Despite their widespread use, selecting the appropriate method depends on the nature of the data and the requirements of the system. This paper presents a comparative analysis of MA and EMA for anomaly detection, focusing on critical factors such as speed of detection, stability, precision and recall, false positive rate, and computational efficiency. This research addresses the problem of determining which method, MA or EMA, is better suited for specific types of data, particularly in streaming environments with varying trends and anomalies. The results of our comparison indicate that EMA performs better in dynamic environments where rapid identification of anomalies is critical, such as financial markets or network traffic analysis. It quickly detects sudden deviations but may flag minor fluctuations as false positives due to its sensitivity. MA, on the other hand, is more stable and computationally efficient, with a lower false positive rate, making it more suitable for applications where long-term trend monitoring is required, and stability is prioritized over speed. This research highlights the strengths and weaknesses of both methods, demonstrating that the choice between MA and EMA should be based on the specific needs of the anomaly detection system. For real-time, high-speed environments, EMA offers a more responsive solution, while MA provides better stability and efficiency in long-term monitoring. A hybrid approach combining both methods could offer a more robust solution, adapting to different types of data and detection requirements.

Article Details

How to Cite
Agustini, E. P., & Zakaria, M. Z. (2024). Real-Time Outlier Detection in Fast-Moving Data Streams. International Journal of Advances in Artificial Intelligence and Machine Learning, 1(1), 19–27. https://doi.org/10.58723/ijaaiml.v1i1.287
Section
Articles

References

BoltzmannBrain. (2016). Numenta Anomaly Benchmark (NAB). https://www.kaggle.com/

datasets/boltzmannbrain/nab

Braei, M., & Wagner, S. (2020). Anomaly Detection in Univariate Time-series: A Survey on the State-of-the-Art. https://doi.org/10.48550/arXiv.2004.00433

Cai, Z., Ravichandran, A., Maji, S., Fowlkes, C., Tu, Z., & Soatto, S. (2021). Exponential Moving Average Normalization for Self-supervised and Semi-supervised Learning.

Ejder, U., & Özel, S. A. (2024). A novel distance-based moving average model for improvement in the predictive accuracy of financial time series. Borsa Istanbul Review, 24(2), 376–397. https://doi.org/https://doi.org/10.1016/j.bir.2024.01.011

Hansun, S. (2013). A new approach of moving average method in time series analysis. 2013 Conference on New Media Studies (CoNMedia), 1–4. https://doi.org/10.1109/CoNMedia.2013.6708545

Ivanovski, Z., & Milenkovski Zoran Narasanov, A. (2018). Time Series Forecasting Using a Moving Average Model for Extrapolation of Number of Tourist. In UTMS Journal of Economics (Vol. 9, Issue 2). https://hdl.handle.net/10419/281836

Karasu, S., Altan, A., Bekiros, S., & Ahmad, W. (2020). A new forecasting model with wrapper-based feature selection approach using multi-objective optimization technique for chaotic crude oil time series. Energy, 212, 118750. https://doi.org/https://doi.org/ 10.1016/j.energy.2020.118750

Kim, T., Solanki, V. S., Baraiya, H. J., Mitra, A., Shah, H., & Roy, S. (2020). A Smart, Sensible Agriculture System Using the Exponential Moving Average Model. Symmetry, 12(3). https://doi.org/10.3390/sym12030457

Nakano, M., Takahashi, A., & Takahashi, S. (2017). Generalized exponential moving average (EMA) model with particle filtering and anomaly detection. Expert Systems with Applications, 73, 187–200. https://doi.org/10.1016/j.eswa.2016.12.034

Siffer, A., Fouque, P.-A., Termier, A., & Largouët, C. (2017). Anomaly Detection in Streams with Extreme Value Theory. https://doi.org/10.1145/3097983.3098144

van Rossum, H. (2019). Moving average quality control: principles, practical application and future perspectives. Clinical Chemistry and Laboratory Medicine (CCLM), 57(6), 773-782. https://doi.org/10.1515/cclm-2018-0795