Presentation + Paper
22 April 2020 Emergent pattern detection algorithm for big data streams
Author Affiliations +
Abstract
Pattern detection is an active field in big data streams analytics with numerous ongoing challenges. Actually, due to the great velocity and variety of data, new patterns can appear and change over time. Existing state-of-the- art solutions consist in updating the pattern detection model regularly in order to integrate newly appeared and validated patterns. However, in several applications, such as security and defense, patterns can represent anomalies. Therefore, it becomes crucial to detect new patterns (i.e. new anomalies), as early as possible, in order to react at the right moment. Consequently, emergent pattern detection becomes a very challenging task. To tackle this challenge, we propose EPDA (Emergent Pattern Detection Algorithm): a new and validated algorithm for detecting emergent patterns in data streams. The originality of EPDA consists in exploiting frequent pattern mining techniques by proposing new statistical measures in order to estimate the evolution of emergent patterns over time. To perform this detection in a real-time, EPDA runs on the well-known Apache STORM distributed real-time computation system. To better fit our algorithm, we propose a new Apache STORM topology which is composed of one Spouts level and two Bolts levels. Experiments on a real data stream have shown the relevance of the proposed measures and the efficiency of our algorithm in a prediction task and in terms of execution time.
Conference Presentation
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Lina Fahed and Ayman Alfalou "Emergent pattern detection algorithm for big data streams", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000M (22 April 2020); https://doi.org/10.1117/12.2558536
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data mining

Detection and tracking algorithms

Mining

Data processing

Data modeling

Distance measurement

Analytics

RELATED CONTENT


Back to Top