Title: | A novel approach for mining closed clickstream patterns
Authors: | Huynh, Bao; Nguyen, Loan T. T.; Huynh, Minh Huy; Kozierkiewicz, Adrianna; Yun, Unil; Komínková Oplatková, Zuzana; Vo, Bay
Document type: | Peer-reviewed scholarly article (English)
Source document: | Cybernetics and Systems. 2021
ISSN: | 0196-9722
DOI: | https://doi.org/10.1080/01969722.2020.1871225
Abstract: | Closed sequential pattern (CSP) mining is an optimization of sequential pattern mining: closed patterns give a more compact representation of the result set, and mining CSPs requires much less runtime and memory than mining all sequential patterns. The task has attracted considerable research attention. In this study, we propose CCPC, a novel approach for closed clickstream pattern mining using the C-List data structure. Closed clickstream pattern mining is a more specific form of CSP mining that has received little research attention; nevertheless, it has promising applications in various fields. CCPC consists of two key steps. First, it builds the SPPC-tree and the C-List for each frequent 1-pattern and determines all frequent closed clickstream 1-patterns. Next, it constructs the C-List for each frequent k-pattern and mines the remaining frequent closed k-patterns. The proposed method is optimized by modifying the SPPC-tree structure and by adding a new property to each node element in both the SPPC-tree and the C-Lists to quickly prune nonclosed clickstream patterns. Experimental results on several datasets show that the proposed method outperforms previous techniques, improving runtime and memory usage in most cases, especially with low minimum support thresholds on very large databases. © 2021 Taylor & Francis Group, LLC.
Full text: | https://www.tandfonline.com/doi/full/10.1080/01969722.2020.1871225
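Note on the abstract: the term "closed" can be illustrated with a small brute-force sketch. This is not the CCPC algorithm and uses neither the SPPC-tree nor C-Lists; it only shows which patterns a closed-pattern miner is expected to return. It assumes that a clickstream is an ordered tuple of page identifiers, that a pattern matches a clickstream when it occurs as a subsequence (gaps allowed), and that support is the number of clickstreams containing the pattern. The function names and the toy database are hypothetical.

```python
from itertools import combinations


def is_subsequence(pattern, sequence):
    """Return True if `pattern` occurs in `sequence` in order (gaps allowed)."""
    it = iter(sequence)
    # `item in it` advances the iterator, so matches must appear in order.
    return all(item in it for item in pattern)


def frequent_patterns(database, min_support):
    """Brute force: enumerate every subsequence and keep the frequent ones."""
    candidates = set()
    for seq in database:
        for k in range(1, len(seq) + 1):
            for idx in combinations(range(len(seq)), k):
                candidates.add(tuple(seq[i] for i in idx))
    result = {}
    for pattern in candidates:
        support = sum(is_subsequence(pattern, seq) for seq in database)
        if support >= min_support:
            result[pattern] = support
    return result


def closed_patterns(frequent):
    """Keep patterns that have no longer super-pattern with the same support."""
    closed = {}
    for p, sup_p in frequent.items():
        absorbed = any(
            len(q) > len(p) and sup_q == sup_p and is_subsequence(p, q)
            for q, sup_q in frequent.items()
        )
        if not absorbed:
            closed[p] = sup_p
    return closed


# Toy clickstream database (assumed data, for illustration only).
database = [("a", "b", "c"), ("a", "c"), ("a", "b", "c"), ("b", "c")]
frequent = frequent_patterns(database, min_support=2)
closed = closed_patterns(frequent)
```

With this toy database, 7 patterns are frequent but only 4 are closed: for example, `("a",)` has support 3 and is absorbed by `("a", "c")`, which also has support 3. The sketch enumerates all subsequences, so it is exponential in sequence length and is suitable only for tiny inputs.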