dc.title | Efficient algorithms for mining clickstream patterns using pseudo-IDLists | en |
dc.contributor.author | Huynh, Huy M. | |
dc.contributor.author | Nguyen, Loan T.T. | |
dc.contributor.author | Vo, Bay | |
dc.contributor.author | Yun, Unil | |
dc.contributor.author | Komínková Oplatková, Zuzana | |
dc.contributor.author | Hong, Tzung-Pei | |
dc.relation.ispartof | Future Generation Computer Systems | |
dc.identifier.issn | 0167-739X | |
dc.date.issued | 2020 | |
utb.relation.volume | 107 | |
dc.citation.spage | 18 | |
dc.citation.epage | 30 | |
dc.type | article | |
dc.language.iso | en | |
dc.publisher | Elsevier B.V. | |
dc.identifier.doi | 10.1016/j.future.2020.01.034 | |
dc.relation.uri | https://www.sciencedirect.com/science/article/pii/S0167739X19314475 | |
dc.subject | sequential pattern mining | en |
dc.subject | clickstream pattern mining | en |
dc.subject | candidate pruning | en |
dc.subject | vertical format | en |
dc.description.abstract | Sequential pattern mining is an important task in data mining. Its subproblem, clickstream pattern mining, is attracting more research due to the growth of the Internet and the need to analyze online customer behavior. To date, only a few works have been dedicated to the problem of mining clickstream patterns. Although one approach is to use general algorithms for sequential pattern mining, those algorithms’ performance may suffer, and they require more resources than a dedicated method for mining clickstreams would. In this paper, we present pseudo-IDList, a novel data structure that is more suitable for clickstream pattern mining. Based on this structure, a vertical format algorithm named CUP (Clickstream pattern mining Using Pseudo-IDList) is proposed. Furthermore, we propose a pruning heuristic named DUB (Dynamic intersection Upper Bound) to improve our proposed algorithm. Experiments on four real-life clickstream databases show that our proposed methods are effective and efficient in terms of runtime and memory consumption. © 2020 Elsevier B.V. | en |
utb.faculty | Faculty of Applied Informatics | |
dc.identifier.uri | http://hdl.handle.net/10563/1009572 | |
utb.identifier.obdid | 43881213 | |
utb.identifier.scopus | 2-s2.0-85078857727 | |
utb.identifier.wok | 000527331800002 | |
utb.identifier.coden | FGCSE | |
utb.source | j-scopus | |
dc.date.accessioned | 2020-03-02T12:28:36Z | |
dc.date.available | 2020-03-02T12:28:36Z | |
dc.description.sponsorship | Vietnam National Foundation for Science and Technology Development (NAFOSTED) [02/2019/TN] | |
utb.contributor.internalauthor | Komínková Oplatková, Zuzana | |
utb.fulltext.affiliation | Huy M. Huynh a, Loan T.T. Nguyen b,g, Bay Vo c∗, Unil Yun d, Zuzana Komínková Oplatková e, Tzung-Pei Hong f a Institute of Research and Development, Duy Tan University, Da Nang 550000, Viet Nam b School of Computer Science and Engineering, International University, Ho Chi Minh City, Viet Nam c Faculty of Information Technology, Ho Chi Minh City University of Technology (HUTECH), Ho Chi Minh City, Viet Nam d Department of Computer Engineering, Sejong University, Seoul, Republic of Korea e Faculty of Applied Informatics, Tomas Bata University in Zlín, Nám. T.G. Masaryka 5555, Zlín, Czech Republic f Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan g Vietnam National University, Ho Chi Minh City, Viet Nam ∗ Corresponding author. E-mail addresses: huy.hm88@gmail.com (H.M. Huynh), nttloan@hcmiu.edu.vn (L.T.T. Nguyen), vd.bay@hutech.edu.vn (B. Vo), yunei@sejong.ac.kr (U. Yun), oplatkova@utb.cz (Z.K. Oplatková), tphong@nuk.edu.tw (T.-P. Hong). | |
utb.fulltext.dates | Received 2 June 2019 Received in revised form 18 December 2019 Accepted 20 January 2020 Available online 24 January 2020 | |
utb.fulltext.sponsorship | This research is funded by the Vietnam National Foundation for Science and Technology Development (NAFOSTED) under grant number 02/2019/TN. | |
utb.wos.affiliation | [Huynh, Huy M.] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam; [Nguyen, Loan T. T.] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam; [Vo, Bay] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam; [Yun, Unil] Sejong Univ, Dept Comp Engn, Seoul, South Korea; [Oplatkova, Zuzana Kominkova] Tomas Bata Univ Zlin, Fac Appl Informat, Nam TG Masaryka 5555, Zlin, Czech Republic; [Hong, Tzung-Pei] Natl Univ Kaohsiung, Dept Comp Sci & Informat Engn, Kaohsiung, Taiwan; [Nguyen, Loan T. T.] Vietnam Natl Univ, Ho Chi Minh City, Vietnam | |
utb.scopus.affiliation | Institute of Research and Development, Duy Tan University, Da Nang, 550000, Viet Nam; School of Computer Science and Engineering, International University, Ho Chi Minh City, Viet Nam; Faculty of Information Technology, Ho Chi Minh City University of Technology (HUTECH), Ho Chi Minh City, Viet Nam; Department of Computer Engineering, Sejong University, Seoul, South Korea; Faculty of Applied Informatics, Tomas Bata University in Zlín, Nám. T.G. Masaryka 5555, Zlín, Czech Republic; Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan; Vietnam National University, Ho Chi Minh City, Viet Nam | |
utb.fulltext.projects | 02/2019/TN | |
utb.fulltext.faculty | Faculty of Applied Informatics |