Sequential pattern mining using personalized minimum support threshold with minimum items

Suraya Alias and Mohd Norhisham Razali and Tan, Soo Fun and Mohd Shamrie Sainin (2011) Sequential pattern mining using personalized minimum support threshold with minimum items. In: 2011 International Conference on Research and Innovation in Information Systems, ICRIIS'11, 23-24 November 2011, Kuala Lumpur, Malaysia.

Full text not available from this repository.

Abstract

One of the challenges of Sequential Pattern Mining is finding frequent sequential patterns in a huge click stream data (web logs) since the data has the issue of a very low support distribution. By applying a Frequent Pattern Discovery technique, a sequence is considered as frequent if it occurs more than the minimum support (min sup) threshold value. The conventional method of assuming one min sup value is valid for all levels of k-sequence, may have an impact on the overall results or pattern generation. In this paper, a personalized minimum support (P-minsup) threshold with user specified minimum items or min-i is introduced. The P-minsup is generated for each k-sequence by analyzing the overall support pattern distribution of the click stream data; while the min-i value gives the user the flexibility to gain control on the number of patterns to be generated on the next k-sequence by using the top min-i items. This approach is then applied in the SPADE Algorithm using vector array as an extension from the previous method of using relational database and pre-defined threshold. The result from this experiment demonstrates that P-minsup with the complement of min-i value approach is applicable in assisting the process of determining the suitable threshold value to be used in detecting users' frequent k-sequential topics in navigating the World Wide Web (WWW).

Item Type: Conference or Workshop Item (UNSPECIFIED)
Keyword: Sequential Pattern, Web Mining, Clickstreams, Conventional methods, Frequent pattern discovery, Frequent sequential patterns, Minimum support, Minimum support thresholds, Pattern Generation, Relational Database, Sequential patterns, Sequential-pattern mining, Vector arrays, Web Mining, Weblogs
Subjects: Q Science > QA Mathematics
T Technology > T Technology (General) > T1-995 Technology (General) > T55.4-60.8 Industrial engineering. Management engineering > T58.5-58.64 Information technology
Department: SCHOOL > School of Engineering and Information Technology
Depositing User: ADMIN ADMIN
Date Deposited: 24 Jul 2012 10:04
Last Modified: 30 Dec 2014 09:39
URI: https://eprints.ums.edu.my/id/eprint/4603

Actions (login required)

View Item View Item