Sequential pattern mining using personalized minimum support threshold with minimum items

Suraya Alias, and Mohd Norhisham Razali, and Tan, Soo Fun and Mohd Shamrie Sainin, (2011) Sequential pattern mining using personalized minimum support threshold with minimum items. In: 2011 International Conference on Research and Innovation in Information Systems, ICRIIS'11, 23-24 November 2011, Kuala Lumpur, Malaysia.

Full text not available from this repository.

Official URL: http://dx.doi.org/ 10.1109/ICRIIS.2011.6125688

Abstract

One of the challenges of Sequential Pattern Mining is finding frequent sequential patterns in a huge click stream data (web logs) since the data has the issue of a very low support distribution. By applying a Frequent Pattern Discovery technique, a sequence is considered as frequent if it occurs more than the minimum support (min sup) threshold value. The conventional method of assuming one min sup value is valid for all levels of k-sequence, may have an impact on the overall results or pattern generation. In this paper, a personalized minimum support (P-minsup) threshold with user specified minimum items or min-i is introduced. The P-minsup is generated for each k-sequence by analyzing the overall support pattern distribution of the click stream data; while the min-i value gives the user the flexibility to gain control on the number of patterns to be generated on the next k-sequence by using the top min-i items. This approach is then applied in the SPADE Algorithm using vector array as an extension from the previous method of using relational database and pre-defined threshold. The result from this experiment demonstrates that P-minsup with the complement of min-i value approach is applicable in assisting the process of determining the suitable threshold value to be used in detecting users' frequent k-sequential topics in navigating the World Wide Web (WWW).

Item Type:Conference Paper (UNSPECIFIED)
Uncontrolled Keywords:Sequential Pattern, Web Mining, Clickstreams, Conventional methods, Frequent pattern discovery, Frequent sequential patterns, Minimum support, Minimum support thresholds, Pattern Generation, Relational Database, Sequential patterns, Sequential-pattern mining, Vector arrays, Web Mining, Weblogs
Subjects:Q Science > QA Mathematics
?? T58.5-58.64 ??
Divisions:SCHOOL > School of Engineering and Information Technology
ID Code:4603
Deposited By:IR Admin
Deposited On:24 Jul 2012 10:04
Last Modified:30 Dec 2014 09:39

Repository Staff Only: item control page


Browse Repository
Collection
   Articles
   Book
   Speeches
   Thesis
   UMS News
Search
Quick Search

   Latest Repository

Link to other Malaysia University Institutional Repository

Malaysia University Institutional Repository