A genetic-based feature construction method for data summarisation

Rayner Alfred, (2008) A genetic-based feature construction method for data summarisation. In: 4th International Conference on Advanced Data Mining and Applications (ADMA 2008), 8-10 October 2008, Chengdu, China.

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1007/978-3-540-88192-6-6


The importance of input representation has been recognised already in machine learning. This paper discusses the application of genetic-based feature construction methods to generate input data for the data summarisation method called Dynamic Aggregation of Relational Attributes (DARA). Here, feature construction methods are applied in order to improve the descriptive accuracy of the DARA algorithm. The DARA algorithm is designed to summarise data stored in the non-target tables by clustering them into groups, where multiple records stored in non-target tables correspond to a single record stored in a target table. This paper addresses the question whether or not the descriptive accuracy of the DARA algorithm benefits from the feature construction process. This involves solving the problem of constructing a relevant set of features for the DARA algorithm by using a genetic-based algorithm. This work also evaluates several scoring measures used as fitness functions to find the best set of constructed features. © 2008 Springer-Verlag Berlin Heidelberg.

Item Type:Conference Paper (UNSPECIFIED)
Uncontrolled Keywords:Clustering, Data Summarisation, Feature Construction, Genetic Algorithm
Subjects:?? QA75-76.95 ??
Divisions:SCHOOL > School of Engineering and Information Technology
ID Code:1626
Deposited By:IR Admin
Deposited On:24 Mar 2011 14:46
Last Modified:30 Dec 2014 14:47

Repository Staff Only: item control page

Browse Repository
   UMS News
Quick Search

   Latest Repository

Link to other Malaysia University Institutional Repository

Malaysia University Institutional Repository