BioDARA: data summarization approach to extracting bio-medical structuring information

Chung Seng Kheau and Rayner Alfred and Joe Henry Obit (2011) BioDARA: data summarization approach to extracting bio-medical structuring information. Journal of Computer Science, 7. pp. 1914-1920. ISSN 1549-3636 (P-ISSN) ,1552-6607 (E-ISSN)

[img] Text
BioDARA_data summarization approach to extracting bio-medical structuring information ABSTRACT.pdf

Download (62kB)
[img] Text
BioDARA_ data summarization approach to extracting bio-medical structuring information FULL TEXT.pdf
Restricted to Registered users only

Download (97kB) | Request a copy

Abstract

Problem statement: Due to the ever growing amount of biomedical datasets stored in multiple tables, Information Extraction (IE) from these datasets is increasingly recognized as one of the crucial technologies in bioinformatics. However, for IE to be practically applicable, adaptability of a system is crucial, considering extremely diverse demands in biomedical IE application. One should be able to extract a set of hidden patterns from these biomedical datasets at low cost. Approach: In this study, a new method is proposed, called Bio-medical Data Aggregation for Relational Attributes (BioDARA), for automatic structuring information extraction for biomedical datasets. BioDARA summarizes biomedical data stored in multiple tables in order to facilitate data modeling efforts in a multi-relational setting. BioDARA has the advantages or capabilities to transform biomedical data stored in multiple tables or databases into a Vector Space model, summarize biomedical data using the Information Retrieval theory and finally extract frequent patterns that describe the characteristics of these biomedical datasets. Results: the results show that data summarization performed by DARA, can be beneficial in summarizing biomedical datasets in a complex multi-relational environment, in which biomedical datasets are stored in a multi-level of one-to-many relationships and also in the case of datasets stored in more than one one-to-many relationships with non-target tables. Conclusion: This study concludes that data summarization performed by BioDARA, can be beneficial in summarizing biomedical datasets in a complex multi-relational environment, in which biomedical datasets are stored in a multi-level of one-to-many relationships.

Item Type: Article
Keyword: Information extraction , Data summarization , Relational data mining , Relational database , Biomedical datasets , Summarization performed , Datasets stored , Multiple tables , Relational attributes
Subjects: R Medicine > R Medicine (General) > R5-920 Medicine (General) > R856-857 Biomedical engineering. Electronics. Instrumentation
Department: FACULTY > Faculty of Computing and Informatics
Depositing User: SAFRUDIN BIN DARUN -
Date Deposited: 20 Sep 2021 09:51
Last Modified: 20 Sep 2021 09:51
URI: https://eprints.ums.edu.my/id/eprint/29060

Actions (login required)

View Item View Item