A Novel Approach for Semantic Extractive Text Summarization

Waseemullah and Zainab Fatima and Shehnila Zardari and Muhammad Fahim and Maria Andleeb Siddiqui and Ag. Asri Ag. Ibrahim and Kashif Nisar and Laviza Falak Naz (2022) A Novel Approach for Semantic Extractive Text Summarization. Applied Sciences, 12. pp. 1-14. ISSN 2076-3417

[img] Text
A Novel Approach for Semantic Extractive Text Summarization.pdf

Download (36kB)
[img] Text
A Novel Approach for Semantic Extractive Text Summarization1.pdf
Restricted to Registered users only

Download (3MB)

Abstract

Text summarization is a technique for shortening down or exacting a long text or document. It becomes critical when someone needs a quick and accurate summary of very long content. Manual text summarization can be expensive and time-consuming. While summarizing, some important content, such as information, concepts, and features of the document, can be lost; therefore, the retention ratio, which contains informative sentences, is lost, and if more information is added, then lengthy texts can be produced, increasing the compression ratio. Therefore, there is a tradeoff between two ratios (compression and retention). The model preserves or collects all the informative sentences by taking only the long sentences and removing the short sentences with less of a compression ratio. It tries to balance the retention ratio by avoiding text redundancies and also filters irrelevant information from the text by removing outliers. It generates sentences in chronological order as the sentences are mentioned in the original document. It also uses a heuristic approach for selecting the best cluster or group, which contains more meaningful sentences that are present in the topmost sentences of the summary. Our proposed model extractive summarizer overcomes these deficiencies and tries to balance between compression and retention ratios.

Item Type: Article
Keyword: Text mining , Text summarization , Text extraction , Semantic text extraction
Subjects: Q Science > QA Mathematics > QA1-939 Mathematics > QA71-90 Instruments and machines > QA75.5-76.95 Electronic computers. Computer science
Department: FACULTY > Faculty of Computing and Informatics
Depositing User: SITI AZIZAH BINTI IDRIS -
Date Deposited: 16 Jul 2022 10:59
Last Modified: 16 Jul 2022 10:59
URI: https://eprints.ums.edu.my/id/eprint/33238

Actions (login required)

View Item View Item