CFP last date
15 May 2024
Reseach Article

Semantic based Text Summarization using Universal Networking Language

by S. Mangairkarasi, S. Gunasundari
International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
Volume 3 - Number 8
Year of Publication: 2012
Authors: S. Mangairkarasi, S. Gunasundari
10.5120/ijais12-450580

S. Mangairkarasi, S. Gunasundari . Semantic based Text Summarization using Universal Networking Language. International Journal of Applied Information Systems. 3, 8 ( August 2012), 18-23. DOI=10.5120/ijais12-450580

@article{ 10.5120/ijais12-450580,
author = { S. Mangairkarasi, S. Gunasundari },
title = { Semantic based Text Summarization using Universal Networking Language },
journal = { International Journal of Applied Information Systems },
issue_date = { August 2012 },
volume = { 3 },
number = { 8 },
month = { August },
year = { 2012 },
issn = { 2249-0868 },
pages = { 18-23 },
numpages = {9},
url = { https://www.ijais.org/archives/volume3/number8/251-0580/ },
doi = { 10.5120/ijais12-450580 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2023-07-05T10:46:08.081035+05:30
%A S. Mangairkarasi
%A S. Gunasundari
%T Semantic based Text Summarization using Universal Networking Language
%J International Journal of Applied Information Systems
%@ 2249-0868
%V 3
%N 8
%P 18-23
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Text Summarization is extracting the important information from the document by leaving out the irrelevant information, and to reduce the details and collects them in a compressed way. Normally text summarization is done in single or multi documents. The advantage on processing time can be achieved in the text summarization. Converting English sentences into expressions or Interlingua is called Universal Networking Language (UNL). The given source document is preprocessed by eliminating tables and images. The preprocessed document is fed into sentence splitter and then to word separator. The given word is sent to Morphological Analyzer to find the root word. This root word is fed into UNL dictionary for finding the corresponding concepts and attributes. By using the heuristic rules, we identify the relations between concepts. UNL represents knowledge in the form of graphical format, where nodes represent concepts and links represent relations between concepts. It represents the whole document not the sentences in particular. The graph algorithm is used to find the weight age of links connected to the Universal Word. According to the highest weight age the document is summarized.

References
  1. Book-Natural Language processing by Akshar Bharati and Rajeev Sangal 2]http://en. wikipedia. org/wiki/Universal_Networking_Language
  2. http://en. wikipedia. org/wiki/Automatic_summarization
  3. Shanmugasundaram Hariharan , "Extraction based multi document summarization Using single document summary Cluster" , Advance Soft computing Application of vol. 2/vol. 2. 1. 1. March 2010. ISSN 2074-8523, ICSRS publication.
  4. Bracewell,D. B. , Kuriowa,S. , Ren,F. ," Multilingual single document keyword extraction for information retrieval "IEEE proceeding of International conference on Natural Language Processing and Knowledge Engineering, 2005. 517 – 522
  5. Amitabhu Mukerjee, Achla M Raina, Kumar Kapil, Pankaj Goyal, Pushpraj Shukla, Universal Networking Language- A tool for language independent semantics?" International conference on the convergence of Knowledge, 2003.
  6. Martins, C. B. , and Rino, L. H. M. (2002),"Revisiting UNLSumm Improvement through a case study", Workshop on Multilingual Information Access and Natural Language Processing, IBERAMIA'2002. (ISBN 84-607-6057-X)
  7. Jade Goldstein, Vibhu Mittal, Jaime carbonell, Mark Kantrowitz,"Multi-Document Summarization by sentence Extraction",Workshop on Automatic Summarization,ANLP/NAACL'2000.
  8. Gragomir R. Radev, HongYan Jing, Malgorzata Budzikowska, " Centroid-based Summarization of multiple documents: Sentence extraction, Utility-Based evaluation, and User-studies", Citeseerx. ist. psu. edu/viewdoc/download?doi=10. 1. 1. 75. 4383.
  9. Hien Nguyen;Santos,E; Russell,J. "Evaluation of the Impact of User-Cognitive Styles on the Assessment of Text Summarization", IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans,2011. Volume: 41, Issue: 6, 1038-1051
  10. Soe-Tsyr Yuan;Jerry Sun " Ontology-based Structured cosine similarity in document Summarization: with applications to mobile audio-based knowledge management", IEEE Transactions on Systems, Man and Cybernetics, Part B: Cybernetics, 2005. Volume:35, Issue:5, 1028-1040
  11. Xiaohua Hu; Park, E. K. ; Xiaodan Zhang, "Microarray Gene Cluster Identification and annotation Through Cluster Ensemble and EM-based Informative", IEEE Transactions on Information Technology in Biomedicine,2009. Volume: 13, Issue: 5, 832-840
  12. Devasena. C. L. ;Hemalatha,M. "Automatic Text categorization and summarization using rule reduction", International Conference on Advances in Engineering, Science and Management (ICAESM) 2012, 594-598.
  13. Suanmali,L;Salim,N;Binwahlan,M. S. ,"Fuzzy Genetic Semantic Based Text Summarization", IEEE Ninth International Conference on Dependable, Automatic and Secure Computing (DASC),2011. 1184-1191.
Index Terms

Computer Science
Information Sciences

Keywords

Interlingua Document preprocessor UNL dictionary Morphological Analyzer