Research Article

N-layer Approach to Web Information Retrieval

by Jayant Gadge, S.s. Sane, H.b. Kekre
International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
Volume 5 - Number 1
Year of Publication: 2013
Authors: Jayant Gadge, S.s. Sane, H.b. Kekre
10.5120/ijais12-450840

Jayant Gadge, S.s. Sane, H.b. Kekre. N-layer Approach to Web Information Retrieval. International Journal of Applied Information Systems. 5, 1 (January 2013), 45-49. DOI=10.5120/ijais12-450840

@article{ 10.5120/ijais12-450840,
author = { Jayant Gadge, S.s. Sane, H.b. Kekre },
title = { N-layer Approach to Web Information Retrieval },
journal = { International Journal of Applied Information Systems },
issue_date = { January 2013 },
volume = { 5 },
number = { 1 },
month = { January },
year = { 2013 },
issn = { 2249-0868 },
pages = { 45-49 },
numpages = {9},
url = { https://www.ijais.org/archives/volume5/number1/410-0840/ },
doi = { 10.5120/ijais12-450840 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2023-07-05T16:00:53.298362+05:30
%A Jayant Gadge
%A S.s. Sane
%A H.b. Kekre
%T N-layer Approach to Web Information Retrieval
%J International Journal of Applied Information Systems
%@ 2249-0868
%V 5
%N 1
%P 45-49
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In web information retrieval, terms or keywords are used to index documents. These terms may appear in special locations such as the title, subtitles, headers, and hyperlinks. The vector space model ignores the position of a term when calculating its weight as an indexing term. The effectiveness of the vector space model depends crucially on the weights applied to the terms of the document vectors. These weights are computed using a term weight evaluation scheme based on the frequency of the terms in the document and in the collection. Terms that occur more often in a document are treated as more important, whereas terms that occur less frequently throughout the collection are given a higher weight. The N-layer vector space approach takes the position of terms into account. The web document is logically divided into N layers according to its structure, and weights are assigned to terms based on the layer in which they appear. Different weight evaluation schemes proposed for the vector space model are applied to the N-layer vector space model and compared. The N-layer vector space model gives better results than the vector space model. With cosine similarity and all six weight evaluation methods, formed from different local and global weights, the average precision and average recall of the N-layer vector space model are always better than those of the vector space model.
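The idea of layer-dependent term weights can be illustrated with a minimal Python sketch. It is not the authors' implementation: the layer names (`title`, `header`, `body`), the layer weights in `LAYER_WEIGHTS`, the choice of raw term frequency as the local weight and log(N/df) as the global weight, and the toy documents are all assumptions made for illustration, since the abstract does not specify them.

```python
import math
from collections import Counter

# Hypothetical layer weights; the abstract gives no values.
LAYER_WEIGHTS = {"title": 3.0, "header": 2.0, "body": 1.0}


def layered_tf(doc, layer_weights=LAYER_WEIGHTS):
    """Local weight: term counts summed over layers, each count scaled by its layer weight."""
    tf = Counter()
    for layer, text in doc.items():
        for term in text.lower().split():
            tf[term] += layer_weights[layer]
    return tf


def idf_table(docs):
    """Global weight: log(N / df), so terms rare in the collection score higher."""
    df = Counter()
    for doc in docs:
        terms = set()
        for text in doc.values():
            terms.update(text.lower().split())
        df.update(terms)
    n = len(docs)
    return {term: math.log(n / count) for term, count in df.items()}


def weight_vector(tf, idf):
    """Combine local and global weights; terms unseen in the collection get weight 0."""
    return {term: freq * idf.get(term, 0.0) for term, freq in tf.items()}


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Toy collection: each document is a mapping from layer name to its text.
docs = [
    {"title": "vector space retrieval", "header": "", "body": "ranking web pages with term weights"},
    {"title": "cooking pasta", "header": "", "body": "vector of ingredients for retrieval of flavour"},
    {"title": "gardening tips", "header": "soil", "body": "water plants daily"},
]

idf = idf_table(docs)
query = weight_vector(layered_tf({"body": "vector retrieval"}), idf)
scores = [cosine(query, weight_vector(layered_tf(d), idf)) for d in docs]

# The document with the query terms in its title ranks above the one
# that only mentions them in its body; the unrelated document scores 0.
for i, score in enumerate(scores):
    print(f"doc {i}: {score:.3f}")
```

Passing a flat mapping such as `{"title": 1.0, "header": 1.0, "body": 1.0}` as `layer_weights` reduces the sketch to an ordinary vector space model, which makes it easy to compare the two rankings on the same collection.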

Index Terms

Computer Science
Information Sciences

Keywords

N-layer vector space model, global weight, local weight, weight evaluation scheme