Open Access   Article Go Back

An Approach to Design Personalized Focused Crawler

H.P. Trivedi1 , G.N. Daxini2 , J.A. Oswal3 , V.D. Gor4 , S. Mali5

Section:Research Paper, Product Type: Journal Paper
Volume-2 , Issue-3 , Page no. 144-147, Mar-2014

Online published on Mar 30, 2014

Copyright © H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali, “An Approach to Design Personalized Focused Crawler,” International Journal of Computer Sciences and Engineering, Vol.2, Issue.3, pp.144-147, 2014.

MLA Style Citation: H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali "An Approach to Design Personalized Focused Crawler." International Journal of Computer Sciences and Engineering 2.3 (2014): 144-147.

APA Style Citation: H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali, (2014). An Approach to Design Personalized Focused Crawler. International Journal of Computer Sciences and Engineering, 2(3), 144-147.

BibTex Style Citation:
@article{Trivedi_2014,
author = {H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali},
title = {An Approach to Design Personalized Focused Crawler},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {3 2014},
volume = {2},
Issue = {3},
month = {3},
year = {2014},
issn = {2347-2693},
pages = {144-147},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=86},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=86
TI - An Approach to Design Personalized Focused Crawler
T2 - International Journal of Computer Sciences and Engineering
AU - H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali
PY - 2014
DA - 2014/03/30
PB - IJCSE, Indore, INDIA
SP - 144-147
IS - 3
VL - 2
SN - 2347-2693
ER -

VIEWS PDF XML
3643 3395 downloads 3599 downloads
  
  
           

Abstract

The amount of data and its dynamicity makes it impossible to crawl the World Wide Web (WWW) completely. It�s a challenge in front of crawlers to crawl only the relevant pages from this information explosion. Thus a focused crawler solves this issue of relevancy to a certain level, by focusing on web pages for some given topic or a set of topics. Also a focused crawler with a page change detection policy can help in narrowing down the search to only newer pages, and thus eliminates risk of redundancy and missing updated data. This paper proposes a policy for design of a focused crawler with web page change detection policy.

Key-Words / Index Term

Web Crawler, Focused Crawler, World Wide Web(WWW),Content Analysis, Link Scoring, Change Detection

References

[1] Mahdi Bazarganigilani, Ali Syed and Sandid Burki, �Focused web crawling using decay concept and genetic programming�, published in International Journal of Data Mining & Knowledge Management Process (IJDKP), Vol.1, No.1, Page no(1-12), January 2011.
[2] 3Swati Mali and B B Meshram, �Focused Web Crawler with Page Change Detection Policy�, published in International Journal of Computer Applications (IJCA) proceedings on International Conference and workshop on Emerging Trends in Technology (ICWET), No 9 Article 9, Page No 51-56, 2011.
[3] 4DivakarYadav, AK Sharma, Sonia Sanchez-Cuadrado, Jorge Morato, �an approach to design incremental parallel webcrawler�, published in Journal of Theoretical and Applied Information Technology, Volume 43 No 1, Page no:(8-29), 15 September 2012.
[4] 6Anshika Pal, Deepak Tomar and S.C. Shrivastava, �Effective Focused Crawling Based on Content and Link Structure Analysis�, published in (IJCSIS) International Journal of Computer Science and Information Security, Vol. 2, no. 1, Page No: (1-5), June 2009.
[5] 7Ioannis Avraam and Ioanni Anagnostopoulos, �A Comparison over Focused Web Crawling Strategies�, published in Panhellenic Conference on Informatics(IEEE), Print ISBN 978-1-61284-962-1,Page No: (245-249), September 2011.
[6] 9Weicheng Ma, Xiuxia Chen and Wenqian Shang, �Advanced deep web crawler based on Dom�, published in IEEE Fifth International Joint Conference on Computational Sciences and Optimization, print ISBN 978-1-4673-1365-0, Page No: (605-609), June 2012
[7] Mejdl S. Safran, Abdullah Althagafi and Dunren Che, �Improving Relevance Prediction for Focused Web Crawlers�, published in IEEE/ACIS 11th International Conference on Computer and Information Science, print ISBN 978-1-4673-1536-4, page no: (161-166), May 2012.
[8] Jatinder Manhas, �A Study of Factors Affecting Websites Page Loading Speed for Efficient Web Performance�, published in International Journal of Computer Sciences and Engineering (IJCSE), Vol-1, Issue-3, Nov 2013.