An Approach to Design Personalized Focused Crawler
H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali
Section: Research Paper, Product Type: Journal Paper
Volume-2, Issue-3, Page no. 144-147, Mar-2014
Online published on Mar 30, 2014
Copyright © H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
How to Cite this Paper
IEEE Style Citation: H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali, “An Approach to Design Personalized Focused Crawler,” International Journal of Computer Sciences and Engineering, Vol.2, Issue.3, pp.144-147, 2014.
MLA Style Citation: H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali "An Approach to Design Personalized Focused Crawler." International Journal of Computer Sciences and Engineering 2.3 (2014): 144-147.
APA Style Citation: H.P. Trivedi, G.N. Daxini, J.A. Oswal, V.D. Gor, S. Mali, (2014). An Approach to Design Personalized Focused Crawler. International Journal of Computer Sciences and Engineering, 2(3), 144-147.
BibTex Style Citation:
@article{Trivedi_2014,
author = {H.P. Trivedi and G.N. Daxini and J.A. Oswal and V.D. Gor and S. Mali},
title = {An Approach to Design Personalized Focused Crawler},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {3 2014},
volume = {2},
Issue = {3},
month = {3},
year = {2014},
issn = {2347-2693},
pages = {144-147},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=86},
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=86
TI - An Approach to Design Personalized Focused Crawler
T2 - International Journal of Computer Sciences and Engineering
AU - H.P. Trivedi
AU - G.N. Daxini
AU - J.A. Oswal
AU - V.D. Gor
AU - S. Mali
PY - 2014
DA - 2014/03/30
PB - IJCSE, Indore, INDIA
SP - 144
EP - 147
IS - 3
VL - 2
SN - 2347-2693
ER -
Abstract
The volume of data on the World Wide Web (WWW) and the rate at which it changes make it impossible to crawl the Web completely. The challenge for a crawler is to fetch only the relevant pages from this information explosion. A focused crawler addresses the relevancy problem to a certain extent by restricting itself to web pages on a given topic or set of topics. Coupling a focused crawler with a page change detection policy narrows the crawl further to new or updated pages, which reduces redundant fetches and the risk of missing updated data. This paper proposes a design policy for a focused crawler with a web page change detection policy.
Key-Words / Index Term
Web Crawler, Focused Crawler, World Wide Web (WWW), Content Analysis, Link Scoring, Change Detection
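The abstract does not spell out the algorithms, so the following is only a minimal sketch of how content analysis, link scoring, and change detection might fit together. It is not the authors' method. The function names (`relevance`, `fingerprint`, `crawl`), the 0.5 threshold, the keyword-overlap score, the SHA-256 content hash, and the in-memory `web` mapping that stands in for real HTTP fetches are all assumptions made for illustration.

```python
import hashlib
import heapq


def relevance(text, topic_terms):
    """Toy content analysis: fraction of topic terms found in the page text."""
    words = set(text.lower().split())
    return sum(term in words for term in topic_terms) / len(topic_terms)


def fingerprint(text):
    """Content hash used to tell whether a page changed since the last crawl."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def crawl(web, seeds, topic_terms, seen_hashes, threshold=0.5):
    """Best-first crawl over a hypothetical in-memory web: url -> (text, links).

    Returns the relevant URLs whose content is new or changed, and updates
    `seen_hashes` in place so a later crawl can skip unchanged pages.
    """
    # heapq is a min-heap, so scores are negated to pop the best link first.
    frontier = [(-1.0, url) for url in seeds]
    heapq.heapify(frontier)
    visited, fresh = set(), []
    while frontier:
        _, url = heapq.heappop(frontier)
        if url in visited or url not in web:
            continue
        visited.add(url)
        text, links = web[url]
        score = relevance(text, topic_terms)
        if score < threshold:
            continue  # off-topic page: do not report it or follow its links
        digest = fingerprint(text)
        if seen_hashes.get(url) != digest:
            seen_hashes[url] = digest  # new or updated page
            fresh.append(url)
        for link in links:
            if link not in visited:
                # Assumed link scoring: a link inherits its parent's relevance.
                heapq.heappush(frontier, (-score, link))
    return fresh
```

Calling `crawl` twice with the same `seen_hashes` dictionary shows the intended effect: the first call reports every relevant page it reaches, and the second reports only pages whose text changed in between. A real crawler would replace the `web` lookup with HTTP fetching and HTML parsing, and would likely use a richer relevance model than keyword overlap.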