Open Access   Article Go Back

Comprehensive Overview On Web Usage Mining Its Task & Techniques

Sonam Singh Gurjar1 , Khushboo Agrawal2

Section:Survey Paper, Product Type: Journal Paper
Volume-7 , Issue-5 , Page no. 590-599, May-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7i5.590599

Online published on May 31, 2019

Copyright © Sonam Singh Gurjar, Khushboo Agrawal . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Sonam Singh Gurjar, Khushboo Agrawal, “Comprehensive Overview On Web Usage Mining Its Task & Techniques,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.5, pp.590-599, 2019.

MLA Style Citation: Sonam Singh Gurjar, Khushboo Agrawal "Comprehensive Overview On Web Usage Mining Its Task & Techniques." International Journal of Computer Sciences and Engineering 7.5 (2019): 590-599.

APA Style Citation: Sonam Singh Gurjar, Khushboo Agrawal, (2019). Comprehensive Overview On Web Usage Mining Its Task & Techniques. International Journal of Computer Sciences and Engineering, 7(5), 590-599.

BibTex Style Citation:
@article{Gurjar_2019,
author = {Sonam Singh Gurjar, Khushboo Agrawal},
title = {Comprehensive Overview On Web Usage Mining Its Task & Techniques},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2019},
volume = {7},
Issue = {5},
month = {5},
year = {2019},
issn = {2347-2693},
pages = {590-599},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4285},
doi = {https://doi.org/10.26438/ijcse/v7i5.590599}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i5.590599}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4285
TI - Comprehensive Overview On Web Usage Mining Its Task & Techniques
T2 - International Journal of Computer Sciences and Engineering
AU - Sonam Singh Gurjar, Khushboo Agrawal
PY - 2019
DA - 2019/05/31
PB - IJCSE, Indore, INDIA
SP - 590-599
IS - 5
VL - 7
SN - 2347-2693
ER -

VIEWS PDF XML
337 209 downloads 173 downloads
  
  
           

Abstract

Internet users in the world increasing rapidly. At the present time, the best way of conveying information is the World Wide Web. There are so many websites for learning, shopping, selling, businesses and many more. The expansion of Internet usage will result in increasing web data speedily. To exploit the information of internet usage, it becomes necessary to extract the access behavior of the users.Web usage mining is one of such Data mining technique used for mining, web access log.These access logs are saved on the web server.Access log is the records of all the user, requests for a particular file from a website.Web usage mining will help in improving the design of the website and the personalization of the content. This paper gives the comparative study of web usage mining, it also summarizes the web usage mining approach like pre-processing, pattern discovery, pattern analysis, visualization. This survey listed various research work done by the researcher. It delivers numerous techniques and algorithms used in web usage mining.

Key-Words / Index Term

server log, access log, web usage mining, pre-processing, user identification, session identification, clustering, classification, pattern discovery & analysis

References

[1] Daniel T. Larose, Discovering knowledge in data: An Introduction to Data Mining, USA: A John Wiley & Sons, INC, publication, 2005.
[2] Bing Liu, Web data mining: Exploring Hyperlinks, Contents, and usage data, German: Springer-Verlag Berlin Heidelberg, pp 527-540, 2007, ISBN 978-3-642-19459-7.
[3] R. Kosala and H. Blockeel, Web mining research: A survey, ACM SIGKDD Explore. 2 (2000) 1–15
[4] Qingyu Zhang and Richards S. Segall, International Journal of Information Technology & Decision-Making Vol. 7, No. 4 (2008) 683–720
[5] M. Eirinaki and M. Vazirgiannis, “Web mining for web personalization,” ACM Trans. Inter. Tech., Vol. 3, No. 1, pp. 1-27, 2003
[6] B.Lalithadevi, A.Merry Ida, A New Approach For Improving World Wide Web Techniques in Data Mining, International Journal of Advanced Research in Computer Science and Software Engineering, volume 3,issue1, January 2013
[7] M. Aldekhail, Application and Significance of Web Usage Mining in the 21st Century: A Literature Review, International Journal of Computer Theory and Engineering, Vol. 8, No. 1, February 2016
[8] Murat Ali Bayir, Ismail Hakki Toroslu, Ahmet Cosar and Guven Fidan “Discovering more accurate Frequent Web Usage Patterns,” arXiv0804.1409v1, 2008
[9] Michal Munk, Jozef Kapusta, Peter Švec, Constantine the Philosopher University in Nitra, Department of Informatics, Tr. A.Hlinku 1, 949 74 Nitra, Slovakia, “Data Pre-processing Evaluation for Web Log Mining: Reconstruction of Activities of a Web Visitor”, International Conference on Computational Science, ICCS 2010
[10] Mr. Shivkumar Khosla, Mrs. Varunakshi Bhojane, Department of Computer Engineering, Mumbai University, India, “Capturing Web Log and Performing Pre-processing of the User’s Accessing Distance Education System”, International Journal of Modern Engineering Research (IJMER) www.ijmer.com Vol.2, Issue.5, Sep.-Oct. 2012
[11] V. Chitraa, Dr. Antony Selvadoss Thanamani, A Novel Technique for Session Identification in Web Usage Mining Pre-processing, International Journal of Computer Application (0975 8887) Volume 34 No. 9, November 2011.
[12] Chaitra L Mugali, Pre-Processing and Analysis of Web Server Logs, International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: 2349-2163, Issue 8, Volume 2 (August 2015)
RAJASHREE SHETTAR ISSN: 2250–3676
[IJESAT] INTERNATIONAL JOURNAL OF ENGINEERING SCIENCE & ADVANCED TECHNOLOGY
RAJASHREE SHETTAR ISSN: 2250–3676
[IJESAT] INTERNATIONAL JOURNAL OF ENGINEERING SCIENCE & ADVANCED TECHNOLOGY
[13] Rajashree shettar, sequential pattern mining from web log data, IJESAT, ISSN:2250–3676, Volume-2, Issue-2, 204 – 208
[14] P. Fournier-Vige, “Mining partially-ordered sequential rules common to multiple sequences,” IEEE Transactions on Knowledge and Data Engineering, vol. 27(8), pp. 2203–2216, 2015.
[15] P. Fournier-Viger, T. Gueniche, et al, “ERMiner: sequential rule mining using equivalence classes,” The International Symposium on Intelligent Data Analysis, pp. 108–119, 2014.
[16] S. Padmaja et al., International Journal of Engineering and Technology (IJET), Vol 8 No 1 Feb-Mar 2016
[17] Viswanathan K, Mayilvahanan K, and R. Christy Pushpaleela, “Performance Comparison of SVM and C4.5 Algorithms for Heart Disease in Diabetic”, International Journal of Control Theory and Applications, ISSN: 0974-5572, Volume 10, Number 25, 2017.
[18] Ketan D. Patel, “Pre-processing on web server log data for web usage pattern discovery”, International Journal of Computer Applications (0975 – 8887) Volume 165 – No.10, May 2017
[19] Reeny Zackarias, “Predicting Users with Similar Behaviour Through Session”, International Journal of Advanced Engineering and Research Development (IJAERD) Volume 4, Issue 3, March -2017, e-ISSN: 2348 – 4470 [20] Jiaoling Du, Xiangqi Zhang, Hongmei Zhang and Lei Chen, "Research and Improvement of Apriori Algorithm", IEEESixth International Conference on Science and Technology, pp.117-121,2016. [21] V. Chitraa and Antony Selvadoss Thanamani, “Clustering of Navigation Patterns using Bolzwano_WeierstrassTheorem”, Indian Journal of Science and Technology,Vol8(12),69283, June 2015.PP1-9 [22] P. Sukumar, “Review on Modern Data Pre-processing Techniques in Web Usage Mining (WUM),” International Conference on Computational Systems and Information Systems for Sustainable Solutions,978-1-50901022-6/16/IEEE(2016). [23] S S Patil and HP Khandagale, “Enhancing Web Navigation Usability Using Web Usage Mining Techniques”, International Research Journal of Engineering and Technology IRJET, vol 4 6, June 2016. [24]S Sharma and S S Lodhi, “Development of Decision Tree Algorithm for Mining Web Data Stream”, International Journal of Computer Applications, March 2016. [25] Shlin He, Qingwei Lin, et al, “Identifying Impactful Service System Problems Via Log Analysis”, ESE/FSE’18, November 4–9,2018, lake-Buena-Vista, Florida, USA. [26] Sonia Sharma et al, “Customer Behaviour Analysis using Web Usage Mining”, International Journal of Scientific Research in Computer Science and Engineering, vol 5, issue 6, pp4750, December (2017).
[27] Madihah Mohd Saudi, et al,” An Efficient Data Transformation Technique for Web Log”, WCE 2017, July 5–7,2017, London, UK.
[28] TAWFIQ A. AL-ASDI, et al, “An Efficient Web Usage Mining Algorithm Based on Log File Data”, Journal of Theoretical and Applied Information Technology,31 October 2016 vol 92 No 2, ISSN:1992 – 8645.
[29] Arjun Ram Meghwal and Dr. Arvind K Sharma,” Identifying System Error through Web Server Log File in Web Log Mining”, International Journal of Computer Science And Technology,Vol.7, ISSN 1, Jan–March 2016 [30] Jayanti Mehra and Dr. R S Thakur, “An Efficient method for Web Log Pre-processing and Page Access Frequency using Web Usage Mining”, International Journal of Applied Engineering Research ISSN 0973–4562 Vol–13,November–2(2018),pp1227–1232. [31] B. Rajeshwari, “Web Page Prediction Using Web Mining”, IRJET, Vol:5, Issue 5, May 2018, e–ISSN:2395–0056. [32] Aanum Shaikh, “Web Usage Mining Using Apriori and FP Growth Algorithm”, International Journal of Computer Science and Information Technology, Vol– 6, pp 354–357,2015