Open Access   Article Go Back

An Efficient Algorithm for Data Pre-Processing and Personalization in Web Usage Mining

Preeti Rathi1 , Nipur Singh2

Section:Research Paper, Product Type: Journal Paper
Volume-7 , Issue-5 , Page no. 160-164, May-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7i5.160164

Online published on May 31, 2019

Copyright © Preeti Rathi, Nipur Singh . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Preeti Rathi, Nipur Singh, “An Efficient Algorithm for Data Pre-Processing and Personalization in Web Usage Mining,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.5, pp.160-164, 2019.

MLA Style Citation: Preeti Rathi, Nipur Singh "An Efficient Algorithm for Data Pre-Processing and Personalization in Web Usage Mining." International Journal of Computer Sciences and Engineering 7.5 (2019): 160-164.

APA Style Citation: Preeti Rathi, Nipur Singh, (2019). An Efficient Algorithm for Data Pre-Processing and Personalization in Web Usage Mining. International Journal of Computer Sciences and Engineering, 7(5), 160-164.

BibTex Style Citation:
@article{Rathi_2019,
author = {Preeti Rathi, Nipur Singh},
title = {An Efficient Algorithm for Data Pre-Processing and Personalization in Web Usage Mining},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2019},
volume = {7},
Issue = {5},
month = {5},
year = {2019},
issn = {2347-2693},
pages = {160-164},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4215},
doi = {https://doi.org/10.26438/ijcse/v7i5.160164}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i5.160164}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4215
TI - An Efficient Algorithm for Data Pre-Processing and Personalization in Web Usage Mining
T2 - International Journal of Computer Sciences and Engineering
AU - Preeti Rathi, Nipur Singh
PY - 2019
DA - 2019/05/31
PB - IJCSE, Indore, INDIA
SP - 160-164
IS - 5
VL - 7
SN - 2347-2693
ER -

VIEWS PDF XML
584 325 downloads 174 downloads
  
  
           

Abstract

With the huge amount of data in web, web mining is the process to extract useful data from web. Web usage mining is the type of web mining to retrieve data from web in form of logs, and it is also called web log mining. Web log mining extract useful pattern or information from log files and it help to determine user behaviour. In this paper we proposed an algorithm for data pre-processing and personalization in web usage mining. Firstly collect the data from server and merge these log files into single log file. After collection of data separate each field using field separate algorithm, then cleaning the data to remove noise and unwanted data and after personalize these data for further used.

Key-Words / Index Term

Data Cleaning, Data field Extraction, Cluster, Session Identification, User Identification, Pre-processing, personalization.

References

[1]. Bhupendra Kumar Malviya, Jitendra Agrawal, ”A Study on Web Usage Mining: Theory and Applications”, Fifth International Conference on Communication Systems and Network Technologies, IEEE, Page: 935-939, April 2015, ISBN (Print) 978-1-4799-1797-6/15
[2]. Dr. Girish S. Katkar, Amit Dipchandji Kasliwal,” Use of Log Data for Predictive Analytics through Data Mining”, Current Trends in Technology and Science, page-217-222, ISSN: 2279-0535. Volume: 3, Issue: 3 (Apr-May. 2014).
International Journal of Computer Applications (0975 – 8887) Volume 103 – No.6, October 2014
[3]. M.Praveen Kumar,” An Effective Analysis of Weblog Files to improve Website Performance”, International Journal of Computer Science & Communication Networks, Vol. 2(1), Page: 55-60, 2011, ISSN: 2249-5789.
[4]. Mr. Jitendra B. Upadhyay, Dr. S. V. Patel,” A Review Analysis of Preprocessing Techniques in Web usage Mining”, International Journal of Engineering Research & Technology (IJERT), Vol. 4 Issue 04, April-2015, page -1160-1166,ISSN: 2278-0181
[5]. Nehal G. Karelia, Prof. Shweta Shukla,” Data Preprocessing: A Pre requisite for Web Log Files”, International Journal of Engineering Research & Technology (IJERT), page-1571-1574, Vol. 3 Issue 4, April – 2014, ISSN: 2278-0181
[6]. Oren Etzioni,” The World-Wide Web: Quagmire or Gold Mine?” ACM, Vol. 39, No. 11, November 1996, Page: 66-68.
[7]. Sameer Dixit, Navjot Gwal,” An Implementation of Data Pre-Processing for Small Dataset”,
[8]. Saurabh Choudhry, Prof A. K Solanki “ Errors in Internet Log files for Website Improvement and Interaction”, International Journal of Advanced Research in Computer Science and Software Engineering, Page-365-371, Volume 4, Issue 10, October 2014, ISSN- 2277 128X
[9]. Shakti Kundu, “An Intelligent approach of web data mining”, International Journal on Computer Science and Engineering, page-919-928, Vol. 4 No. 05 May 2012, ISSN: 0975-3397.
[10]. Sheetal A. Raiyani, Rakesh Pandey, Shivkumar Singh Tomar, ”Performance Enhancement of Web Server log for Distinct User Identification through different Factors”, International Journal of Advanced Research in Computer and Communication Engineering, Vol. 3, Issue 6, June 2014, Page: 7262-7267, ISSN (Online) : 2278-1021, ISSN (Print) : 2319-5940.
[11]. Shivaprasad G., N.V. Subba Reddy, U. Dinesh Acharya,” Knowledge Discovery from Web Usage Data: An Efficient Implementation of Web Log Preprocessing Techniques”, International Journal of Computer Applications (0975 – 8887) Volume 111 – No 13, February 2015
[12]. Surbhi Anand , Rinkle Rani Aggarwal “An Efficient Algorithm for Data Cleaning of Log File using File Extensions “, International Journal of Computer Applications (0975 – 888)Volume 48– No.8, June 2012
[13]. V.Chitraa, Dr.Antony Selvadoss Thanamani ,” A Novel Technique for Sessions Identification in Web Usage Mining Preprocessing”, International Journal of Computer Applications (0975 – 8887) Volume 34– No.9, November 2011
[14]. Jiang Chang-bin, “Web Log Data Preprocessing Based on Collaborative Filtering”, 2010 Second International Workshop on Education Technology and Computer Science.
[15]. K. S. R. Pawan Kumar,”A Critique on Web Usage Mining”, International Journal of Computer Science and Information Technologies, Vol. 3 (5) , 2012,5276-5279.
[16]. Gajendra Singh, “A New Algorithm for Web Log Mining”, International Journal of Computer Applications (0975 – 8887) Volume 90 – No 17, March 2014 20
[17]. Gajendra Singh Chandel, “A Result Evolution Approach for Web usage mining using Fuzzy C-Mean Clustering Algorithm”, IJCSNS International Journal of Computer Science and Network Security, VOL.16 No.1, January 2016
[18]. Doddegowda B J,” A Novel Algorithm for Web Personalization through Integration of Web User Profiles and Behavioural Patterns”, IRACST - International Journal of Computer Science and Information Technology & Security (IJCSITS), ISSN: 2249-9555, Vol.7, No.2, Mar-April 2017