International Journal of
Computer Sciences and Engineering

Scholarly Peer-Reviewed Scientific Research Publishing Journal
Effectuation of Web Log Preprocessing and Page Access Frequency using Web Usage Mining
Effectuation of Web Log Preprocessing and Page Access Frequency using Web Usage Mining
B. Bakariya1 , G.S. Thakur2

Section:Research Paper, Product Type: Journal Paper
Volume-1 , Issue-1 , Page no. 1-5, Sep-2013

Online published on Sep 30, 2013

Copyright B. Bakariya, G.S. Thakur . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
 
View this paper at   Google Scholar | DPI Digital Library
  XML View PDF Download  
Citation

IEEE Style Citation: B. Bakariya, G.S. Thakur, Effectuation of Web Log Preprocessing and Page Access Frequency using Web Usage Mining, International Journal of Computer Sciences and Engineering, Vol.1, Issue.1, pp.1-5, 2013.

MLA Style Citation: B. Bakariya, G.S. Thakur "Effectuation of Web Log Preprocessing and Page Access Frequency using Web Usage Mining." International Journal of Computer Sciences and Engineering 1.1 (2013): 1-5.

APA Style Citation: B. Bakariya, G.S. Thakur, (2013). Effectuation of Web Log Preprocessing and Page Access Frequency using Web Usage Mining. International Journal of Computer Sciences and Engineering, 1(1), 1-5.
           
Abstract :
For accessing the information from web log, this is very important task and this task can be accomplished by web usage mining technique. Through web usage mining technique we can find out visitors behavior which can automatically and very fast access intrinsic information from huge amount of web log data, such as interesting access path, identify the user, accessing the web page group, web user clustering and web pre-fetching. Web usage mining is milestone for decision making process for an organization. Data preprocessing is very important concepts for the mining process. If our web log data is preprocessed then we can easily find out the desire information about visitor and also retrieve other hidden information from web log data. In this paper we focus on data preprocessing technique of web usage mining, after completion of data preprocessing, any king of irrelevant information can be sort out. We have also proposed an algorithm and its implementation for web log preprocessing in web usage mining. Every page has been assigned with an individual token. According to this token and frequency, data mining technique (Classification, Association Rules, and Clustering) can be applied. In this article we can easily find the highest and lowest value according to page access frequency.
Key-Words / Index Term :
Web Usage Mining, Preprocessing, Web Log Data, Frequency, Clustering
References :
[1] Theint Theint Aye, "Web Log Cleaning of Web Usage Patterns," IEEE, 2011.
[2] Ms.Dipa Dixit and Ms. M. Kiruthika, "Preprocessing of Web Logs," International Journal on Computer Science and Engineering,vol. 02, 2010.
[3] Arshi Shamsi, Rahul Nayak, Pankaj Pratap Singh and Mahesh Kumar Tiwari , "Web Usage Mining by Data Preprocessing," IJCST, vol. 3, 2012.
[4] Mahendra Pratap Yadav,Pankaj Kumar Keserwani and Shefalika Ghosh Samaddar, "An Efficient Web Mining Algorithm for Web Log Analysis: E-Web Miner," IEEE, 2012.
[5] Shaimaa Ezzat Salama, Mohamed I. Marie, "Web Server Logs preprocessing for Web Intrusion Detection," Computer and Information Science, vol. 4, 2011.
[6] Jaideep Srivastava, Robert Cooley, Mukund Deshpande and Pang-Ning Tan, "Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data," SIGKDD Explorations, vol. 1, 2000.
[7] Liu Kewen, "Analysis of Preprocessing Methods for Web Usage Data," International Conference on Measurement , Information and control(MIC),IEEE,2012.
[8] R. Cooley,B. Mobasher and J Shrivastava, "Web Mining:information and pattern discoveryon the World Wide web," Ninth International Conference, 2011.
[9] Web Log Data, "http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html,".
[10] Zhuang Like, Kou Zhongbao and Zhang Changshui, "Session identification based on time intervals in Web log mining," Journal of Tsinghua University (Science and Technology), 2005.
[11] N. Zhang and W. F. Lu, " An Efficient Data Preprocessing Method for Mining Customer Survey Data," IEEE, 2007.
[12] Tasawar Hussain, Dr. Sohail Asghar, Dr. Nayyer Masood, " Web Usage Mining: A Survey on Preprocessing of Web Log File," IEEE, 2010.
[13] T. Murata and K. Saito, "Extracting Users` Interests from Web Log Data," Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings, 2006.
[14] Ling Zheng , Hui Gui and Feng Li, "Optimized Data Preprocessing Technology for Web Log Mining," International Conference On Computer Design And Appliations ICCDA, 2010.
[15] R. Cooley, B. Mobasher and J. Srivastava, "Data preparation for mining world wide web browsing patterns," Knowledge and Information System, 1999.
[16] Brijesh Bakariya and G.S.Thakur, "Preprocessing on Web Log Data in Web Usage Mining," International Conference on Intelligent Computing and Information System ICICIS, 2012.
[17] Thi Thanh Sang Nguyen, Hai Yan Lu and Jie Lu, "Web-page Recommendation based on Web Usage and Domain Knowledge," IEEE, 2013.