Open Access   Article Go Back

Preprocessing Application based on Structured Query Language for Web Log Mining

K. Vadivazhagan1 , M. Karthikeyan2

Section:Review Paper, Product Type: Journal Paper
Volume-7 , Issue-3 , Page no. 544-549, Mar-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7i3.544549

Online published on Mar 31, 2019

Copyright © K. Vadivazhagan, M. Karthikeyan . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: K. Vadivazhagan, M. Karthikeyan, “Preprocessing Application based on Structured Query Language for Web Log Mining,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.3, pp.544-549, 2019.

MLA Style Citation: K. Vadivazhagan, M. Karthikeyan "Preprocessing Application based on Structured Query Language for Web Log Mining." International Journal of Computer Sciences and Engineering 7.3 (2019): 544-549.

APA Style Citation: K. Vadivazhagan, M. Karthikeyan, (2019). Preprocessing Application based on Structured Query Language for Web Log Mining. International Journal of Computer Sciences and Engineering, 7(3), 544-549.

BibTex Style Citation:
@article{Vadivazhagan_2019,
author = {K. Vadivazhagan, M. Karthikeyan},
title = {Preprocessing Application based on Structured Query Language for Web Log Mining},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {3 2019},
volume = {7},
Issue = {3},
month = {3},
year = {2019},
issn = {2347-2693},
pages = {544-549},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=3877},
doi = {https://doi.org/10.26438/ijcse/v7i3.544549}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i3.544549}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=3877
TI - Preprocessing Application based on Structured Query Language for Web Log Mining
T2 - International Journal of Computer Sciences and Engineering
AU - K. Vadivazhagan, M. Karthikeyan
PY - 2019
DA - 2019/03/31
PB - IJCSE, Indore, INDIA
SP - 544-549
IS - 3
VL - 7
SN - 2347-2693
ER -

VIEWS PDF XML
280 199 downloads 112 downloads
  
  
           

Abstract

Web Log Mining, also known as Web Usage Mining (WUM) is the application of Data Mining techniques, which is applied on web log data to extract interesting patterns. An enormous increase in the use of web applications as medium of the organizations and institutions, the web page hits are consistently increasing. The web servers have the facility to save the web navigational sequence as web log file. The enormous amount of irrelevant information in the web log file demands proper preprocessing. This renders the file, with the intent of making it more appropriate for a variety of downstream purposes such as analytics. There are various traditional techniques involved in preprocessing. The implementation of preprocessing model presented in this paper over other traditional preprocessing methods is to employ an efficient Structured Query Language (SQL) based technique. The proposed SQL based preprocessing technique reduces process time drastically. The resulting structured log file is well suited for further pattern mining and analytics.

Key-Words / Index Term

Preprocessing, Web Log Mining, Server log, User Identification, Session Identification

References

[1] J. Srivatsava, R. Cooley, M. Deshpande, and P. N. Tan, “Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data”, SIGKDD Explorations, Vol.1, Issue.2, pp.12-23, 2000.
[2] V. Chitraa, and Antony Selvadoss Devamani, “A Novel Technique for Sessions Identification in Web Usage Mining Preprocessing”, International Journal of Computer Applications, Vol.34, Issue.9, pp.23-27, 2011.
[3] K. Vadivazhagan and M. Karthikeyan, “Preprocessing Techniques in Web Log Mining to Group Users and Identify User Session”, International Journal of Engineering Science Invention, Vol.4, pp.26-33, 2018.
[4] K. Vadivazhagan and M. Karthikeyan, “Mining Frequent Link Sets from Web Log Using Apriori Algorithm”, Journal of Computational and Theoretical Nanoscience, American Scientific Publishersber, Vol. 16, pp. 1–7, 2019.
[5] P. Sukumar, L. Robert and S. Yuvaraj, "Review on modern Data Preprocessing techniques in Web usage mining (WUM)", In the Proceedings of the 2016 IEEE International Conference on Computation System and Information Technology for Sustainable Solutions (CSITSS), Bangalore, pp.64-69, 2016.
[6] Bharat Chauhan, Hemant Kumar, Mihul Singh, Piyush Kumar and Sakshi Hooda, "An Improved Preprocessing and Clustering Using Web Log Data", International Journal of Advanced Research in Computer and Communication Engineering, Vol.5, Issue.11, pp.95–98, 2016.
[7] Janusz Kacprzyk and Sławomir Zadrozny, "Linguistic Summarization of the Contents of Web Server Logs via the Ordered Weighted Averaging (OWA) Operators", Fuzzy Sets and Systems, Elsevier North-Holland, Inc., Vol.285, pp.182-198, 2016.
[8] F. Mary Harin Fernandez and R. Ponnusamy, "Data Preprocessing and Cleansing in Web Log on Ontology for Enhanced Decision Making", Indian Journal of Science and Technology, Vol.9, Issue.10, pp.1-10, 2016.
[9] S. Uma Maheswari and S. K. Srivatsa, "An Application of Preprocessing and Clustering in Web Log Mining", International Journal of Philosophies in Computer Science, Vol.1, Issue.1, pp.21-30, 2015.
[10] K. R. Suneetha, R. Krishnamoorthi, “Identifying User Behavior by Analyzing Web Server Access Log File”, IJCSNS International Journal of Computer Science and Network Security, Vol.9, Issue.4, pp.327-332, 2009.
[11] M. Udantha, S. Ranathunga and G. Dias, "Modelling Website User Behaviors By Combining the EM and DBSCAN Algorithms", In the Proceedings of the 2016 IEEE Moratuwa Engineering Research Conference (MERCon), Moratuwa, pp. 168-173, 2016.
[12] Hsin-Jung Cheng and Akhil Kumar, "Process Mining on Noisy Logs - Can Log Sanitization Help to Improve Performance?", Decision Support Systems, Elsevier B.V., Vol.79, pp. 138-149, 2015.
[13] Yin-Fu Huang and Jhao-Min Hsu, "Mining Web Logs to Improve Hit Ratios of Prefetching and Caching", The 2005 IEEE International Conference on Web Intelligence (WI`05), Compiegne, France, pp. 577-580, 2005.
[14] R.Sandrilla, M. Savitha Devi, "A Study on Data Preprocessing Methods on Web Log Data in Web Usage Mining", International Journal of Computer Sciences and Engineering, Vol.6, Issue.7, pp.920-928, 2018.
[15] AshirrK Kashyap, Iflah Naseem and Dheeraj Mandloi, "Web Mining an Approach to Evaluate the Web", International Journal of Scientific Research in Computer Science and Engineering, Vol.5, Issue.3, pp.79-85, 2017.
[16] Sonia Sharma and Munishwar Rai, "Customer Behaviour Analysis using Web Usage Mining", International Journal of Scientific Research in Computer Science and Engineering, Vol.5, Issue.6, pp.47-50, 2017.
[17] Namrata Ghuse, Pranali Pawar and Amol Potgantwar, "An Improved Approch For Fraud Detection In Health Insurance Using Data Mining Techniques", International Journal of Scientific Research in Network Security and Communication, Vol.5, Issue.5, pp.27-32, 2017.
[18] M. Karthikeyan and P. Aruna, "Probability based Document Clustering and Image Clustering using Content-based Image Retrieval", Applied Soft Computing, Elsevier, Vol.13, Issue.2, pp.959-966, 2013.