Open Access   Article Go Back

A Review on Document Retrieval from Unstructured Text

Sneha Lohbare1 , Ashwini Meshram2

Section:Review Paper, Product Type: Journal Paper
Volume-2 , Issue-11 , Page no. 76-80, Nov-2014

Online published on Nov 30, 2014

Copyright © Sneha Lohbare , Ashwini Meshram . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Sneha Lohbare , Ashwini Meshram, “A Review on Document Retrieval from Unstructured Text,” International Journal of Computer Sciences and Engineering, Vol.2, Issue.11, pp.76-80, 2014.

MLA Style Citation: Sneha Lohbare , Ashwini Meshram "A Review on Document Retrieval from Unstructured Text." International Journal of Computer Sciences and Engineering 2.11 (2014): 76-80.

APA Style Citation: Sneha Lohbare , Ashwini Meshram, (2014). A Review on Document Retrieval from Unstructured Text. International Journal of Computer Sciences and Engineering, 2(11), 76-80.

BibTex Style Citation:
@article{Lohbare_2014,
author = {Sneha Lohbare , Ashwini Meshram},
title = {A Review on Document Retrieval from Unstructured Text},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {11 2014},
volume = {2},
Issue = {11},
month = {11},
year = {2014},
issn = {2347-2693},
pages = {76-80},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=306},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=306
TI - A Review on Document Retrieval from Unstructured Text
T2 - International Journal of Computer Sciences and Engineering
AU - Sneha Lohbare , Ashwini Meshram
PY - 2014
DA - 2014/11/30
PB - IJCSE, Indore, INDIA
SP - 76-80
IS - 11
VL - 2
SN - 2347-2693
ER -

VIEWS PDF XML
3956 3426 downloads 3565 downloads
  
  
           

Abstract

A simple search over a document can be considered as a traditional method of searching from a single document in database. A keyword or string is considered as core element while searching where string may be strings of words, characters for any phrase. Many problems in such keyword or phrase-based searching arise when a keyword or phrase is intended to be searched in multiple documents. For the same, a solution suggested is a repetitive procedure of searching for every document. It can be helpful for limited number of copies of document. But this solution can never be considered efficient and effective in case of large number of documents in database which is supposed to be increasing continuously. Also, searching for the pattern based or the regular expression based content from the document is one of the demanding topics of research. Processing such queries requires a lot of processing time and complete indexing of data is bit difficult process.

Key-Words / Index Term

Document retrieval, Indexing, Unstructured Text

References

[1] Debnath Bhattacharyya, Poulami Das,” Unstructured Document Categorization: A Study”, International Journal of Signal Processing, Image Processing and Pattern Recognition, pp. 55-62,Jan 2008.
[2] Weiguo Fan, ”Tapping into the Power of Text Mining”, article accepted for publication at the Communications of ACM, pp. 02-15, February 16, 2005.
[3] V.V.Jaya Rama Krishnaiah, D.V.Chandra Sekhar, Dr. K. Ramchand H Rao, Dr. R Satya Prasad,” Predicting the Diabetes using Duo Mining Approach”, International Journal of Advanced Research in Computer and Communication Engineering ISSN : 2278 – 1021,Vol. 1, Issue 6, pp. 423-431, August 2012.
[4] K.Sreerama Murthy, Dr G. Samuel Varaprasad Raju, Dr C. Sunil Kumar,” Text Mining For Retrieving The Vital Information”, International Journal of Research in Computer and Communication Technology, Vol. 3, Issue 1, pp.99-103,Jan 2014.
[5] Manish Sharma, Rahul Patel,” A Survey on Information Retrieval Models, Techniques and Applications”, International Journal of Emerging Technology and Advanced Engineering, ISSN 2250-2459,pp.542-545, November 2013.
[6] B.Ganga,” Phrase Based Document Retrieving by Combining Suffix Tree index data structure and Boyer- Moore faster string searching algorithm”, International Journal of Advancements in Research & Technology, ISSN 2278-7763,Vol. 3, Issue 3, pp. 147-153,March 2014.
[7] Ian H. Witten,” Text mining”, Computer Science, University of Waikato, Hamilton, New Zealand, pp 01-23,2004.
[8] Roi Blanco González,” Index Compression for Information Retrieval Systems”, Ph.D. Thesis, University of A Coruña, 2008.
[9] Deepak Agnihotri, Kesari Verma, Priyanka Tripathi,” Pattern and Cluster Mining on Text Data”, Fourth International Conference on Communication Systems and Network Technologies, IEEE Computer Society, pp. 428-432, 2014.
[10] R. Sagayam, S.Srinivasan, S. Roshni,” A Survey of Text Mining: Retrieval, Extraction and Indexing Techniques”, International Journal Of Computational Engineering Research, ISSN 2250-3005, Vol. 2 Issue. 5, pp. 1443-1446, September 2012.
[11] Sonali Vijay Gaikwad, Prof. Archana Chaugule, Swapnil Kulkarni, ” Performance Comparison for Text Mining Methods: Review”, International Journal of Advanced Engineering Research and Studies, E-ISSN 2249–8974, pp. 01-04, Oct.-Dec, 2014.
[12] Ning Zhong, Yuefeng Li, and Sheng-Tang Wu,” Effective Pattern Discovery for Text Mining”, IEEE Transactions On Knowledge And Data Engineering, Vol. 24, No. 1,pp. 30-44, Jan. 2012.
[13] S.S. Patil,V.M. Gaikwad, ” Developing New Software Metric Pattern Discovery for Text Mining”, International Journal of Computer Sciences and Engineering, Vol. 2, Issue-4,pp. 119-125, April 2014.
[14] Bhushan Inje, Ujawla Patil,” Operational Pattern Revealing Technique in Text Mining”, IEEE Students’ Conference on Electrical, Electronics and Computer Science,2014.
[15] Ziqi Wang, Gu Xu, Hang Li, and Ming Zhang,” A Probabilistic Approach to String Transformation”,published in IEEE Transactions On Knowledge And Data Engineering, Vol. 26, No. 5,pp. 1063-1075, May 2014.
[16] Saima Hasib, Mahak Motwani, Amit Saxena,” Importance of Aho-Corasick String Matching Algorithm in Real World Applications” published in International Journal of Computer Science and Information Technologies, ISSN: 0975-9646, Vol. 4 (3) , pp. 467-469,2013.