Comparative Study of String Matching Algorithms for DNA dataset
Pooja Manisha Rahate1 , M. B. Chandak2
- Dept. of Computer Science and Tech., Shri Ramdeobaba College of Engineering & Management, Nagpur, Maharashtra, India.
- Dept. of Computer Science and Tech., Shri Ramdeobaba College of Engineering & Management, Nagpur, Maharashtra, India.
- Dept. of Computer Science and Tech., Shri Ramdeobaba College of Engineering & Management, Nagpur, Maharashtra, India.
Section:Research Paper, Product Type: Journal Paper
Volume-6 ,
Issue-5 , Page no. 1067-1074, May-2018
CrossRef-DOI: https://doi.org/10.26438/ijcse/v6i5.10671074
Online published on May 31, 2018
Copyright © Pooja Manisha Rahate, M. B. Chandak . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
View this paper at Google Scholar | DPI Digital Library
How to Cite this Paper
- IEEE Citation
- MLA Citation
- APA Citation
- BibTex Citation
- RIS Citation
IEEE Style Citation: Pooja Manisha Rahate, M. B. Chandak, “Comparative Study of String Matching Algorithms for DNA dataset,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.5, pp.1067-1074, 2018.
MLA Style Citation: Pooja Manisha Rahate, M. B. Chandak "Comparative Study of String Matching Algorithms for DNA dataset." International Journal of Computer Sciences and Engineering 6.5 (2018): 1067-1074.
APA Style Citation: Pooja Manisha Rahate, M. B. Chandak, (2018). Comparative Study of String Matching Algorithms for DNA dataset. International Journal of Computer Sciences and Engineering, 6(5), 1067-1074.
BibTex Style Citation:
@article{Rahate_2018,
author = {Pooja Manisha Rahate, M. B. Chandak},
title = {Comparative Study of String Matching Algorithms for DNA dataset},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2018},
volume = {6},
Issue = {5},
month = {5},
year = {2018},
issn = {2347-2693},
pages = {1067-1074},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=2110},
doi = {https://doi.org/10.26438/ijcse/v6i5.10671074}
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v6i5.10671074}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=2110
TI - Comparative Study of String Matching Algorithms for DNA dataset
T2 - International Journal of Computer Sciences and Engineering
AU - Pooja Manisha Rahate, M. B. Chandak
PY - 2018
DA - 2018/05/31
PB - IJCSE, Indore, INDIA
SP - 1067-1074
IS - 5
VL - 6
SN - 2347-2693
ER -
VIEWS | XML | |
758 | 432 downloads | 288 downloads |
Abstract
String matching algorithms are widely used in computer science fields for information retrieval, intrusion detection, music retrieval, database queries, language syntax checker, bioinformatics, DNA sequence matching and etc. The most common and well-known use of string matching algorithms is for bioinformatics. In bioinformatics the DNA sequences of the normal human being and matched with the DNA sequence of a person having viruses or any kind disease. The pattern of any disease or virus is matched with the normal DNA genome sequence. If the pattern is found in the sequence which is in the form of string it is considered that the human being or patient is having the tested disease. Thus the pattern is matched with the large amount of DNA sequence which is sometimes very complex and not easy to retrieve. Thus to get the result or matched pattern in the less time with more accuracy the algorithms such as Knuth-Morris-Pratt(KMP), Boyer-Moore, Brute Force, Rabin-Karp and other algorithms are used. This paper presents five string matching algorithms from which four are exact matching algorithms and one is approximate string matching algorithm (Edit Distance). The above listed algorithms complexity will be compared using the DNA dataset to find the appropriate algorithm with high quality time and accuracy.
Key-Words / Index Term
String Matching Algorithm, DNA sequence
References
[1] NYO ME TUN, THIN MYA MYA SWE, “Comparison of Three Pattern Matching Algorithms using DNA Sequences”, IJSETR, Vol.3, Issue.35, pp.6916-6920, 2014.
[2] https://en.wikipedia.org/wiki/DNA
[3] Robert Sedgewick, Kevin Wayne, “Algorithms”, Fourth Edition, Addison-Wesley,Pearson Edition, India, pp. 760-776, 2011.
[4] Thomas Cormen,Charles E. Leiserson,Ronald L. Rivest,Clifford Stein, “Introduction to Algorithms”, McGraw-Hill Publication, India, pp.909-926, 2001.
[5] Raju Bhukya, DVLN Somayajulu, “Exact Multiple Pattern Matching Algorithm using DNA Sequence and Pattern Pair”, International Journal of Computer Applications, Number 8,Article 6, pp.32-38, 2011.
[6] Petteri Jokinen, Jorma Tarhio and Esko Ukkonen, “A Comparison of Approximate String Matching Algorithms”, Software-Practice and Experience, Vol.1(1), pp.1-4, 1988.
[7] http://ccg.vitalit.ch/cgibin/htpselex/show?htpselex&tf=NF1_1&clone
[8] https://archive.ics.uci.edu/ml/machine-learning-databases/molecular-biology/promoter-gene-sequences/promoters.data