Open Access Article

An Overview of Speech Recognition Techniques and the Progress in Speech Technology for Indian Scripts

Bharat C. Patel, Jagin M. Patel, Manish M. Kayasth

Section: Review Paper, Product Type: Journal Paper
Volume-3, Issue-2, Page no. 94-102, Feb-2015

Online published on Feb 28, 2015

Copyright © Bharat C. Patel, Jagin M. Patel, Manish M. Kayasth. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to Cite this Paper

IEEE Style Citation: Bharat C. Patel, Jagin M. Patel, Manish M. Kayasth, “An Overview of Speech Recognition Techniques and the Progress in Speech Technology for Indian Scripts,” International Journal of Computer Sciences and Engineering, Vol.3, Issue.2, pp.94-102, 2015.

MLA Style Citation: Bharat C. Patel, Jagin M. Patel, Manish M. Kayasth. "An Overview of Speech Recognition Techniques and the Progress in Speech Technology for Indian Scripts." International Journal of Computer Sciences and Engineering 3.2 (2015): 94-102.

APA Style Citation: Bharat C. Patel, Jagin M. Patel, Manish M. Kayasth (2015). An Overview of Speech Recognition Techniques and the Progress in Speech Technology for Indian Scripts. International Journal of Computer Sciences and Engineering, 3(2), 94-102.

BibTex Style Citation:
@article{Patel_2015,
author = {Bharat C. Patel and Jagin M. Patel and Manish M. Kayasth},
title = {An Overview of Speech Recognition Techniques and the Progress in Speech Technology for Indian Scripts},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {2 2015},
volume = {3},
number = {2},
month = {2},
year = {2015},
issn = {2347-2693},
pages = {94-102},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=5591},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=5591
TI - An Overview of Speech Recognition Techniques and the Progress in Speech Technology for Indian Scripts
T2 - International Journal of Computer Sciences and Engineering
AU - Patel, Bharat C.
AU - Patel, Jagin M.
AU - Kayasth, Manish M.
PY - 2015
DA - 2015/02/28
PB - IJCSE, Indore, INDIA
SP - 94
EP - 102
IS - 2
VL - 3
SN - 2347-2693
ER -

Abstract

Speech is widely regarded as the most natural form of communication between human beings. Research in speech recognition has made man-machine conversation possible through automatic speech recognition systems, which let humans interact with machines and have a wide range of applications. This paper gives an overview of speech recognition techniques, surveys the application areas of speech recognition systems, and reviews the progress made in speech technology, particularly for Indian scripts. It also covers the feature extraction techniques and classification methods used in speech recognition systems. The primary objective is to help newcomers understand the flow of a speech recognition system and to guide them toward further research in this field.

Keywords / Index Terms

Human-Machine Interaction, Speech Recognition System, Feature Extraction Methods, Classification Techniques
