Open Access   Article Go Back

A REVIEW OF OPTIMIZATION METHODS IN DEEP LEARNING

Amita Khatana1 , V.K Narang2 , Vikas Thada3

  1. Department of Computer Science, Amity University, Haryana, India.
  2. Department of Computer Science, Amity University, Haryana, India.
  3. Department of Computer Science, Amity University, Haryana, India.

Section:Review Paper, Product Type: Journal Paper
Volume-6 , Issue-4 , Page no. 440-447, Apr-2018

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v6i4.440447

Online published on Apr 30, 2018

Copyright © Amita Khatana, V.K Narang, Vikas Thada . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Amita Khatana, V.K Narang, Vikas Thada , “A REVIEW OF OPTIMIZATION METHODS IN DEEP LEARNING,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.4, pp.440-447, 2018.

MLA Style Citation: Amita Khatana, V.K Narang, Vikas Thada "A REVIEW OF OPTIMIZATION METHODS IN DEEP LEARNING." International Journal of Computer Sciences and Engineering 6.4 (2018): 440-447.

APA Style Citation: Amita Khatana, V.K Narang, Vikas Thada , (2018). A REVIEW OF OPTIMIZATION METHODS IN DEEP LEARNING. International Journal of Computer Sciences and Engineering, 6(4), 440-447.

BibTex Style Citation:
@article{Khatana_2018,
author = {Amita Khatana, V.K Narang, Vikas Thada },
title = {A REVIEW OF OPTIMIZATION METHODS IN DEEP LEARNING},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {4 2018},
volume = {6},
Issue = {4},
month = {4},
year = {2018},
issn = {2347-2693},
pages = {440-447},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=1915},
doi = {https://doi.org/10.26438/ijcse/v6i4.440447}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v6i4.440447}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=1915
TI - A REVIEW OF OPTIMIZATION METHODS IN DEEP LEARNING
T2 - International Journal of Computer Sciences and Engineering
AU - Amita Khatana, V.K Narang, Vikas Thada
PY - 2018
DA - 2018/04/30
PB - IJCSE, Indore, INDIA
SP - 440-447
IS - 4
VL - 6
SN - 2347-2693
ER -

VIEWS PDF XML
899 512 downloads 311 downloads
  
  
           

Abstract

Deep learning technique is an emerging field of machine learning. In recent years, it has been successfully used in different fields, such as image classification, natural language processing, computer vision, speech reorganization, etc. When compared to the machine learning, deep learning has a high learning ability to extract features of large datasets. Deep learning came into existence in 1971 when Ivakhnenka used group method of data handling algorithm (GMDH) to train 8-layered neural network [1]. This paper focuses on the artificial neural network, learning techniques and optimization methods of deep learning like stochastic gradient descent, batch gradient descent, mini-batch gradient descent and ADAM.

Key-Words / Index Term

Artificial Neural Network, Deep Learning CNN, RNN, Optimization Methods, Gradient Descent, ADAM, Framework,mageClassification.

References

[1] A.G. Ivakhnenko, “Polynomial theory of complex systems” IEEE Transaction on System, Man and Cybernetic vol. 1,no 4, pp. 364-378, 1971.
[2] Xuedam Du,Yinghao Cai, Wang, and Leijie Zhang “ Overview of Deep Learning” 31st Youth Academic Annual Conference of Chinese Association of Automation Wuhan China November 11-13-2016.
[3] Siddhartha Sankar Nath, Janynyaseni Kar, Girish Mishra, Sayan Chakraborty , Nilanjan Dey “ A Survey of Image Classification Methods and Techniques ” ICCCICCT 2014.
[4] A. Krizhevsky, I. Sutskever, and G. E. Hinton. “ImageNet Classification with Deep Convolutional Neural Networks”. Neural Information Processing Systems, Nevada, 2012
[5] Henrik Petersson, David Gustafsson and David Bergstroom “ Hyperspectral Image Analysis using Deep Learning- a Review” IEEE 2016.
[6] Adrian Carrio, Carlos Sampedro, Alejandro Rodriguez Ramos and Pascual Campoy “A Review of Deep Learning Methods and Applications for unmanned Aerial Vehicles ” Hindawi Journal of Sensors 2017.
[7] H. Kamitomo and C. Lu, “3-d face recognition method based on optimum 3-d image measurement technology,” Artificial Life and Robotics, vol. 16, no. 4, pp. 551–554, 2012.
[8] Walaa Hussein Ibrrahim, Ahmed AbdelRhman Ahmed Osman, Yusra Ibrahim Mohamad” MRI Image Classification Using Neural Network” ICCEEE, 2013.
[9] S.Kim, B.Park, B.S Song, and S.Yang, “ Deep belief network based statistical feature learning for fingerprint liveness detection, ” Pattern Recog. Lett., vol 77, ,pp. 58-65,2016.
[10] Gang Liu, Liang Xiao, Caiquan Xiong “ Image Classification with deep belief network and improved gradient descent” IEEE 2017.
[11] Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, “Gradientbased learning applied to document recognition,” Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324, 1998.
[12] Travis Williams, Robert Li “ Advanced Image Classification using Wavelets and Convolution Neural Network” IEEE 2016.
[13] Narek Abroyan , “ Convolutional and Recurrent Neural Network for real time data classification” The Seventh International Conference on Innovative Computing Technology (INTECH 2017).
[14] Ian Goodfellow, Yoshua Bengio, and Aaron Courville, “Deep Learning”, Book in preparation for MIT Press, 2016, on-line version available at http://www.deeplearningbook.org.
[15] Michael A.Nielsen, “Neural Networks and Deep Learning”, Determination Press, 2015.
[16] Marek Dabrowski, Justyna Gromada, Tomasz Michalik Orange Centrum “A Practical study of neural network –based image classification model trained with transfer learning method” FedCSIS 2016.
[17] C. Lu and X. Tang, “Surpassing human-level face verification performance on lfw with gaussianface,” arXiv preprint arXiv:1404.3840, 2014.
[18] L. Deng and D. Yu, “Deep learning: methods and applications,” Foundations and Trends in Signal Processing, vol. 7, no. 3-4, pp. 197–387, 2013.
[19] J. A. Hertz,” Introduction to the theory of neural computation.“ Boulder, USA: Westview Press, 1991.
[20] J. Suykens and J. Vandewalle, “Least squares support vector machine classifiers,” Neural Processing Letters, vol. 9, no. 3, pp. 293–300, 1999.
[21] Y. Xiong and R.Zuo, “ Recognization of geochemical anomalies using a deep autoencoder network,”Computer Geosci-UK, vol.86, pp. 75-82, 2016.
[22] Zejian Shi, Minyong Shi, Chunfang Li,” The prediction of character based on recurrent neural network,” IEEE computer society, Wuhan China, 2017
[23] Taro Ishitakl, Ryolchlro Obukata, Tetsuya Oda, Leonard Baroll, “Application of deep recurrent neural network for prediction of user behavoiur in Tor Network,” 31st International Conference on Advanced Information Networking and Application Workshops, 2017.
[24]Marek Daabrowski, J. Gromada, T. Michalik,” P practical study of neural network-based image classification model trained with transfer learning method,” Federated Conference on Computer Science and Information Systems pp. 49-56, 2016.
[25] B. Wang, K. Yager, D.Yu, Minh Hoai,” X- ray Scattering image classification using deep learning,” IEEE Winter Conference on Application of Computer Science,2017
[26] Nur Anis Mohmon and Norsuzila ya acob,” A review on classification of satellite image using artificial neural network (ANN),” IEEE 5th Control and System Graduate Research Colloquium,2014.
[27] R.Jyothi, Y.K. SundaraKrishna, V. Srinivasa Rao,” Paper Currency recognition for color images based on Artificial Neural Network,” International Conference on Electrical , Electronics and Optimization Techniques ( ICEEOT), 2016.
[28]M.Abadi, A. Agarwal, p. Barham, E. Brevdo, Z.f. Chen, C. C itro,et at.,” Tensorflow : Large-scale machine learning on heterogeneous distributed systems,” arXiv preprint arXiv: 1603.04467,2016.
[29]R. Collobert, S.Bengio, and J.Mariethoz, “ Torch: a modular machine learning software library,” Idiap, No.EPFL-REPORT-82802, 2002.
[30]R. AI-Rfou, G.Alain, A.Almahairi el at.,” Theano ; a python framework for fast computation of mathematics expression,” arXiv preprint arXiv : 1605.02688,2016.
[31]http://www.wpclipart.com/medical/anatomy/cells/neuron/neuron.png.html