Open Access   Article Go Back

Analysis of Retail Data using Apache Spark

Himani Agnihotri1 , Bharti Nagpal2

Section:Research Paper, Product Type: Journal Paper
Volume-7 , Issue-5 , Page no. 1162-1165, May-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7i5.11621165

Online published on May 31, 2019

Copyright © Himani Agnihotri, Bharti Nagpal . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Himani Agnihotri, Bharti Nagpal, “Analysis of Retail Data using Apache Spark,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.5, pp.1162-1165, 2019.

MLA Style Citation: Himani Agnihotri, Bharti Nagpal "Analysis of Retail Data using Apache Spark." International Journal of Computer Sciences and Engineering 7.5 (2019): 1162-1165.

APA Style Citation: Himani Agnihotri, Bharti Nagpal, (2019). Analysis of Retail Data using Apache Spark. International Journal of Computer Sciences and Engineering, 7(5), 1162-1165.

BibTex Style Citation:
@article{Agnihotri_2019,
author = {Himani Agnihotri, Bharti Nagpal},
title = {Analysis of Retail Data using Apache Spark},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2019},
volume = {7},
Issue = {5},
month = {5},
year = {2019},
issn = {2347-2693},
pages = {1162-1165},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4379},
doi = {https://doi.org/10.26438/ijcse/v7i5.11621165}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i5.11621165}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4379
TI - Analysis of Retail Data using Apache Spark
T2 - International Journal of Computer Sciences and Engineering
AU - Himani Agnihotri, Bharti Nagpal
PY - 2019
DA - 2019/05/31
PB - IJCSE, Indore, INDIA
SP - 1162-1165
IS - 5
VL - 7
SN - 2347-2693
ER -

VIEWS PDF XML
394 184 downloads 98 downloads
  
  
           

Abstract

The use of social media sites and Internet is increasing at an alarming rate. Therefore data is generated in huge amounts fraction of a second. This huge amount of data which is characterized by volume, velocity and variety is termed as big data. There is need for a framework that can process this huge amount of data and also analyze it efficiently. Apache Spark is an open source cluster computing platform which can process and analyze data efficiently. In this paper an overview and a simple example of analysis of retail data using Apache Spark is given to demonstrate its functionality.

Key-Words / Index Term

Big Data, Hadoop, MapReduce, Yarn, Spark, RDD

References

[1]. Mantripatjit Kaur, Gurleen Kaur Dhaliwal, “Performance Comparison of Map Reduce and Apache Spark on Hadoop for Big Data Analysis”, International Journal of Computer Sciences and Engineering, Vol.3, Issue.11, pp.66-69, 2015.
[2]. Lei Gu, Huan Li, “Memory or Time: Performance Evaluation for Iterative Operation on Hadoop and Spark”, In IEEE 10th International Conference on High Performance Computing and Communications and IEEE International Conference on Embedded and Ubiquitous Computing (IEEE 2013), pp.721-727 , 2013.
[3]. Abdul Ghaffar Shoro, Tariq Rahim Soomro, “Big Data Analysis: Ap Spark Perspective”, Global Journal of Computer Science and Technology, Vol.15, Issue.1, pp.7-14, 2015.
[4]. Abhishek Bhattacharya, Shefali Bhatnagar, “Big Data and Apache Spark: A Review”, International Journal of Engineering Research & Science (IJOER), Vol.2, Issue.5, pp.206-210, 2016.
[5]. V Srinivas Jonnalagadda, P Srikanth, Krishnamachari Thumati, Sri Hari Nallamala, ”A Review Study of Apache Spark in Big Data Processing”, International Journal of Computer Science Trends and Technology (IJCST), Vol. 4, Issue.3, pp.93-98, 2016.
[6]. Priya Dahiya, Chaitra.B, Usha Kumari, “Survey on Big Data using Apache Hadoop and Spark”, International Journal of Computer Engineering In Research Trends, Vol. 4, Issue.6, pp.195-201, 2017.
[7]. Amit Palve1, Rohini D. Sonawane, Amol D. Potgantwar, “Sentiment Analysis of Twitter Streaming Data for Recommendation using Apache Spark”, International Journal of Scientific Research in Network Security and Communication, Vol.5, Issue.3, pp.99-103, 2017.
[8]. Vivek Francis Pinto, Sampath Kini, Igneta Mcluren Dsouza, “A Review Document on Apache Spark for Big Data Analytics with Case Studies”, International Journal of Computer Science Trends and Technology (IJCST),Vol.5, Issue.5, pp.99-103, 2017
[9]. Kalyani K. Pathrikar, Prof. Arundhati A. Dudhgaonkar, ”Review on apache spark technology”, International Research Journal of Engineering and Technology (IRJET), Vol.4, Issue.10, pp.1386-1388, 2017.
[10]. Smita M. Deshpande, R. S. Shirsath, “Ranking of Product on Big Data using Apache Spark”, Sixth Post Graduate Conference for Computer Engineering (cPGCON 2017) Procedia International Journal on Emerging Trends in Technology (IJETT), 2017.
[11]. S.N. Patil, S.M. Deshpande , Amol D. Potgantwar, “Product Recommendation using Multiple Filtering Mechanisms on Apache Spark”, International Journal of Scientific Research in Network Security and Communication, Vol.5, Issue.3, pp.76-83, 2017.