CFP last date
17 June 2024
Reseach Article

Solving Big Data Problem using Hadoop File System(HDFS)

Published on September 2015 by Smita Chaturvedi, Nivedita Bhirud, Fiona Lowden
International Conference and Workshop on Communication, Computing and Virtualization
Foundation of Computer Science USA
ICWCCV2015 - Number 3
September 2015
Authors: Smita Chaturvedi, Nivedita Bhirud, Fiona Lowden
ffeb689d-a56d-4093-aa39-1140f666bd0d

Smita Chaturvedi, Nivedita Bhirud, Fiona Lowden . Solving Big Data Problem using Hadoop File System(HDFS). International Conference and Workshop on Communication, Computing and Virtualization. ICWCCV2015, 3 (September 2015), 0-0.

@article{
author = { Smita Chaturvedi, Nivedita Bhirud, Fiona Lowden },
title = { Solving Big Data Problem using Hadoop File System(HDFS) },
journal = { International Conference and Workshop on Communication, Computing and Virtualization },
issue_date = { September 2015 },
volume = { ICWCCV2015 },
number = { 3 },
month = { September },
year = { 2015 },
issn = 2249-0868,
pages = { 0-0 },
numpages = 1,
url = { /proceedings/icwccv2015/number3/804-1576/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference and Workshop on Communication, Computing and Virtualization
%A Smita Chaturvedi
%A Nivedita Bhirud
%A Fiona Lowden
%T Solving Big Data Problem using Hadoop File System(HDFS)
%J International Conference and Workshop on Communication, Computing and Virtualization
%@ 2249-0868
%V ICWCCV2015
%N 3
%P 0-0
%D 2015
%I International Journal of Applied Information Systems
Abstract

The data which is useful not only for one person but for all, that data is called as Big data or It's a data to be too big to be processed in a single machine is known as Big data. Big data are the data which are extremely large in size that may be analyses computationally to disclose the patterns, associations and trends etc. For Example: Many users visited the amazon site; in particular page for how many user visit that page, from which IP address they visit the page, for how many hours they visit the page etc information stored in the amazon site is known as the example of Big data. Huge amount of data is created by phone data, online stores and by research data. Potentially data is created fast, the data coming from different sources in various formats and not most data are worthless but some data does has low value. Hadoop solves the Big data problem using the concept HDFS (Hadoop Distributed File System). In this paper the running of map reduce code in apache Hadoop is shown. Hadoop solves the problem of Big data by storing the data in distributed form in different machines. There are plenty of data but that data have to be store in a cost effective way and process it efficiently.

References
  1. The diverse and exploding digital universe. http://www. emc. com/digital universe, 2009.
  2. Hadoop. http://hadoop. apache. org, 2009.
  3. HDFS (hadoop distributed file system) architecture. http://hadoop. apache. org/common/docs/current/hdfs design. html, 2009.
  4. R. Abbott and H. Garcia-Molina. Scheduling I/O requestswith dead-lines: A performance evaluation. InProceedings of the 11th Real-TimeSystems Symposium, pages 113–124, Dec 1990.
  5. G. Candea, N. Polyzotis, and R. Vingralek. A scalable, predictable joinoperator for highly concurrent data warehouses. In35th InternationalConference on Very Large Data Bases (VLDB), 2009.
  6. The Hadoop Distributed File System : Balancing Portability, A. Hemanth, Dr. R. V. Krishniah (International Journal of Computer Engineering & Applications, Vol. III, Issue III, ISSN 2321-3469)
  7. The Data Recovery File System Systems for Hadoop Cluster- Review Paper, V. S. Karwande, Dr. S. S. Lomte, Prof. R. A. Auti (ISSN:0975-9646)
  8. The book titled with " Hadoop : The Definitive Guide" By Tom White
  9. The book titled with "Hadoop Operations" by Eric Sammer
Index Terms

Computer Science
Information Sciences

Keywords

Big data mapreduce 3V Eco System HDFS Hadoop.