Hadoop Distributed file system, Hive and Its Applications: A Survey

Main Article Content

Mr. Prashant R. Mahajan, Prof. Amrit Priyadarshi

Abstract

Business intelligence is growing area across the industry and data getting collected and analyzed in rapid way due to which legacy warehousing tools has become very costly. Hadoop is framework which is open source and stores data and runs applications on cluster of normal i.e commodity hardware. Hadoop provides large amount of processing power and storage for various kinds of data. It is able to handle concurrent tasks or jobs. HDFS (Hadoop Distributed File System) is a distributed file system which can provide high performance data access across Hadoop cluster of servers. Due to Managing pools of big data and supporting big data analytics application HDFS has become a strong tool. Developer has to write custom programs in map reduce programming model which are difficult to maintain and reuse. Hive is open source solution built on top of hadoop which is used as data ware house. Hive supports HiveQL which is SQL-like language, which are compiled into mapreduce jobs to be executed on Hadoop.

Article Details

How to Cite
, M. P. R. M. P. A. . P. (2015). Hadoop Distributed file system, Hive and Its Applications: A Survey. International Journal on Recent and Innovation Trends in Computing and Communication, 3(11), 6262–6265. https://doi.org/10.17762/ijritcc.v3i11.5031
Section
Articles