NOAA’s National Climatic Data Center (NCDC) Weather Data Analysis on Apache Hadoop Yarn Single Cluster Environment

Vol-3 | Issue-05 | May-2016 | Published Online: 05 May 2016
Author(s)
Dr. Dhaval S. Vyas 1; Mr. Bhavin J. Mathiya 2; Dr. Vinodkumar L. Desai 3

1 Dean, M.Phil Programme, C.U. Shah University, Wadhwan City, Gujarat, India

2 Research Scholar, C.U. Shah University, Wadhwan City, Gujarat, India

3 Department of Computer Science, Government Science College, Chikhli, Navsari, Gujarat, India

Abstract

This paper analyzes and tunes Apache Hadoop performance through parameter configuration, using a real-world application: analysis of weather data from NOAA’s National Climatic Data Center (NCDC). Apache Hadoop is currently used in many kinds of real-world applications, such as weather data analysis, social networking site analysis, medical data analysis, and sensor data analysis. Here, the NCDC weather data analysis is run under different customized Apache Hadoop parameter configurations for performance tuning, with the goal of identifying hot and cold days from the recorded temperatures. NCDC is responsible for storing, observing, and retrieving weather data and for providing public access to it. Users can download this weather data via FTP.

Keywords
Apache Hadoop YARN, HDFS, MapReduce, TeraGen, TeraSort, TeraValidate, TestDFSIO (Read), TestDFSIO (Write), WordCount.