forked from jseidman/hadoop-R
-
Notifications
You must be signed in to change notification settings - Fork 5
/
README
21 lines (11 loc) · 1.07 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
Examples of integrating Hadoop and R. This directory contains the following:
airline/
Examples which use the flight arrival and departure data available here:
http://stat-computing.org/dataexpo/2009/the-data.html
Note that this is the same data set used for many of the examples in the RHIPE documentation.
The following examples are in this directory:
airline/src/deptdelay_by_month/R/streaming/ - Example that uses the Hadoop streaming MapReduce interface to calculate average departure delay by month for each airline.
airline/src/deptdelay_by_month/R/hive - Example using Hadoop Interactive for running MapReduce code to calculate average departure delay by month for each airline.
airline/src/deptdelay_by_month/R/rhipe - Example using RHIPE to run MapReduce code that calculates average departure delay by month for each airline and then visualize the results.
airline/src/deptdelay_by_month/R/rmr - Example using Revolution Analytics rmr package to calculate average departure delay by month for each airline.
Instructions for running the code can be found with each example.