diff --git a/docs/modules/demos/pages/hbase-hdfs-load-cycling-data.adoc b/docs/modules/demos/pages/hbase-hdfs-load-cycling-data.adoc
index 2c0437a..da9ed88 100644
--- a/docs/modules/demos/pages/hbase-hdfs-load-cycling-data.adoc
+++ b/docs/modules/demos/pages/hbase-hdfs-load-cycling-data.adoc
@@ -84,7 +84,7 @@ This demo will run two jobs to automatically load data.
 
 === distcp-cycling-data
 
-{distcp}[DistCp] (distributed copy) is used for large inter/intra-cluster copying.
+{distcp}[DistCp] (distributed copy) efficiently transfers large amounts of data from one location to another.
 It uses MapReduce to effect its distribution, error handling, recovery, and reporting.
 It expands a list of files and directories into input to map tasks, each of which will copy a partition of the files specified in the source list.
 Therefore, the first Job uses DistCp to copy data from a S3 bucket into HDFS.