Update docs/modules/demos/pages/hbase-hdfs-load-cycling-data.adoc
Co-authored-by: Razvan-Daniel Mihai <84674+razvan@users.noreply.github.com>
fhennig and razvan authored Sep 16, 2024
1 parent 7baba0d commit 887bad1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/modules/demos/pages/hbase-hdfs-load-cycling-data.adoc
@@ -84,7 +84,7 @@ This demo will run two jobs to automatically load data.

=== distcp-cycling-data

-{distcp}[DistCp] (distributed copy) is used for large inter/intra-cluster copying.
+{distcp}[DistCp] (distributed copy) efficiently transfers large amounts of data from one location to another.
It uses MapReduce to effect its distribution, error handling, recovery, and reporting.
It expands a list of files and directories into input to map tasks, each of which will copy a partition of the files specified in the source list.
Therefore, the first Job uses DistCp to copy data from a S3 bucket into HDFS.
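
The context lines above describe DistCp expanding a source list into inputs for map tasks, each of which copies one partition of the files. The following is only a toy sketch of that idea, not DistCp's actual implementation: the function name `partition_by_size`, the greedy balancing strategy, and the file paths and sizes are all assumptions made for illustration.

```python
from typing import List, Tuple


def partition_by_size(files: List[Tuple[str, int]], num_maps: int) -> List[List[str]]:
    """Split (path, size) pairs into num_maps groups of roughly equal total size.

    Illustrative only: a greedy assignment that always gives the next file
    to the map task with the smallest load so far.
    """
    if num_maps < 1:
        raise ValueError("num_maps must be at least 1")

    partitions: List[List[str]] = [[] for _ in range(num_maps)]
    loads = [0] * num_maps

    # Place the largest files first so the greedy choice stays balanced.
    for path, size in sorted(files, key=lambda f: f[1], reverse=True):
        target = loads.index(min(loads))  # least-loaded map task
        partitions[target].append(path)
        loads[target] += size

    return partitions


# Hypothetical expanded source listing: (path, size in bytes).
listing = [
    ("/data/a.csv", 400),
    ("/data/b.csv", 300),
    ("/data/c.csv", 200),
    ("/data/d.csv", 100),
]

partitions = partition_by_size(listing, num_maps=2)
for task_id, paths in enumerate(partitions):
    print(f"map task {task_id} copies {paths}")
```

Each inner list stands in for the work handed to one map task. In a real cluster the copy itself, along with error handling and reporting, is carried out by the MapReduce job rather than by a loop like this one.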