command-line utility to identify duplicate files in a directory tree
go get github.com/zackse/find_dupes
This program crawls a directory tree and finds duplicate files. First, files with identical sizes are grouped together, then within each group, files with duplicate md5sums are emitted to standard output.
find_dupes <PATH> [ <NUM_WORKERS> ]NUM_WORKERS defaults to 2. Note that this limit only affects the number of
goroutines collecting file sizes (calling os.Stat()) in parallel. For each
file size with more than one match, the program will launch a goroutine for
every duplicate entry to generate its md5sum.
find_dupes ~/Pictures 4