To see the runner's usage and the available analytics and options, run the command below.
$ hadoop jar dga-giraph-0.0.1.jar com.soteradefense.dga.DGARunner -h
The command will output something like this.
usage: hadoop jar dga-giraph-0.0.1.jar com.soteradefense.dga.DGARunner <analytic> <input-path> <output-path> [options]
louvain - Louvain Modularity
hbse - High Betweenness Set Extraction
wcc - Weakly Connected Components
lc - Leaf Compression
pr - Page Rank
-ca <arg> Any custom arguments to pass in to giraph
-D<arg> System parameters to pass through to be added to the conf
-h Prints this help documentation and exits
-q Run analytic in quiet mode
-w <arg> The number of giraph workers to use for the analytic
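Following the usage line above, an invocation names the analytic first, then the input and output paths, then any options. The sketch below assembles such a command from shell variables and prints it rather than running it, so it works without a Hadoop cluster. The paths, the choice of `pr`, and the worker count of 4 are illustrative assumptions, not recommended values.

```shell
#!/bin/sh
# Illustrative values only; substitute your own.
ANALYTIC=pr                 # one of: louvain, hbse, wcc, lc, pr
INPUT=/path/to/input/       # placeholder input path
OUTPUT=/path/to/output/     # placeholder output path
WORKERS=4                   # assumed worker count for -w

# Order follows the usage line: <analytic> <input-path> <output-path> [options]
CMD="hadoop jar dga-giraph-0.0.1.jar com.soteradefense.dga.DGARunner $ANALYTIC $INPUT $OUTPUT -w $WORKERS -q"

# Print the assembled command instead of executing it.
echo "$CMD"
```

Once the printed command looks right, it can be run directly on a machine where Hadoop and the DGA jar are available.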
To pass a custom argument to Giraph, use -ca followed by a key=value pair:
$ ./bin/dga-giraph wcc /path/to/input/ /path/to/output/ -ca io.edge.reverse.duplicator=true
$ ./bin/dga-giraph wcc /path/to/input/ /path/to/output/ -ca pivot.batch.size=0.25
See Configurations for the list of supported properties.
To add a system parameter to the conf, append the key=value pair directly to -D, with no space:
$ ./bin/dga-giraph wcc /path/to/input/ /path/to/output/ -Dgiraph.zkList=localhost:2121
$ ./bin/dga-giraph wcc /path/to/input/ /path/to/output/ -Dpivot.batch.size=0.25
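The two option forms above differ only in spacing: -ca takes its key=value pair as a separate argument, while -D has the pair attached directly. A minimal sketch of that difference follows. The helper names `ca_flag` and `d_flag` are hypothetical and not part of DGA, and the commands are printed rather than executed.

```shell
#!/bin/sh
# Hypothetical helpers (not part of DGA) that format a key=value pair
# in each of the two option styles.
ca_flag() { printf '%s %s' -ca "$1"; }   # custom argument: "-ca key=value"
d_flag()  { printf '%s%s' -D "$1"; }     # system parameter: "-Dkey=value"

BASE="./bin/dga-giraph wcc /path/to/input/ /path/to/output/"

CA_CMD="$BASE $(ca_flag io.edge.reverse.duplicator=true)"
D_CMD="$BASE $(d_flag giraph.zkList=localhost:2121)"

# Print both commands for inspection instead of running them.
echo "$CA_CMD"
echo "$D_CMD"
```

The printed lines reproduce the first -ca example and the first -D example above.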