Skip to content

The Benefits of Faunus

Dan LaRocque edited this page Sep 5, 2014 · 26 revisions
This is the documentation for Faunus 0.4.
Faunus was merged into Titan and renamed Titan-Hadoop in version 0.5.
Documentation for the latest Titan version is available at http://s3.thinkaurelius.com/docs/titan/current.

Faunus is a Hadoop centric graph analytics engine. It is used to compute graph derivations, graph statistics, and graph data mappings (input/output mappings) on massive-scale graphs represented across a multi-machine cluster. Beyond this foundation, other benefits are itemized below.

  • Process “infinite sized” graphs using the functional, parallel MapReduce computing model.
  • Express complex global graph traversals or graph statistics as simple Gremlin queries (via REPL or CLI). No knowledge of MapReduce or Java code required as Gremlin expressions are automatically compiled to MapReduce steps.
  • Leverage vertex-centric distributed graph computing against a live graph database.
  • Numerous input/output formats supported which enables Faunus to be used as a graph data mapping tool.
  • Process graphs in the distributed graph database Titan. Take advantage of Cassandra’s and HBase’s native Hadoop connectivity.
  • Process graphs exposed by TinkerPop’s Rexster graph server. Simply pull graph data out of any Blueprints enabled graph database.
  • Process graphs represented in GraphSON. Simply store the text-representation in HDFS.
  • Process graphs represented in various RDF formats. Simply store the text-representation in HDFS.
  • Output graphs as an edge list to easily interoperate with other graph analysis packages (e.g. R iGRaph).
  • Custom input and output format scripts can be provided for general purpose data import/export.