edu.umd.cloud9.collection.wikipedia
Class BuildWikipediaLinkGraph
java.lang.Object
org.apache.hadoop.conf.Configured
edu.umd.cloud9.collection.wikipedia.BuildWikipediaLinkGraph
- All Implemented Interfaces:
- Configurable, Tool
public class BuildWikipediaLinkGraph
- extends Configured
- implements Tool
Tool for extracting the link graph out of Wikipedia. Sample invocation:
hadoop jar cloud9.jar edu.umd.cloud9.collection.wikipedia.BuildWikipediaLinkGraph \
-libjars bliki-core-3.0.15.jar,commons-lang-2.5.jar \
/user/jimmy/Wikipedia/compressed.block/en-20101011 \
/user/jimmy/Wikipedia/edges /user/jimmy/Wikipedia/adjacency 10
- Author:
- Jimmy Lin
BuildWikipediaLinkGraph
public BuildWikipediaLinkGraph()
run
public int run(String[] args)
throws Exception
- Specified by:
run in interface Tool
- Throws:
Exception
main
public static void main(String[] args)
throws Exception
- Throws:
Exception