edu.umd.cloud9.collection.wikipedia
Class BuildWikipediaForwardIndex
java.lang.Object
org.apache.hadoop.conf.Configured
edu.umd.cloud9.collection.wikipedia.BuildWikipediaForwardIndex
- All Implemented Interfaces:
- Configurable, Tool
public class BuildWikipediaForwardIndex
- extends Configured
- implements Tool
Tool for building a document forward index for Wikipedia. Sample invocation:
hadoop jar cloud9.jar edu.umd.cloud9.collection.wikipedia.BuildWikipediaForwardIndex \
-libjars bliki-core-3.0.15.jar,commons-lang-2.5.jar \
/user/jimmy/Wikipedia/compressed.block/en-20101011 tmp \
/user/jimmy/Wikipedia/compressed.block/findex-en-20101011.dat
- Author:
- Jimmy Lin
BuildWikipediaForwardIndex
public BuildWikipediaForwardIndex()
run
public int run(String[] args)
throws Exception
- Specified by:
run in interface Tool
- Throws:
Exception
main
public static void main(String[] args)
throws Exception
- Throws:
Exception