java.lang.Object
  org.apache.hadoop.conf.Configured
    edu.umd.cloud9.example.cooccur.ComputeCooccurrenceMatrixPairs
public class ComputeCooccurrenceMatrixPairs
extends Configured
implements Tool
Implementation of the "pairs" algorithm for computing co-occurrence matrices from a large text collection. This algorithm is described in Chapter 3 of "Data-Intensive Text Processing with MapReduce" by Lin & Dyer, as well as the following paper:
Jimmy Lin. Scalable Language Processing Algorithms for the Masses: A Case Study in Computing Word Co-occurrence Matrices with MapReduce. Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP 2008), pages 419-428.
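The "pairs" approach can be illustrated with a small in-memory sketch: a map step emits one key per co-occurring (word, neighbor) pair, and a reduce step sums the counts for each key. This is not the tool's source code and does not use Hadoop. The class name `PairsSketch`, the whitespace tokenization, the tab-separated key format, and the `window` parameter are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, in-memory illustration of the "pairs" algorithm (no Hadoop).
final class PairsSketch {

    // "Map" step: for every term, emit one key "term<TAB>neighbor" for each
    // neighbor that lies within `window` positions on the same line.
    static List<String> mapLine(String line, int window) {
        String[] terms = line.trim().split("\\s+"); // assumed tokenization
        List<String> keys = new ArrayList<>();
        for (int i = 0; i < terms.length; i++) {
            if (terms[i].isEmpty()) {
                continue; // an empty line yields a single empty token
            }
            int lo = Math.max(0, i - window);
            int hi = Math.min(terms.length - 1, i + window);
            for (int j = lo; j <= hi; j++) {
                if (j != i) {
                    keys.add(terms[i] + "\t" + terms[j]);
                }
            }
        }
        return keys;
    }

    // "Reduce" step: sum the implicit count of 1 carried by each emitted key.
    static Map<String, Integer> countPairs(List<String> lines, int window) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : lines) {
            for (String key : mapLine(line, window)) {
                counts.merge(key, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        Map<String, Integer> counts = countPairs(Arrays.asList("a b c", "a b"), 1);
        counts.forEach((pair, n) -> System.out.println(pair + "\t" + n));
    }
}
```

In a MapReduce job, the grouping done here by `Map.merge` would be performed by the framework's shuffle, with a reducer (and optionally a combiner) summing the values for each pair key.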
This program takes the following command-line arguments:
| Constructor Summary |
|---|
| ComputeCooccurrenceMatrixPairs() Creates an instance of this tool. |
| Method Summary | |
|---|---|
| static void | main(String[] args) Dispatches command-line arguments to the tool via the ToolRunner. |
| int | run(String[] args) Runs this tool. |
| Methods inherited from class org.apache.hadoop.conf.Configured |
|---|
| getConf, setConf |

| Methods inherited from class java.lang.Object |
|---|
| equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |

| Methods inherited from interface org.apache.hadoop.conf.Configurable |
|---|
| getConf, setConf |
| Constructor Detail |
|---|
public ComputeCooccurrenceMatrixPairs()
| Method Detail |
|---|
public int run(String[] args)
throws Exception
Runs this tool.
Specified by: run in interface Tool
Throws: Exception
public static void main(String[] args)
throws Exception
Dispatches command-line arguments to the tool via the ToolRunner.
Throws: Exception