|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.hadoop.mapred.FileInputFormat<K,V>
edu.umd.cloud9.collection.IndexableFileInputFormat<LongWritable,TrecWebDocument>
edu.umd.cloud9.collection.trecweb.TrecWebDocumentInputFormat
public class TrecWebDocumentInputFormat
Hadoop InputFormat for processing the TREC collection.
| Nested Class Summary | |
|---|---|
static class |
TrecWebDocumentInputFormat.TrecWebRecordReader
Hadoop RecordReader for reading TREC-formatted documents. |
| Field Summary |
|---|
| Fields inherited from class org.apache.hadoop.mapred.FileInputFormat |
|---|
LOG |
| Constructor Summary | |
|---|---|
TrecWebDocumentInputFormat()
|
|
| Method Summary | |
|---|---|
RecordReader<LongWritable,TrecWebDocument> |
getRecordReader(InputSplit inputSplit,
JobConf conf,
Reporter reporter)
Returns a RecordReader for this InputFormat. |
| Methods inherited from class org.apache.hadoop.mapred.FileInputFormat |
|---|
addInputPath, addInputPaths, getInputPathFilter, getInputPaths, getSplits, setInputPathFilter, setInputPaths, setInputPaths |
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public TrecWebDocumentInputFormat()
| Method Detail |
|---|
public RecordReader<LongWritable,TrecWebDocument> getRecordReader(InputSplit inputSplit,
JobConf conf,
Reporter reporter)
throws IOException
RecordReader for this InputFormat.
getRecordReader in interface InputFormat<LongWritable,TrecWebDocument>getRecordReader in class FileInputFormat<LongWritable,TrecWebDocument>IOException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||