edu.umd.cloud9.collection.medline
Class MedlineCitationInputFormat

java.lang.Object
  extended by org.apache.hadoop.mapred.FileInputFormat<K,V>
      extended by edu.umd.cloud9.collection.IndexableFileInputFormat<LongWritable,MedlineCitation>
          extended by edu.umd.cloud9.collection.medline.MedlineCitationInputFormat
All Implemented Interfaces:
InputFormat<LongWritable,MedlineCitation>, JobConfigurable

public class MedlineCitationInputFormat
extends IndexableFileInputFormat<LongWritable,MedlineCitation>
implements JobConfigurable

Hadoop InputFormat for processing MEDLINE citations in XML format.

Author:
Jimmy Lin

Nested Class Summary
static class MedlineCitationInputFormat.MedlineCitationRecordReader
          Hadoop RecordReader for reading MEDLINE citations in XML format.
 
Field Summary
 
Fields inherited from class org.apache.hadoop.mapred.FileInputFormat
LOG
 
Constructor Summary
MedlineCitationInputFormat()
          Creates a MedlineCitationInputFormat.
 
Method Summary
 void configure(JobConf conf)
           
 RecordReader<LongWritable,MedlineCitation> getRecordReader(InputSplit inputSplit, JobConf conf, Reporter reporter)
          Returns a RecordReader for this InputFormat.
 
Methods inherited from class org.apache.hadoop.mapred.FileInputFormat
addInputPath, addInputPaths, getInputPathFilter, getInputPaths, getSplits, setInputPathFilter, setInputPaths, setInputPaths
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

MedlineCitationInputFormat

public MedlineCitationInputFormat()
Creates a MedlineCitationInputFormat.

Method Detail

configure

public void configure(JobConf conf)
Specified by:
configure in interface JobConfigurable

getRecordReader

public RecordReader<LongWritable,MedlineCitation> getRecordReader(InputSplit inputSplit,
                                                                  JobConf conf,
                                                                  Reporter reporter)
                                                           throws IOException
Returns a RecordReader for this InputFormat.

Specified by:
getRecordReader in interface InputFormat<LongWritable,MedlineCitation>
Specified by:
getRecordReader in class FileInputFormat<LongWritable,MedlineCitation>
Throws:
IOException
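
This page does not document how the record reader locates individual citations. As a rough illustration only, the sketch below shows one way a stream of MEDLINE XML could be cut into (offset, citation) pairs, which is the general shape implied by the LongWritable,MedlineCitation type parameters. It uses only the Java standard library and none of the Cloud9 or Hadoop classes. The class name CitationSplitSketch, the choice of <MedlineCitation> as the record delimiter, and the use of a character offset as the key are all assumptions made for this example, not statements about the actual implementation.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-alone sketch; not part of Cloud9 or Hadoop.
public class CitationSplitSketch {
    // Assumed record delimiters; the tags the real reader uses are not documented here.
    static final String START = "<MedlineCitation";
    static final String END = "</MedlineCitation>";

    // Finds the next opening tag, skipping longer names such as <MedlineCitationSet>.
    static int findStart(String xml, int from) {
        int i = xml.indexOf(START, from);
        while (i >= 0) {
            int next = i + START.length();
            if (next < xml.length()
                    && (xml.charAt(next) == '>' || Character.isWhitespace(xml.charAt(next)))) {
                return i;
            }
            i = xml.indexOf(START, next);
        }
        return -1;
    }

    // Maps the character offset of each citation block to its XML text, in document order.
    static Map<Long, String> split(String xml) {
        Map<Long, String> records = new LinkedHashMap<>();
        int pos = 0;
        while (true) {
            int start = findStart(xml, pos);
            if (start < 0) {
                break;
            }
            int end = xml.indexOf(END, start);
            if (end < 0) {
                break; // unterminated block: stop rather than emit a partial record
            }
            end += END.length();
            records.put((long) start, xml.substring(start, end));
            pos = end;
        }
        return records;
    }

    public static void main(String[] args) {
        String sample = "<MedlineCitationSet>"
                + "<MedlineCitation Status=\"MEDLINE\"><PMID>1</PMID></MedlineCitation>"
                + "<MedlineCitation Status=\"MEDLINE\"><PMID>2</PMID></MedlineCitation>"
                + "</MedlineCitationSet>";
        // Print one line per citation: offset, tab, raw XML.
        split(sample).forEach((offset, record) -> System.out.println(offset + "\t" + record));
    }
}
```

In an actual job, the equivalent work is done by the RecordReader returned from getRecordReader, and the mapper receives each citation as a MedlineCitation value rather than a raw string.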