| Interface | Description |
|---|---|
| SequenceFileInputFilter.Filter |
filter interface
|
| Class | Description |
|---|---|
| CombineFileInputFormat<K,V> |
An abstract
InputFormat that returns CombineFileSplit's in
InputFormat.getSplits(JobContext) method. |
| CombineFileRecordReader<K,V> |
A generic RecordReader that can hand out different recordReaders
for each chunk in a
CombineFileSplit. |
| CombineFileRecordReaderWrapper<K,V> |
A wrapper class for a record reader that handles a single file split.
|
| CombineFileSplit |
A sub-collection of input files.
|
| CombineSequenceFileInputFormat<K,V> |
Input format that is a
CombineFileInputFormat-equivalent for
SequenceFileInputFormat. |
| CombineTextInputFormat |
Input format that is a
CombineFileInputFormat-equivalent for
TextInputFormat. |
| CompressedSplitLineReader |
Line reader for compressed splits
Reading records from a compressed split is tricky, as the
LineRecordReader is using the reported compressed input stream
position directly to determine when a split has ended.
|
| DelegatingInputFormat<K,V> |
An
InputFormat that delegates behavior of paths to multiple other
InputFormats. |
| DelegatingMapper<K1,V1,K2,V2> |
An
Mapper that delegates behavior of paths to multiple other
mappers. |
| DelegatingRecordReader<K,V> |
This is a delegating RecordReader, which delegates the functionality to the
underlying record reader in
TaggedInputSplit |
| FileInputFormat<K,V> |
A base class for file-based
InputFormats. |
| FileSplit |
A section of an input file.
|
| FixedLengthInputFormat |
FixedLengthInputFormat is an input format used to read input files
which contain fixed length records.
|
| FixedLengthRecordReader |
A reader to read fixed length records from a split.
|
| KeyValueLineRecordReader |
This class treats a line in the input as a key/value pair separated by a
separator character.
|
| KeyValueTextInputFormat |
An
InputFormat for plain text files. |
| LineRecordReader |
Treats keys as offset in file and value as line.
|
| MultipleInputs |
This class supports MapReduce jobs that have multiple input paths with
a different
InputFormat and Mapper for each path |
| NLineInputFormat |
NLineInputFormat which splits N lines of input as one split.
|
| SequenceFileAsBinaryInputFormat |
InputFormat reading keys, values from SequenceFiles in binary (raw)
format.
|
| SequenceFileAsBinaryInputFormat.SequenceFileAsBinaryRecordReader |
Read records from a SequenceFile as binary (raw) bytes.
|
| SequenceFileAsTextInputFormat |
This class is similar to SequenceFileInputFormat, except it generates
SequenceFileAsTextRecordReader which converts the input keys and values
to their String forms by calling toString() method.
|
| SequenceFileAsTextRecordReader |
This class converts the input keys and values to their String forms by
calling toString() method.
|
| SequenceFileInputFilter<K,V> |
A class that allows a map/red job to work on a sample of sequence files.
|
| SequenceFileInputFilter.FilterBase |
base class for Filters
|
| SequenceFileInputFilter.MD5Filter |
This class returns a set of records by examing the MD5 digest of its
key against a filtering frequency f.
|
| SequenceFileInputFilter.PercentFilter |
This class returns a percentage of records
The percentage is determined by a filtering frequency f using
the criteria record# % f == 0.
|
| SequenceFileInputFilter.RegexFilter |
Records filter by matching key to regex
|
| SequenceFileInputFormat<K,V> |
An
InputFormat for SequenceFiles. |
| SequenceFileRecordReader<K,V> |
An
RecordReader for SequenceFiles. |
| SplitLineReader | |
| TextInputFormat |
An
InputFormat for plain text files. |
| UncompressedSplitLineReader |
SplitLineReader for uncompressed files.
|
| Enum | Description |
|---|---|
| FileInputFormat.Counter | Deprecated |
| FileInputFormatCounter |
| Exception | Description |
|---|---|
| InvalidInputException |
This class wraps a list of problems with the input, so that the user
can get a list of problems together instead of finding and fixing them one
by one.
|
Copyright © 2022 Apache Software Foundation. All rights reserved.