|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.hadoop.mapred.lib.InputSampler.IntervalSampler<K,V>
public static class InputSampler.IntervalSampler<K,V>
Sample from s splits at regular intervals. Useful for sorted data.
| Constructor Summary | |
|---|---|
InputSampler.IntervalSampler(double freq)
Create a new IntervalSampler sampling all splits. |
|
InputSampler.IntervalSampler(double freq,
int maxSplitsSampled)
Create a new IntervalSampler. |
|
| Method Summary | |
|---|---|
K[] |
getSample(InputFormat<K,V> inf,
JobConf job)
For each split sampled, emit when the ratio of the number of records retained to the total record count is less than the specified frequency. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public InputSampler.IntervalSampler(double freq)
freq - The frequency with which records will be emitted.
public InputSampler.IntervalSampler(double freq,
int maxSplitsSampled)
freq - The frequency with which records will be emitted.maxSplitsSampled - The maximum number of splits to examine.getSample(org.apache.hadoop.mapred.InputFormat, org.apache.hadoop.mapred.JobConf) | Method Detail |
|---|
public K[] getSample(InputFormat<K,V> inf,
JobConf job)
throws IOException
getSample in interface InputSampler.Sampler<K,V>IOException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||