java.lang.Object
  org.apache.hadoop.mapreduce.lib.partition.InputSampler.RandomSampler<K,V>
public static class InputSampler.RandomSampler<K,V>
Sample from random points in the input. General-purpose sampler. Takes numSamples / maxSplitsSampled inputs from each split.
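The per-split figure above is plain integer arithmetic. The following is a minimal sketch, assuming the divisor is capped at the number of splits that actually exist, so that "sampling all splits" can be modelled by passing an unbounded `maxSplitsSampled`. `SplitQuota`, `perSplit`, and `totalSplits` are hypothetical names used for illustration only and are not part of Hadoop.

```java
/** Hypothetical helper illustrating the numSamples / maxSplitsSampled quota. */
final class SplitQuota {
    private SplitQuota() {}

    /**
     * Returns how many keys to take from each sampled split.
     * Assumption: no more splits are sampled than actually exist.
     */
    static int perSplit(int numSamples, int maxSplitsSampled, int totalSplits) {
        if (numSamples < 0 || maxSplitsSampled <= 0 || totalSplits <= 0) {
            throw new IllegalArgumentException(
                "numSamples must be non-negative; split counts must be positive");
        }
        int splitsToSample = Math.min(maxSplitsSampled, totalSplits);
        return numSamples / splitsToSample; // integer division, remainder dropped
    }
}
```

Under these assumptions, `SplitQuota.perSplit(10000, 10, 40)` evaluates to 1000, while an uncapped limit over the same 40 splits, `SplitQuota.perSplit(10000, Integer.MAX_VALUE, 40)`, evaluates to 250.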
| Constructor Summary | |
|---|---|
| InputSampler.RandomSampler(double freq, int numSamples) | Create a new RandomSampler sampling all splits. |
| InputSampler.RandomSampler(double freq, int numSamples, int maxSplitsSampled) | Create a new RandomSampler. |
| Method Summary | |
|---|---|
| K[] | getSample(InputFormat<K,V> inf, Job job): Randomize the split order, then take the specified number of keys from each split sampled, where each key is selected with the specified probability and possibly replaced by a subsequently selected key when the quota of keys from that split is satisfied. |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public InputSampler.RandomSampler(double freq,
int numSamples)
Parameters:
freq - Probability with which a key will be chosen.
numSamples - Total number of samples to obtain from all selected splits.
public InputSampler.RandomSampler(double freq,
int numSamples,
int maxSplitsSampled)
Parameters:
freq - Probability with which a key will be chosen.
numSamples - Total number of samples to obtain from all selected splits.
maxSplitsSampled - The maximum number of splits to examine.

| Method Detail |
|---|
public K[] getSample(InputFormat<K,V> inf,
Job job)
throws IOException,
InterruptedException
Specified by:
getSample in interface InputSampler.Sampler<K,V>
Throws:
IOException
InterruptedException
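
The selection procedure described for `getSample` (shuffle the splits, accept each key with probability `freq`, and let a later selection displace an earlier one once a split's quota is met) can be sketched in plain Java. This is an illustrative sketch under stated assumptions, not Hadoop's implementation. Splits are modelled as in-memory lists rather than `InputSplit` records, the per-split quota is taken as `numSamples` divided by the number of splits sampled, and `RandomSampleSketch`, `sample`, and the `Random` parameter are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

/** Hypothetical, Hadoop-free sketch of random sampling with replacement. */
final class RandomSampleSketch {
    private RandomSampleSketch() {}

    static <K> List<K> sample(List<List<K>> splits, double freq,
                              int numSamples, int maxSplitsSampled, Random rnd) {
        if (splits.isEmpty() || numSamples <= 0 || maxSplitsSampled <= 0) {
            return new ArrayList<>();
        }
        // Randomize the split order on a copy so the caller's list is untouched.
        List<List<K>> shuffled = new ArrayList<>(splits);
        Collections.shuffle(shuffled, rnd);

        int splitsToSample = Math.min(maxSplitsSampled, shuffled.size());
        int quota = numSamples / splitsToSample; // keys to keep per split

        List<K> samples = new ArrayList<>();
        for (int i = 0; i < splitsToSample; i++) {
            List<K> kept = new ArrayList<>();
            for (K key : shuffled.get(i)) {
                if (rnd.nextDouble() >= freq) {
                    continue; // key not selected
                }
                if (kept.size() < quota) {
                    kept.add(key);
                } else if (quota > 0) {
                    // Quota met: a later selection replaces a random earlier one.
                    kept.set(rnd.nextInt(quota), key);
                }
            }
            samples.addAll(kept);
        }
        return samples;
    }
}
```

With `freq = 1.0` every key is selected, so each sampled split contributes exactly its quota as long as it holds at least that many keys. Passing a seeded `Random` makes a run reproducible, which is convenient when checking the behaviour by hand.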