Class BloomFilter
- All Implemented Interfaces:
Writable
- Direct Known Subclasses:
RetouchedBloomFilter
The Bloom filter is a data structure that was introduced in 1970 and that has been adopted by the networking research community in the past decade thanks to the bandwidth efficiencies that it offers for the transmission of set membership information between networked hosts. A sender encodes the information into a bit vector, the Bloom filter, that is more compact than a conventional representation. Computation and space costs for construction are linear in the number of elements. The receiver uses the filter to test whether various elements are members of the set. Though the filter will occasionally return a false positive, it will never return a false negative. When creating the filter, the sender can choose its desired point in a trade-off between the false positive rate and the size.
Originally created by European Commission One-Lab Project 034819.
- See Also:
-
The general behavior of a filter- Space/Time Trade-Offs in Hash Coding with Allowable Errors
-
Field Summary
Fields inherited from class org.apache.hadoop.util.bloom.Filter
hash, hashType, nbHash, vectorSize -
Constructor Summary
ConstructorsConstructorDescriptionDefault constructor - use with readFieldsBloomFilter(int vectorSize, int nbHash, int hashType) Constructor -
Method Summary
Modifier and TypeMethodDescriptionvoidadd(org.apache.hadoop.util.bloom.Key key) Adds a key to this filter.voidand(org.apache.hadoop.util.bloom.Filter filter) Peforms a logical AND between this filter and a specified filter.intbooleanmembershipTest(org.apache.hadoop.util.bloom.Key key) Determines wether a specified key belongs to this filter.voidnot()Performs a logical NOT on this filter.voidor(org.apache.hadoop.util.bloom.Filter filter) Peforms a logical OR between this filter and a specified filter.voidreadFields(DataInput in) Deserialize the fields of this object fromin.toString()voidwrite(DataOutput out) Serialize the fields of this object toout.voidxor(org.apache.hadoop.util.bloom.Filter filter) Peforms a logical XOR between this filter and a specified filter.Methods inherited from class org.apache.hadoop.util.bloom.Filter
add, add, add
-
Constructor Details
-
BloomFilter
public BloomFilter()Default constructor - use with readFields -
BloomFilter
public BloomFilter(int vectorSize, int nbHash, int hashType) Constructor- Parameters:
vectorSize- The vector size of this filter.nbHash- The number of hash function to consider.hashType- type of the hashing function (seeHash).
-
-
Method Details
-
add
public void add(org.apache.hadoop.util.bloom.Key key) Description copied from class:org.apache.hadoop.util.bloom.FilterAdds a key to this filter.- Specified by:
addin classorg.apache.hadoop.util.bloom.Filter- Parameters:
key- The key to add.
-
and
public void and(org.apache.hadoop.util.bloom.Filter filter) Description copied from class:org.apache.hadoop.util.bloom.FilterPeforms a logical AND between this filter and a specified filter.Invariant: The result is assigned to this filter.
- Specified by:
andin classorg.apache.hadoop.util.bloom.Filter- Parameters:
filter- The filter to AND with.
-
membershipTest
public boolean membershipTest(org.apache.hadoop.util.bloom.Key key) Description copied from class:org.apache.hadoop.util.bloom.FilterDetermines wether a specified key belongs to this filter.- Specified by:
membershipTestin classorg.apache.hadoop.util.bloom.Filter- Parameters:
key- The key to test.- Returns:
- boolean True if the specified key belongs to this filter. False otherwise.
-
not
public void not()Description copied from class:org.apache.hadoop.util.bloom.FilterPerforms a logical NOT on this filter.The result is assigned to this filter.
- Specified by:
notin classorg.apache.hadoop.util.bloom.Filter
-
or
public void or(org.apache.hadoop.util.bloom.Filter filter) Description copied from class:org.apache.hadoop.util.bloom.FilterPeforms a logical OR between this filter and a specified filter.Invariant: The result is assigned to this filter.
- Specified by:
orin classorg.apache.hadoop.util.bloom.Filter- Parameters:
filter- The filter to OR with.
-
xor
public void xor(org.apache.hadoop.util.bloom.Filter filter) Description copied from class:org.apache.hadoop.util.bloom.FilterPeforms a logical XOR between this filter and a specified filter.Invariant: The result is assigned to this filter.
- Specified by:
xorin classorg.apache.hadoop.util.bloom.Filter- Parameters:
filter- The filter to XOR with.
-
toString
-
getVectorSize
public int getVectorSize()- Returns:
- size of the the bloomfilter
-
write
Description copied from interface:WritableSerialize the fields of this object toout.- Specified by:
writein interfaceWritable- Overrides:
writein classorg.apache.hadoop.util.bloom.Filter- Parameters:
out-DataOuputto serialize this object into.- Throws:
IOException- any other problem for write.
-
readFields
Description copied from interface:WritableDeserialize the fields of this object fromin.For efficiency, implementations should attempt to re-use storage in the existing object where possible.
- Specified by:
readFieldsin interfaceWritable- Overrides:
readFieldsin classorg.apache.hadoop.util.bloom.Filter- Parameters:
in-DataInputto deseriablize this object from.- Throws:
IOException- any other problem for readFields.
-