Using Amazon ElastiCache for Redis

suyog · September 17, 2020, 9:08am

Bloom filters are an interesting probabilistic data structure that have can be used to see if an item has never been added previously

Bloom filters work by running an item through a quick hashing function and sampling bits from that hash and setting them from a 0 to 1 at particular interval in a bitfield. To check for existence in a Bloom filter, the same bits are sampled. Many item may have bits that overlap, but since a hashing function produce unique identifiers, if a single bit from the hash is still a 0, then we know it has not been previously added.

A good use case for a Bloom filter is to check for an already used username. On a small scale, this is no problem, but as a service grows, this can be very taxing on a database. It is very simple to implement this with a ReBloom.

First, let’s add a handful of usernames as a test:
BF.ADD

>  usernames funnyfred 
(integer) 1 
> BF.ADD usernames fredisfunny 
(integer) 1 
> BF.ADD usernames fred 
(integer) 1 
> BF.ADD usernames funfred 
(integer) 1

Now, let’s run some test versus the Bloom filter.
BF.EXISTS

> BF.EXISTS usernames fred 
(integer) 1 
> BF.EXISTS usernames fred_is_funny 
(integer) 0

As expected, fred_is_funny yields a 0. A response of zero means we can be sure that this username has not been used. A response of 1 means it might have been used. We can’t say for certain as it might a case of overlapping bits between multiple items.

Topic		Replies	Views
Differences and Similarities Between Open Source Redis and Amazon Elasticache Redis administration	3	694	October 5, 2020
Performance improvement of redis transactions Redis administration hashes , redisearch , java , aws , elasticache	0	545	January 19, 2022
About Redis Administration Redis administration	15	1433	November 17, 2020
Use Cases RedisBloom	2	733	February 12, 2020
Redis Bloom Use Cases RedisBloom	2	910	June 9, 2020

Using Amazon ElastiCache for Redis

Related topics