Rule of thumb for number of shards and number of documents

bogumil · September 21, 2020, 10:50pm

At what point (in number of documents) should RediSearch be partitioned for the best read latency in full-text search? Is there a general rule of thumb, e.g. to get a 20 ms response time, one shard should hold no more than 1 million documents?

suyog · September 22, 2020, 8:20am

@bogumil

Are you using Redis OSS or Redis Enterprise (by Redis Labs)?

If you are using Redis Enterprise, you should see here and reach out to the Redis Labs team for further guidance:

Hope this helps.

bogumil · September 22, 2020, 10:09am

@suyog thank you for the suggestion.

I am on OSS.

I have checked the sizing calculator, but it does not have a latency input.

I see long delays when searching with short terms: “a”, “ab”, etc. The more characters or tokens in the query, the faster the response. However, I need searches to stay fast even for short terms.

Hope that makes sense?

suyog · September 22, 2020, 3:31pm

@bogumil

Have you tried setting a LIMIT value?

suyog · September 22, 2020, 3:32pm

@bogumil

Are you looking for autosuggestion functionality with just prefix matches? https://oss.redislabs.com/redisearch/Commands/#suggestions

bogumil · September 22, 2020, 10:47pm

LIMIT works on the result set.
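
For reference, a minimal sketch of the kind of query being discussed. It only builds the argument list, so it can be inspected without a server; the index name `idx` and the helper `build_prefix_search` are hypothetical. `LIMIT offset num` pages the results that are returned, which is why it does not change how the prefix itself is matched.

```python
def build_prefix_search(index, prefix, offset=0, num=10):
    """Build FT.SEARCH arguments for a prefix query with paging.

    `index` is a hypothetical index name. LIMIT only pages the result set.
    """
    if not prefix.isalnum():
        # Keep the sketch simple: no escaping of special query characters.
        raise ValueError("sketch handles plain alphanumeric prefixes only")
    return ["FT.SEARCH", index, f"{prefix}*", "LIMIT", str(offset), str(num)]


args = build_prefix_search("idx", "w")
print(" ".join(args))  # FT.SEARCH idx w* LIMIT 0 10
```

With redis-py, a list like this could be sent with `client.execute_command(*args)`.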

bogumil · September 22, 2020, 10:50pm

I cannot use autosuggestion because it only matches on the prefix of the document as a whole, not on the prefix of each term. For example, my document contains “star wars”, and I need it to be found when the user types “w”…
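
A possible workaround, sketched here as an assumption rather than something confirmed in this thread: add one suggestion entry per term suffix, with the document id as the payload, so that “wars” is stored alongside “star wars”. The dictionary key `titles`, the id `doc:1`, and both helper functions are hypothetical; the code only builds `FT.SUGADD` argument lists and does not talk to a server.

```python
def suggestion_entries(doc_id, title):
    """Return (suggestion_string, payload) pairs, one per term suffix.

    "star wars" -> ("star wars", doc_id), ("wars", doc_id)
    """
    terms = title.lower().split()
    return [(" ".join(terms[i:]), doc_id) for i in range(len(terms))]


def sugadd_commands(key, doc_id, title, score=1.0):
    """Build FT.SUGADD argument lists; `key` is a hypothetical dictionary name."""
    return [
        ["FT.SUGADD", key, text, str(score), "PAYLOAD", payload]
        for text, payload in suggestion_entries(doc_id, title)
    ]


for cmd in sugadd_commands("titles", "doc:1", "star wars"):
    print(cmd)
```

A lookup such as `FT.SUGGET titles w WITHPAYLOADS` would then be able to return the “wars” entry and its document id. The trade-off is that the dictionary grows with the number of terms per document.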

harry · September 24, 2020, 2:47pm

As a general rule of thumb, I would recommend doing some estimation by benchmarking searches against your documents on one shard/process prior to going live. If you need more performance or need to scale, you can try setting up a Redis cluster instead, which usually requires a minimum of 3 nodes (to achieve quorum).

Also, this partitioning maintenance and management can be simplified by using Redis Enterprise, as @suyog suggested.
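
The benchmarking step could look roughly like the sketch below. It times a list of queries and reports median and approximate 95th-percentile latency. `fake_search` is a stand-in so the example runs without a server; in a real test you would replace it with a call to your own client, and the query strings are only examples.

```python
import statistics
import time


def time_queries(run_query, queries, repeats=5):
    """Return per-call latencies in milliseconds, one sample per query per repeat."""
    samples = []
    for _ in range(repeats):
        for q in queries:
            start = time.perf_counter()
            run_query(q)
            samples.append((time.perf_counter() - start) * 1000.0)
    return samples


def summarize(samples):
    """Median and approximate 95th percentile of the latency samples."""
    cuts = statistics.quantiles(samples, n=20)  # 19 cut points; the last is ~p95
    return {"p50": statistics.median(samples), "p95": cuts[-1]}


def fake_search(query):
    """Stand-in for a real search call; replace with your client call."""
    return [d for d in ("star wars", "star trek") if query.rstrip("*") in d]


stats = summarize(time_queries(fake_search, ["a*", "ab*", "star*"]))
print(stats)
```

Running the same query mix against one shard at increasing document counts should show where short prefixes such as “a” stop meeting the latency target.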
