A Redis sharding implementation.
Redis is great. It's fast, lightweight and easy to use. But when we want to store a mass of data into one single instance, we may encounter some problems such as performance degradation and slow recovery and we need to scale it.
First, Create an RedisShardAPI instance with multiple nodes, node name must be unique:
from redis_shard.shard import RedisShardAPI
servers = [
{'name': 'server1', 'host': '127.0.0.1', 'port': 10000, 'db': 0},
{'name': 'server2', 'host': '127.0.0.1', 'port': 11000, 'db': 0},
{'name': 'server3', 'host': '127.0.0.1', 'port': 12000, 'db': 0},
]
client = RedisShardAPI(servers, hash_method='md5')
Then, you can access the Redis cluster as you use redis-py:
client.set('test', 1)
client.get('test') # 1
client.zadd('testset', 'first', 1)
client.zadd('testset', 'second', 2)
client.zrange('testset', 0, -1) # [first, second]
If you want to store specific keys on one node for some reason (such as you prefer single instance pipeline, or
you want to use multi-keys command such as sinter), you should use Hash Tags:
client.set('foo', 2)
client.set('a{foo}', 5)
client.set('b{foo}', 5)
client.set('{foo}d', 5)
client.set('d{foo}e', 5)
client.get_server_name('foo') == client.get_server_name('a{foo}') == client.get_server_name('{foo}d') \
== client.get_server_name('d{foo}e') # True
The string in a braces of a key is the Hash Tag of the key. The hash of a Hash Tag will be treated the hash of the key.
So, keys foo, bar{foo} and b{foo}ar will be sotred in the same node.
Note
Hash Tags are not supported with bytes keys in Python 3.
Just add tag_ prefix, you can use any of the normal redis method on the same hash tag:
client.tag_mget("{user:1}question1", "{user:1}question2")
client.tag_delete("{user:1}question1", "{user:1}question2")
Only support mget, mset and flushdb.
There are three parameters servers, hash_method and sentinel in the class redis_shard.shard.RedisShardAPI.
servers is a list. Each element in it should be a dict or a URL schema.
dict:
[ {'name': 'server1', 'host': '127.0.0.1', 'port': 10000, 'db': 0}, {'name': 'server2', 'host': '127.0.0.1', 'port': 11000, 'db': 0, 'max_connections': 50}, {'name': 'server3', 'host': '127.0.0.1', 'port': 12000, 'db': 0}, ]URL schema:
[ 'redis://127.0.0.1:10000/0?name=node1', 'redis://127.0.0.1:11000/0?name=node2&max_connections=50', 'redis://127.0.0.1:12000/0?name=node3' ]
If the following parameter sentinel is enabled, only name is needed for the servers config.
hash_method is a string which indicate the method of generating the hash key of the consistent hash ring.
The default value is crc32. It also supports md5 and sha1.
sentinel is the config for Redis Sentinel. With the sentinel support, redis-shard
will do read/write splitting. Config is like this:
{"hosts": [('localhost', 26379)], "socket_timeout": 0.1}
- Redis Shard dose not support all Redis commands.
- As mentioned above, Redis Shard does not support all multi-keys commands crossing different nodes, you have to use Hash Tag to work with those commands.
- Redis Shard does not have any replication mechanism.
Redis Shard is basically inspired by this article.