Overview
Valkey-dict
ValkeyDict is a Python library that offers a convenient and familiar interface for interacting with Valkey, treating it as if it were a Python dictionary. Its goal is to help developers write clean, Pythonic code while using Valkey as a storage solution for seamless distributed computing. Valkey-Dict utilizes Valkey as a key-value store and supports various data types, including strings, integers, floats, booleans, lists, and dictionaries. Additionally, developers can extend ValkeyDict to work with custom objects.
The library includes utility functions for more complex use cases such as caching, batching, and more. By leveraging Valkey for efficient key-value storage, ValkeyDict enables high-performance data management, maintaining efficiency even with large datasets and Valkey instances.
Usage | Types | Expiration | Batching | Custom Types | Security
Features
Dictionary-like interface: Use familiar Python dictionary syntax to interact with Valkey.
Data Type Support: Comprehensive support for various data types.
Pipelining support: Use pipelines for batch operations to improve performance.
Expiration Support: Enables the setting of expiration times either globally or individually per key, through the use of context managers.
Efficiency and Scalability: ValkeyDict is designed for use with large datasets and is optimized for efficiency. It retrieves only the data needed for a particular operation, ensuring efficient memory usage and fast performance.
Namespace Management: Provides simple and efficient namespace handling to help organize and manage data in Valkey, streamlining data access and manipulation.
Distributed Computing: With its ability to seamlessly connect to other instances or servers with access to the same Valkey instance, ValkeyDict enables easy distributed computing.
Custom data: types: Add custom types encoding/decoding to store your data types.
Encryption: allows for storing data encrypted, while retaining the simple dictionary interface.
Usage
pip install valkey-dict>>> from valkey_dict import ValkeyDict >>> dic = ValkeyDict() >>> dic['foo'] = 42 >>> dic['foo'] 42 >>> 'foo' in dic True >>> dic["baz"] = "hello world" >>> dic {'foo': 42, 'baz': 'hello world'} >>> from datetime import datetime >>> dic["datetime"] = datetime.now()In Valkey our example looks like this.
127.0.0.1:6379> KEYS "*" 1) "main:foo" 2) "main:baz" 127.0.0.1:6379> GET "main:foo" "int:42" 127.0.0.1:6379> GET "main:baz" "str:hello world" 127.0.0.1:6379> GET "main:datetime" "datetime:2025-02-20T19:37:54.214274"Types
Standard types
ValkeyDict supports a range of Python data types, from basic types to nested structures. Basic types are handled natively, while complex data types like lists and dictionaries, ValkeyDict uses JSON serialization, specifically avoiding pickle due to its security vulnerabilities within distributed computing contexts. Although the library supports nested structures, the recommended best practice is to use ValkeyDict as a shallow dictionary. This approach optimizes Valkey database performance and efficiency by ensuring that each set and get operation efficiently maps to Valkey’s key-value storage capabilities, while still preserving the library’s Pythonic interface. Following types are supported:
str, int, float, bool, NoneType, list, dict, tuple, set, datetime, date, time, timedelta, Decimal, complex, bytes, UUID, OrderedDict, defaultdict, frozenset
from valkey_dict import ValkeyDict from uuid import UUID from decimal import Decimal from collections import OrderedDict, defaultdict from datetime import datetime, date, time, timedelta dic = ValkeyDict() dic["string"] = "Hello World" dic["number"] = 42 dic["float"] = 3.14 dic["bool"] = True dic["None"] = None dic["list"] = [1, 2, 3] dic["dict"] = {"a": 1, "b": 2} dic["tuple"] = (1, 2, 3) dic["set"] = {1, 2, 3} dic["datetime"] = datetime.date(2024, 1, 1, 12, 30, 45) dic["date"] = date(2024, 1, 1) dic["time"] = time(12, 30, 45) dic["delta"] = timedelta(days=1, hours=2) dic["decimal"] = Decimal("3.14159") dic["complex"] = complex(1, 2) dic["bytes"] = bytes([72, 101, 108, 108, 111]) dic["uuid"] = UUID('12345678-1234-5678-1234-567812345678') dic["ordered"] = OrderedDict([('a', 1), ('b', 2)]) dic["default"] = defaultdict(int, {'a': 1, 'b': 2}) dic["frozen"] = frozenset([1, 2, 3])Namespaces
Acting as an identifier for your dictionary across different systems, ValkeyDict employs namespaces for organized data management. When a namespace isn’t specified, “main” becomes the default. Thus allowing for data organization across systems and projects with the same valkey instance. This approach also minimizes the risk of key collisions between different applications, preventing hard-to-debug issues. By leveraging namespaces, ValkeyDict ensures a cleaner and more maintainable data management experience for developers working on multiple projects.
Advanced Features
Expiration
Valkey provides a valuable feature that enables keys to expire. ValkeyDict supports this feature in the following ways:
Set a default expiration time when creating a ValkeyDict instance. In this example, the keys will have a default expiration time of 10 seconds. Use seconds with an integer or pass a datetime timedelta.
dic = ValkeyDict(expire=10) dic['gone'] = 'in ten seconds'Or, for a more Pythonic approach, use a timedelta.
from datetime import timedelta dic = ValkeyDict(expire=timedelta(minutes=1)) dic['gone'] = 'in a minute'
Temporarily set the default expiration time within the scope using a context manager. In this example, the key ‘gone’ will expire after 60 seconds. The default expiration time for other keys outside the context manager remains unchanged. Either pass an integer or a timedelta.
dic = ValkeyDict() seconds = 60 with dic.expire_at(seconds): dic['gone'] = 'in sixty seconds'
Updating keys while preserving the initial timeout In certain situations, there is a need to update the value while keeping the expiration intact. This is achievable by setting the ‘preserve_expiration’ to true.
import time dic = ValkeyDict(expire=10, preserve_expiration=True) dic['gone'] = 'in ten seconds' time.sleep(5) dic['gone'] = 'gone in 5 seconds'Batching
Efficiently batch your requests using the Pipeline feature, which can be easily utilized with a context manager.
dic = ValkeyDict(namespace="example") # one round trip to valkey with dic.pipeline(): for index in range(100): dic[str(index)] = indexDistributed computing
You can use ValkeyDict for distributed computing by starting multiple ValkeyDict instances on different servers or instances that have access to the same Valkey instance:
# On server 1 from valkey_dict import ValkeyDict dic = ValkeyDict(namespace="example") dic["foo"] = "bar" # On server 2 from valkey_dict import ValkeyDict dic = ValkeyDict(namespace="example") print(dic["foo"]) # outputs "bar"Additional Examples
Caching made simple
import time from datetime import timedelta from valkey_dict import ValkeyDict def expensive_function(x): time.sleep(x) return x * 2 cache = ValkeyDict(namespace="cache", expire=timedelta(minutes=60)) def cached_expensive_function(x): if x not in cache: cache[x] = expensive_function(x) return cache[x] start_time = time.time() print(cached_expensive_function(5)) # Takes around 5 seconds to compute and caches the result. print(f"Time taken: {time.time() - start_time:.2f} seconds") start_time = time.time() print(cached_expensive_function(5)) # Fetches the result from the cache, taking almost no time. print(f"Time taken: {time.time() - start_time:.2f} seconds")Valkey-dict as dictionary
from valkey_dict import ValkeyDict # Create a ValkeyDict instance with a namespace dic = ValkeyDict(namespace="example") # Set key-value pairs dic["name"] = "John Doe" dic["age"] = 32 dic["city"] = "Amsterdam" # Get value by key, from any instance connected to the same valkey/namespace print(dic["name"]) # Output: John Doe # Update value by key, got a year older dic["age"] = 33 # Check if key exists print("name" in dic) # Output: True print("country" in dic) # Output: False # Get value with a default value if the key doesn't exist print(dic.get("country", "NL")) # Output: NL # Get the length (number of keys) of the ValkeyDict print(len(dic)) # Output: 3 # Iterate over keys for key in dic: print(key, dic[key]) # Delete a key-value pair del dic["city"] # Clear all key-value pairs in the ValkeyDict dic.clear() # Get the length (number of keys) of the ValkeyDict print(len(dic)) # Output: 0 # Update ValkeyDict with multiple key-value pairs dic.update({"a": 1, "b": 2, "c": 3}) # Use methods of a normal dict print(list(dic.keys())) # Output: ['a', 'b', 'c'] print(list(dic.values())) # Output: [1, 2, 3] print(list(dic.items())) # Output: [('a', 1), ('b', 2), ('c', 3)] # Using pop() and popitem() methods value = dic.pop("a") print(value) # Output: 1 key, value = dic.popitem() print(key, value) # Output: 'c' 3 (example) # Using setdefault() method dic.setdefault("d", 4) print(dic["d"]) # Output: 4 from datetime import datetime, timedelta # Valkey dict support datetime dic["now"] = datetime.now() print(dic["now"]) # 2025-02-20 19:25:38.835816 # SValkey dict support timedelta and more types dic["time"] = timedelta(days=1) print(dic["time"]) # 1 day, 0:00:00 print(dic) {'now': datetime.datetime(2025, 2, 20, 19, 25, 38, 835816), 'time': datetime.timedelta(days=1), 'b': 2, 'd': 4}Additional Examples
For more advanced examples of ValkeyDict, please refer to the unit-test files in the repository. All features and functionalities are thoroughly tested in unit tests (here) Or take a look at load test for batching load test. The unit-tests can be as used as a starting point.
Nested types
ValkeyDict supports nested structures with mixed types through JSON serialization. The feature works by utilizing JSON encoding and decoding under the hood. While this represents an upgrade in functionality, the feature is not fully implemented and should be used with caution. For optimal performance, using shallow dictionaries is recommended.
from datetime import datetime, timedelta dic["mixed"] = [1, "foobar", 3.14, [1, 2, 3], datetime.now()] dic['dic'] = {"elapsed_time": timedelta(hours=60)}JSON Encoding - Decoding
The nested type support in ValkeyDict is implemented using custom JSON encoders and decoders. These JSON encoders and decoders are built on top of ValkeyDict’s own encoding and decoding functionality, extending it for JSON compatibility. Since JSON serialization was a frequently requested feature, these enhanced encoders and decoders are available for use in other projects:
import json from datetime import datetime from valkey_dict import ValkeyDictJSONDecoder, ValkeyDictJSONEncoder data = [1, "foobar", 3.14, [1, 2, 3], datetime.now()] encoded = json.dumps(data, cls=ValkeyDictJSONEncoder) result = json.loads(encoded, cls=ValkeyDictJSONDecoder)Custom Types
Extending ValkeyDict with Custom Types
ValkeyDict supports custom type serialization. Here’s how to add a new type:
import json class Person: def __init__(self, name, age): self.name = name self.age = age def encode(self) -> str: return json.dumps(self.__dict__) @classmethod def decode(cls, encoded_str: str) -> 'Person': return cls(**json.loads(encoded_str)) valkey_dict = ValkeyDict() # Extend valkey dict with the new type valkey_dict.extends_type(Person) # ValkeyDict can now seamlessly handle Person instances. person = Person(name="John", age=32) valkey_dict["person1"] = person result = valkey_dict["person1"] assert result.name == person.name assert result.age == person.ageInsertion Order
For insertion order, use the PythonValkeyDict. This class is focused on Python dictionary behavior one-to-one. It will eventually become a drop-in replacement for dictionary. Currently, nested types and typed keys are not yet supported but will be added in the future.
from valkey_dict import PythonValkeyDict dic = PythonValkeyDict() dic["1"] = "one" dic["2"] = "two" dic["3"] = "three" assert list(dic.keys()) == ["1", "2", "3"]For more information on extending types.
Security
Security is an important aspect of production projects. Valkey-dict was developed within a strict compliance environment. Best practice in Valkey is to use passwords and network encryption through TLS. However, Valkey-dict offers an additional feature by using extended types. It is possible to store the values of keys encrypted. The values are encrypted with AES GCM, which is currently considered best practice for security.
Storage Encryption
For storing data values encrypted can be achieved using encrypted values, more documentation on that later. For now code example within this test file. encrypted test.
Encryption Network
Setup guide for configuring and utilizing encrypted Valkey TLS for valkey-dict. Setup guide
Tests
The ValkeyDict library includes a comprehensive suite of tests that ensure its correctness and resilience. The test suite covers various data types, edge cases, and error handling scenarios. It also employs the Hypothesis library for property-based testing, which provides fuzz testing to evaluate the implementation
Valkey config
To configure ValkeyDict using your Valkey config.
Configure both the host and port. Or configuration with a setting dictionary.
dic = ValkeyDict(host='127.0.0.1', port=6380) valkey_config = { 'host': '127.0.0.1', 'port': 6380, } config_dic = ValkeyDict(**valkey_config)Installation
pip install valkey-dictNote
Please be aware that this project is currently being utilized by various organizations in their production environments. If you have any questions or concerns, feel free to raise issues
This project only uses valkey as dependency