You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Nov 4, 2024. It is now read-only.
Copy file name to clipboardexpand all lines: doc/benchmark.md
+10-1
Original file line number
Diff line number
Diff line change
@@ -4,6 +4,15 @@ To test performance of KVDK, you can run our benchmark tool "bench", the tool is
4
4
5
5
You can manually run individual benchmark follow the examples as shown bellow, or simply run our basic benchmark script "scripts/run_benchmark.py" to test all the basic read/write performance.
6
6
7
+
To run the script, you shoulf first build kvdk, then run:
data_type: Which data type to benchmark, it can be string/sorted/hash/list/blackhole/all
14
+
15
+
key distribution: Distribution of key of the benchmark workloads, it can be random/zipf/all
7
16
## Fill data to new instance
8
17
9
18
To test performance, we need to first fill key-value pairs to the KVDK instance. Since KVDK did not support cross-socket access yet, we need to bind bench program to a numa node:
@@ -20,7 +29,7 @@ Explanation of arguments:
20
29
21
30
-space: PMem space that allocate to the KVDK instance.
22
31
23
-
-max_access_threads: Max concurrent access threads of the KVDK instance, set it to the number of the hyper-threads for performance consideration.
32
+
-max_access_threads: Max concurrent access threads in the KVDK instance, set it to the number of the hyper-threads for performance consideration. You can call KVDK API with any number of threads, but if your parallel threads more than max_access_threads, the performance will be degraded due to synchronization cost
24
33
25
34
-type: Type of key-value pairs to benchmark, it can be "string", "hash" or "sorted".
Copy file name to clipboardexpand all lines: doc/user_doc.md
+68-30
Original file line number
Diff line number
Diff line change
@@ -1,9 +1,9 @@
1
1
KVDK
2
2
=======
3
3
4
-
KVDK(Key-Value Development Kit) is a Key-Value store for Persistent memory(PMem).
4
+
KVDK(Key-Value Development Kit) is a Key-Value store for Persistent Memory (PMem).
5
5
6
-
KVDK supports both sorted and unsorted KV-Pairs.
6
+
KVDK supports basic read and write operations on both sorted and unsorted KV-Pairs, it also support some advanced features, such as **backup**, **checkpoint**, **expire key**, **atomic batch write** and **transactions**.
7
7
8
8
Code snippets in this user documents are from `./examples/tutorial/cpp_api_tutorial.cpp`, which is built as `./build/examples/tutorial/cpp_api_tutorial`.
9
9
@@ -70,7 +70,7 @@ int main()
70
70
`kvdk::Status` indicates status of KVDK function calls.
71
71
Functions return `kvdk::Status::Ok` if such a function call is a success.
72
72
If exceptions are raised during function calls, other `kvdk::Status` is returned,
73
-
such as `kvdk::Status::MemoryOverflow`.
73
+
such as `kvdk::Status::MemoryOverflow` while no enough memory to allocate.
74
74
75
75
## Close a KVDK instance
76
76
@@ -97,26 +97,45 @@ int main()
97
97
```
98
98
99
99
## Data types
100
-
KVDK currently supports string type for both keys and values.
101
-
### Strings
102
-
All keys and values in a KVDK instance are strings.
100
+
KVDK currently supports raw string, sorted collection, hash collection and list data type.
101
+
102
+
### Raw String
103
+
104
+
All keys and values in a KVDK instance are strings. You can directly store or read key-value pairs in global namespace, which is accessible via Get, Put, Delete and Modify operations, we call them string type data in kvdk.
103
105
104
106
Keys are limited to have a maximum size of 64KB.
105
107
106
-
A string value can be at max 64MB in length by default. The maximum length can be configured when initializing a KVDK instance.
108
+
A value can be at max 64MB in length by default. The maximum length can be configured when initializing a KVDK instance.
109
+
110
+
### Collections
111
+
112
+
Instead of raw string, you can organize key-value pairs to a collection, each collection has its own namespace.
113
+
114
+
Currently we have three types of collection:
115
+
116
+
#### Sorted Collection
117
+
118
+
KV pairs are stored with some kind of order (lexicographical order by default) in Sorted Collection, they can be iterated forward or backward starting from an arbitrary point(at a key or between two keys) by an iterator. They can also be directly accessed via SortedGet, SortedPut, SortedDelete operations.
119
+
120
+
#### Hash Collection
121
+
122
+
Hash Collection is like Raw String with a name space, you can access KV pairs via HashGet, HashPut, HashDelete and HashModify operations.
123
+
124
+
In current version, performance of operations on hash collection is similar to sorted collection, which much slower than raw-string, so we recomend use raw-string or sorted collection as high priority.
125
+
126
+
#### List
107
127
108
-
## Collections
109
-
All Key-Value pairs(KV-Pairs) are organized into collections.
128
+
List is a list of string elements, you can access elems at the front or back via ListPushFront, ListPushBack, ListPopFron, ListPopBack, or operation elems with index via ListInsertAt, ListInsertBefore, ListInsertAfter and ListErase. Notice that operation with index take O(n) time, while operation on front and back only takes O(1).
110
129
111
-
There is an anonymous global collection with KV-Pairs directly accessible via Get, Put, Delete operations. The anonymous global collection is unsorted.
130
+
### Namespace
112
131
113
-
Users can also create named collections.
132
+
Each collection has its own namespace, so you can store same key in every collection. Howevery, collection name and raw string key are in a same namespace, so you can't assign same name for a collection and a string key, otherwise a error status (Status::WrongType) will be returned.
114
133
115
-
KVDK currently supports sorted named collections. Users can iterate forward or backward starting from an arbitrary point(at a key or between two keys) by an iterator. Elements can also be directly accessed via SortedGet, SortedPut, SortedDelete operations.
134
+
## API Examples
116
135
117
-
## Reads and Writes in Anonymous Global Collection
136
+
###Reads and Writes with String type
118
137
119
-
A KVDK instance provides Get, Put, Delete methods to query/modify/delete entries.
138
+
A KVDK instance provides Get, Put, Delete methods to query/modify/delete raw string kvs.
120
139
121
140
The following code performs a series of Get, Put and Delete operations.
122
141
@@ -125,7 +144,7 @@ int main()
125
144
{
126
145
... Open a KVDK instance as described in "Open a KVDK instance" ...
127
146
128
-
// Reads and Writes on Anonymous Global Collection
147
+
// Reads and Writes String KV
129
148
{
130
149
std::string key1{"key1"};
131
150
std::string key2{"key2"};
@@ -173,11 +192,11 @@ int main()
173
192
}
174
193
```
175
194
176
-
## Reads and Writes in a Named Collection
195
+
###Reads and Writes in a Sorted Collection
177
196
178
197
A KVDK instance provides SortedGet, SortedPut, SortedDelete methods to query/modify/delete sorted entries.
179
198
180
-
The following code performs a series of SortedGet, SortedPut and SortedDelete operations, which also initialize a named collection implicitly.
199
+
The following code performs a series of SortedGet, SortedPut and SortedDelete operations on a sorted collection.
181
200
182
201
```c++
183
202
intmain()
@@ -194,9 +213,13 @@ int main()
194
213
std::string value2{"value2"};
195
214
std::string v;
196
215
216
+
// You must create sorted collections before you do any operations on them
217
+
status = engine->SortedCreate(collection1);
218
+
assert(status == kvdk::Status::Ok);
219
+
status = engine->SortedCreate(collection2);
220
+
assert(status == kvdk::Status::Ok);
221
+
197
222
// Insert key1-value1 into "my_collection_1".
198
-
// Implicitly create a collection named "my_collection_1" in which
199
-
// key1-value1 is stored.
200
223
status = engine->SortedPut(collection1, key1, value1);
201
224
assert(status == kvdk::Status::Ok);
202
225
@@ -206,8 +229,6 @@ int main()
206
229
assert(v == value1);
207
230
208
231
// Insert key1-value2 into "my_collection_2".
209
-
// Implicitly create a collection named "my_collection_2" in which
210
-
// key1-value2 is stored.
211
232
status = engine->SortedPut(collection2, key1, value2);
212
233
assert(status == kvdk::Status::Ok);
213
234
@@ -236,8 +257,13 @@ int main()
236
257
status = engine->SortedDelete(collection1, key1);
237
258
assert(status == kvdk::Status::Ok);
238
259
239
-
printf("Successfully performed SortedGet, SortedPut, SortedDelete operations on named "
240
-
"collections.\n");
260
+
// Destroy sorted collections
261
+
status = engine->SortedDestroy(collection1);
262
+
assert(status == kvdk::Status::Ok);
263
+
status = engine->SrotedDestroy(collection2);
264
+
assert(status == kvdk::Status::Ok);
265
+
266
+
printf("Successfully performed SortedGet, SortedPut, SortedDelete operations.\n");
241
267
}
242
268
243
269
... Do something else with KVDK instance ...
@@ -246,17 +272,18 @@ int main()
246
272
}
247
273
```
248
274
249
-
## Iterating a Named Collection
250
-
The following example demonstrates how to iterate through a named collection. It also demonstrates how to iterate through a range defined by Key.
275
+
###Iterating a Sorted Collection
276
+
The following example demonstrates how to iterate through a sorted collection at a consistent view of data. It also demonstrates how to iterate through a range defined by Key.
251
277
252
278
```c++
253
279
intmain()
254
280
{
255
281
... Open a KVDK instance as described in "Open a KVDK instance" ...
// Sort kv_pairs for checking the order of "my_sorted_collection".
283
310
std::sort(kv_pairs.begin(), kv_pairs.end());
284
311
285
-
// Iterate through collection "my_sorted_collection"
312
+
// Iterate through collection "my_sorted_collection", the iter is
313
+
// created on a consistent view while you create it, e.g. all
314
+
// modifications after you create the iter won't be observed
286
315
auto iter = engine->SortedIteratorCreate(sorted_collection);
287
316
iter->SeekToFirst();
288
317
{
@@ -320,7 +349,7 @@ int main()
320
349
}
321
350
}
322
351
323
-
printf("Successfully iterated through a sorted named collections.\n");
352
+
printf("Successfully iterated through a sorted collections.\n");
324
353
engine->SortedIteratorRelease(iter);
325
354
}
326
355
@@ -330,7 +359,7 @@ int main()
330
359
}
331
360
```
332
361
333
-
## Atomic Updates
362
+
###Atomic Updates
334
363
KVDK supports organizing a series of Put, Delete operations into a `kvdk::WriteBatch` object as an atomic operation. If KVDK fail to apply the `kvdk::WriteBatch` object as a whole, i.e. the system shuts down during applying the batch, it will roll back to the status right before applying the `kvdk::WriteBatch`.
335
364
336
365
```c++
@@ -387,7 +416,12 @@ A KVDK instance can be accessed by multiple read and write threads safely. Synch
387
416
Users can configure KVDK to adapt to their system environment by setting up a `kvdk::Configs` object and passing it to 'kvdk::Engine::Open' when initializing a KVDK instance.
388
417
389
418
### Max Access Threads
390
-
Maximum number of access threads is specified by `kvdk::Configs::max_access_threads`. Defaulted to 48. It's recommended to set this number to the number of threads provided by CPU.
419
+
Maximum number of internal access threads in kvdk is specified by `kvdk::Configs::max_access_threads`. Defaulted to 64. It's recommended to set this number to the number of threads provided by CPU.
420
+
421
+
You can call KVDK API with any number of threads, but if your parallel threads more than max_access_threads, the performance will be degraded due to synchronization cost
422
+
423
+
### Clean Threads
424
+
KVDK reclaim space of updated/deleted data in background with dynamic number of clean threads, you can specify max clean thread number with `kvdk::Configs::clean_threads`. Defaulted to 8, you can config more clean threads in delete intensive workloads to avoid space be exhausted.
391
425
392
426
### PMem File Size
393
427
`kvdk::Configs::pmem_file_size` specifies the space allocated to a KVDK instance. Defaulted to 2^38Bytes = 256GB.
@@ -418,3 +452,7 @@ Specified by `kvdk::Configs::hash_bucket_num`. Greater number will improve perfo
418
452
419
453
### Buckets per Slot
420
454
Specified by `kvdk::Configs::num_buckets_per_slot`. Smaller number will improve performance by reducing lock contentions and improving caching at the cost of greater DRAM space. Please read Architecture Documentation for details before tuning this parameter.
455
+
456
+
## Advanced features and more API
457
+
458
+
Please read examples/tutorial for more API and advanced features in KVDK.
0 commit comments