T

andrew ee36bda8f8 Track initializedness of Node memory more precisely

By not initializing Node members with dummy default values.

This has performance/code size benefits, and improves debugging when
running under valgrind.

Unfortunately this also makes it easy to write code that uses
uninitialized memory, so if valgrind doesn't have good coverage then we
might let some uninit usages sneak through.

We plan to have good coverage for valgrind, so I think it's ok. If
writing correct code becomes too tedious then we can go back to
initializing Node fields with dummy default values.

2024-03-14 14:58:20 -07:00

.vscode

Fill in "Checking point reads"

2024-03-06 21:22:30 -08:00

corpus

Update corpus

2024-03-12 14:52:43 -07:00

include

Improve ConflictSet.h readability

2024-03-07 16:17:17 -08:00

paper

Tidying

2024-03-07 12:33:34 -08:00

third_party

Vendor valgrind headers

2024-02-21 16:15:40 -08:00

.clangd

Add benchmarks for getChild{L,G}eq

2024-01-30 13:08:01 -08:00

.gitignore

Add .gitignore, CMakeLists.txt, ConflictSet.cpp

2024-01-17 10:51:22 -08:00

.pre-commit-config.yaml

Bring back precommit check for SHOW_MEMORY

2024-03-08 14:44:35 -08:00

aarch64-toolchain.cmake

Try cross-compiling to arm in jenkins

2024-03-04 22:58:04 -08:00

Bench.cpp

Use a warmup instead

2024-03-05 17:22:11 -08:00

CMakeLists.txt

Use asan and ubsan for whitebox/fuzz tests

2024-03-14 13:47:21 -07:00

conflict_set_c_api_test.c

Interface change! addWrites now takes a single write version

2024-03-05 16:55:27 -08:00

conflict_set_cxx_api_test.cpp

Interface change! addWrites now takes a single write version

2024-03-05 16:55:27 -08:00

ConflictSet.cpp

Track initializedness of Node memory more precisely

2024-03-14 14:58:20 -07:00

Dockerfile

Trim down docker file and avoid valgrind when cross-compiling

2024-03-05 11:39:36 -08:00

fdb-patch.txt

Update fdb patch

2024-03-06 18:25:23 -08:00

FuzzTestDriver.cpp

Add forgotten file

2024-01-30 11:56:26 -08:00

HashTable.cpp

Drain all pending work in hashtable's setOldestVersion

2024-03-05 17:18:58 -08:00

Internal.h

Fix SHOW_MEMORY for gcc and glibc on linux

2024-03-13 16:57:38 -07:00

Jenkinsfile

Set -DNVALGRIND for release artifacts

2024-03-13 20:45:59 -07:00

LICENSE

Add license

2024-02-20 13:22:22 -08:00

linker.map

Test symbol visibility

2024-01-30 10:39:43 -08:00

README.md

Update readme benchmarks

2024-03-07 15:55:31 -08:00

RealDataBench.cpp

Interface change! addWrites now takes a single write version

2024-03-05 16:55:27 -08:00

SkipList.cpp

Interface change! addWrites now takes a single write version

2024-03-05 16:55:27 -08:00

symbols.txt

Interface change! addWrites now takes a single write version

2024-03-05 16:55:27 -08:00

test_symbols.sh

Allow __stack_chk_[a-z]*

2024-03-05 11:37:33 -08:00

TestDriver.cpp

Fix lastLeq bug

2024-02-01 11:24:57 -08:00

README.md

A data structure for optimistic concurrency control on ranges of bitwise-lexicographically-ordered keys.

Intended to replace FoundationDB's skip list.

FoundationDB's benchmark

Skip list

New conflict set: 2.404 sec
                  0.520 Mtransactions/sec
                  2.080 Mkeys/sec
Detect only:      2.266 sec
                  0.552 Mtransactions/sec
                  2.207 Mkeys/sec
Skiplist only:    1.594 sec
                  0.784 Mtransactions/sec
                  3.137 Mkeys/sec
Performance counters:
               Build: 0.071
                 Add: 0.0641
              Detect: 2.27
              D.Sort: 0.44
           D.Combine: 0.018
         D.CheckRead: 0.855
   D.CheckIntraBatch: 0.00903
        D.MergeWrite: 0.739
      D.RemoveBefore: 0.201

Radix tree (this implementation)

New conflict set: 1.743 sec
                  0.717 Mtransactions/sec
                  2.869 Mkeys/sec
Detect only:      1.611 sec
                  0.776 Mtransactions/sec
                  3.103 Mkeys/sec
Skiplist only:    0.919 sec
                  1.360 Mtransactions/sec
                  5.440 Mkeys/sec
Performance counters:
               Build: 0.0657
                 Add: 0.0628
              Detect: 1.61
              D.Sort: 0.442
           D.Combine: 0.0178
         D.CheckRead: 0.395
   D.CheckIntraBatch: 0.00776
        D.MergeWrite: 0.524
      D.RemoveBefore: 0.221

Our benchmark

Skip list

ns/op	op/s	err%	total	benchmark
270.07	3,702,706.03	0.4%	0.01	`point reads`
285.76	3,499,437.03	1.5%	0.01	`prefix reads`
532.54	1,877,794.90	0.7%	0.01	`range reads`
528.50	1,892,132.94	0.7%	0.01	`point writes`
516.53	1,935,978.22	0.9%	0.01	`prefix writes`
303.34	3,296,630.84	3.6%	0.05	`range writes`
502.88	1,988,553.24	2.0%	0.01	`monotonic increasing point writes`

Radix tree (this implementation)

ns/op	op/s	err%	total	benchmark
14.52	68,850,842.99	1.2%	0.01	`point reads`
60.89	16,422,538.22	1.5%	0.01	`prefix reads`
226.89	4,407,362.98	0.5%	0.01	`range reads`
22.99	43,498,198.49	0.2%	0.01	`point writes`
50.51	19,799,864.54	1.0%	0.01	`prefix writes`
82.50	12,121,212.12	2.6%	0.03	`range writes`
119.94	8,337,354.54	2.1%	0.01	`monotonic increasing point writes`

"Real data" test

Point queries only, best of three runs. Gc ratio is the ratio of time spent doing garbage collection to time spent adding writes or doing garbage collection. Lower is better.

skip list

Check: 12.7863 seconds, 292.384 MB/s, Add: 19.8276 seconds, 35.4071 MB/s, Gc ratio: 23.5314%

radix tree

Check: 3.60187 seconds, 1037.94 MB/s, Add: 3.03958 seconds, 230.966 MB/s, Gc ratio: 52.3876%

hash table

(The hash table implementation doesn't work on range queries, and its purpose is to provide an idea of how fast point queries can be)

Check: 2.15925 seconds, 1731.4 MB/s, Add: 1.08519 seconds, 646.926 MB/s, Gc ratio: 52.1526%

Releases 13

v0.0.13 Latest

2024-08-26 21:24:21 +00:00

Languages

C++ 82.4%

TeX 8%

CMake 4.9%

Python 3.3%

Shell 0.9%

Other 0.5%