rocksdb

Fork of https://github.com/oxigraph/rocksdb and https://github.com/facebook/rocksdb for NextGraph and Oxigraph.

Peter Dillinger 5f8f2fda0e Refactor / clean up / optimize FullFilterBitsReader (#5941)

Summary: FullFilterBitsReader, after creating in BloomFilterPolicy, was responsible for decoding metadata bits. This meant that FullFilterBitsReader::MayMatch had some metadata checks in order to implement "always true" or "always false" functionality in the case of inconsistent or trivial metadata. This made for ugly mixing-of-concerns code and probably had some runtime cost. It also didn't really support plugging in alternative filter implementations with extensions to the existing metadata schema.

BloomFilterPolicy::GetFilterBitsReader is now (exclusively) responsible for decoding filter metadata bits and constructing appropriate instances deriving from FilterBitsReader. "Always false" and "always true" derived classes allow FullFilterBitsReader not to be concerned with handling of trivial or inconsistent metadata. This also makes for easy expansion to alternative filter implementations in new, alternative derived classes. This change makes calls to FilterBitsReader::MayMatch necessarily virtual because there's now more than one built-in implementation. Compared with the previous implementation's extra 'if' checks in MayMatch, there's no consistent performance difference, measured by (an older revision of) filter_bench (differences here seem to be within noise):

    Inside queries...
    - Dry run (407) ns/op: 35.9996
    + Dry run (407) ns/op: 35.2034
    - Single filter ns/op: 47.5483
    + Single filter ns/op: 47.4034
    - Batched, prepared ns/op: 43.1559
    + Batched, prepared ns/op: 42.2923
    ...
    - Random filter ns/op: 150.697
    + Random filter ns/op: 149.403
    ----------------------------
    Outside queries...
    - Dry run (980) ns/op: 34.6114
    + Dry run (980) ns/op: 34.0405
    - Single filter ns/op: 56.8326
    + Single filter ns/op: 55.8414
    - Batched, prepared ns/op: 48.2346
    + Batched, prepared ns/op: 47.5667
    - Random filter ns/op: 155.377
    + Random filter ns/op: 153.942
    Average FP rate %: 1.1386

Also, the FullFilterBitsReader ctor was responsible for a surprising amount of CPU in production, due in part to inefficient determination of the CACHE_LINE_SIZE used to construct the filter being read. The overwhelming common case (same as my CACHE_LINE_SIZE) is now substantially optimized, as shown with filter_bench with -new_reader_every=1 (old option - see below) (repeatable result):

    Inside queries...
    - Dry run (453) ns/op: 118.799
    + Dry run (453) ns/op: 105.869
    - Single filter ns/op: 82.5831
    + Single filter ns/op: 74.2509
    ...
    - Random filter ns/op: 224.936
    + Random filter ns/op: 194.833
    ----------------------------
    Outside queries...
    - Dry run (aa1) ns/op: 118.503
    + Dry run (aa1) ns/op: 104.925
    - Single filter ns/op: 90.3023
    + Single filter ns/op: 83.425
    ...
    - Random filter ns/op: 220.455
    + Random filter ns/op: 175.7
    Average FP rate %: 1.13886

However PR#5936 has/will reclaim most of this cost. After that PR, the optimization of this code path is likely negligible, but nonetheless it's clear we aren't making performance any worse.

Also fixed inadequate check of consistency between filter data size and num_lines. (Unit test updated.)

Pull Request resolved: https://github.com/facebook/rocksdb/pull/5941
Test Plan: previously added unit tests FullBloomTest.CorruptFilters and FullBloomTest.RawSchema
Differential Revision: D18018353
Pulled By: pdillinger
fbshipit-source-id: 8e04c2b4a7d93223f49a237fd52ef2483929ed9c

6 years ago
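
The design described in the commit message above (metadata decoded once in a factory, with trivial or inconsistent metadata mapped to "always false" / "always true" readers) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not the RocksDB code: the names SketchReader, ToyBloomReader, MakeReader and BuildToyFilter are hypothetical, the byte layout (filter bits followed by one trailing num_probes byte) is a toy, and the hash is plain FNV-1a rather than the hash RocksDB uses.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical interface for this sketch; not the RocksDB FilterBitsReader API.
class SketchReader {
 public:
  virtual ~SketchReader() = default;
  // May return true for a key that was never added (false positive),
  // but never returns false for a key that was added.
  virtual bool MayMatch(const std::string& key) const = 0;
};

class AlwaysTrueReader : public SketchReader {
 public:
  bool MayMatch(const std::string&) const override { return true; }
};

class AlwaysFalseReader : public SketchReader {
 public:
  bool MayMatch(const std::string&) const override { return false; }
};

// FNV-1a: chosen only to keep the sketch deterministic and dependency-free.
inline uint32_t ToyHash(const std::string& key) {
  uint32_t h = 2166136261u;
  for (unsigned char c : key) {
    h ^= c;
    h *= 16777619u;
  }
  return h;
}

// Bit index for the given probe; shared by the builder and the reader.
inline size_t ProbeBit(uint32_t h, int probe, size_t total_bits) {
  const uint32_t delta = (h >> 17) | (h << 15);  // rotate right by 17
  return (h + static_cast<uint32_t>(probe) * delta) % total_bits;
}

// The "real" reader: no metadata checks left in the hot path.
class ToyBloomReader : public SketchReader {
 public:
  ToyBloomReader(std::vector<uint8_t> bits, int num_probes)
      : bits_(std::move(bits)), num_probes_(num_probes) {}

  bool MayMatch(const std::string& key) const override {
    const uint32_t h = ToyHash(key);
    const size_t total_bits = bits_.size() * 8;
    for (int i = 0; i < num_probes_; ++i) {
      const size_t bit = ProbeBit(h, i, total_bits);
      if ((bits_[bit / 8] & (1u << (bit % 8))) == 0) return false;
    }
    return true;
  }

 private:
  std::vector<uint8_t> bits_;
  int num_probes_;
};

// Factory: the only place that decodes metadata (toy layout, an assumption).
std::unique_ptr<SketchReader> MakeReader(const std::vector<uint8_t>& raw) {
  // No filter bits at all: treat as a trivial filter that matches nothing.
  if (raw.size() <= 1) return std::make_unique<AlwaysFalseReader>();
  const int num_probes = raw.back();
  // Implausible metadata: fail safe by never filtering anything out.
  if (num_probes == 0 || num_probes > 30) {
    return std::make_unique<AlwaysTrueReader>();
  }
  return std::make_unique<ToyBloomReader>(
      std::vector<uint8_t>(raw.begin(), raw.end() - 1), num_probes);
}

// Test helper: builds a toy filter in the layout MakeReader expects.
std::vector<uint8_t> BuildToyFilter(const std::vector<std::string>& keys,
                                    size_t num_bytes, uint8_t num_probes) {
  assert(num_bytes > 0);
  std::vector<uint8_t> raw(num_bytes, 0);
  for (const auto& key : keys) {
    const uint32_t h = ToyHash(key);
    for (int i = 0; i < num_probes; ++i) {
      const size_t bit = ProbeBit(h, i, num_bytes * 8);
      raw[bit / 8] |= static_cast<uint8_t>(1u << (bit % 8));
    }
  }
  raw.push_back(num_probes);  // trailing metadata byte
  return raw;
}
```

In this sketch a caller holds only a SketchReader pointer and calls MayMatch through it, so the call is virtual, which is the trade-off the commit message mentions. The thresholds used to pick the trivial readers are illustrative and are not taken from bloom.cc.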
aligned_buffer.h	Document AlignedBuffer (#5345)	6 years ago
autovector.h	Use placement new and delete in autovector (#5080)	7 years ago
autovector_test.cc	Move some memory related files from util/ to memory/ (#5382)	6 years ago
bloom.cc	Refactor / clean up / optimize FullFilterBitsReader (#5941)	6 years ago
bloom_impl.h	Apply formatter on recent 45 commits. (#5827)	6 years ago
bloom_test.cc	Refactor / clean up / optimize FullFilterBitsReader (#5941)	6 years ago
build_version.cc.in	Add copyright headers per FB open-source checkup tool. (#5199)	7 years ago
build_version.h	Change RocksDB License	8 years ago
cast_util.h	Add a missing "once" in .h	8 years ago
channel.h	Support pragma once in all header files and cleanup some warnings (#4339)	7 years ago
coding.cc	Enable MSVC W4 with a few exceptions. Fix warnings and bugs	8 years ago
coding.h	Avoid user key copying for Get/Put/Write with user-timestamp (#5502)	6 years ago
coding_test.cc	Move test related files under util/ to test_util/ (#5377)	6 years ago
compaction_job_stats_impl.cc	Refresh snapshot list during long compactions (2nd attempt) (#5278)	7 years ago
comparator.cc	Add support for timestamp in Get/Put (#5079)	6 years ago
compression.h	Revert to storing UncompressionDicts in the cache (#5645)	6 years ago
compression_context_cache.cc	run make format for PR 3838 (#3954)	7 years ago
compression_context_cache.h	run make format for PR 3838 (#3954)	7 years ago
concurrent_task_limiter_impl.cc	Compaction limiter miscs (#4795)	7 years ago
concurrent_task_limiter_impl.h	Apply formatter on recent 45 commits. (#5827)	6 years ago
core_local.h	Change RocksDB License	8 years ago
crc32c.cc	Cleanup the Arm64 CRC32 unused warning (#5565)	6 years ago
crc32c.h	Updated CRC32 Power Optimization Changes	8 years ago
crc32c_arm64.cc	Apply formatter to recent 200+ commits. (#5830)	6 years ago
crc32c_arm64.h	Apply formatter to recent 200+ commits. (#5830)	6 years ago
crc32c_ppc.c	C file should not include <cinttypes>, it is a C++ header. (#5499)	6 years ago
crc32c_ppc.h	Remove PATENTS text from a few straggler files (#5326)	6 years ago
crc32c_ppc_asm.S	Remove PATENTS text from a few straggler files (#5326)	6 years ago
crc32c_ppc_constants.h	Remove PATENTS text from a few straggler files (#5326)	6 years ago
crc32c_test.cc	Move test related files under util/ to test_util/ (#5377)	6 years ago
duplicate_detector.h	simplify include directive involving inttypes (#5402)	6 years ago
dynamic_bloom.cc	Apply formatter to recent 200+ commits. (#5830)	6 years ago
dynamic_bloom.h	MultiGet batching in memtable (#5818)	6 years ago
dynamic_bloom_test.cc	Apply formatter to recent 200+ commits. (#5830)	6 years ago
file_reader_writer_test.cc	Divide file_reader_writer.h and .cc (#5803)	6 years ago
filelock_test.cc	Move some memory related files from util/ to memory/ (#5382)	6 years ago
filter_bench.cc	Fix some implicit conversions in filter_bench (#5894)	6 years ago
filter_policy.cc	Change RocksDB License	8 years ago
gflags_compat.h	filter_bench - a prelim tool for SST filter benchmarking (#5825)	6 years ago
hash.cc	Add GCC 8 to Travis (#3433)	7 years ago
hash.h	Faster new DynamicBloom implementation (for memtable) (#5762)	6 years ago
hash_map.h	Change RocksDB License	8 years ago
hash_test.cc	Apply formatter to recent 200+ commits. (#5830)	6 years ago
heap.h	Add compaction logic to RangeDelAggregatorV2 (#4758)	7 years ago
heap_test.cc	fix gflags namespace	8 years ago
kv_map.h	Consolidate hash function used for non-persistent data in a new function (#5155)	7 years ago
log_write_bench.cc	Divide file_reader_writer.h and .cc (#5803)	6 years ago
murmurhash.cc	Add GCC 8 to Travis (#3433)	7 years ago
murmurhash.h	Change RocksDB License	8 years ago
mutexlock.h	Apply formatter on recent 45 commits. (#5827)	6 years ago
ppc-opcode.h	Remove PATENTS text from a few straggler files (#5326)	6 years ago
random.cc	Change RocksDB License	8 years ago
random.h	Fix some implicit conversions in filter_bench (#5894)	6 years ago
rate_limiter.cc	Move some memory related files from util/ to memory/ (#5382)	6 years ago
rate_limiter.h	rate limit auto-tuning	8 years ago
rate_limiter_test.cc	Apply formatter to recent 200+ commits. (#5830)	6 years ago
repeatable_thread.h	Move test related files under util/ to test_util/ (#5377)	6 years ago
repeatable_thread_test.cc	Move some memory related files from util/ to memory/ (#5382)	6 years ago
set_comparator.h	WritePrepared Txn: Move DuplicateDetector to util	8 years ago
slice.cc	Apply modernize-use-override (2nd iteration)	7 years ago
slice_transform_test.cc	Move test related files under util/ to test_util/ (#5377)	6 years ago
status.cc	Allow users to stop manual compactions (#3971)	6 years ago
stderr_logger.h	Change RocksDB License	8 years ago
stop_watch.h	Make statistics's stats_level change thread-safe (#5030)	7 years ago
string_util.cc	Refactor trimming logic for immutable memtables (#5022)	6 years ago
string_util.h	Refactor trimming logic for immutable memtables (#5022)	6 years ago
thread_list_test.cc	Move test related files under util/ to test_util/ (#5377)	6 years ago
thread_local.cc	Enable building of ARM32 (#4349)	7 years ago
thread_local.h	Provide a way to override windows memory allocator with jemalloc for ZSTD	7 years ago
thread_local_test.cc	Move some memory related files from util/ to memory/ (#5382)	6 years ago
thread_operation.h	Add inline comments to flush job (#4464)	7 years ago
threadpool_imp.cc	Apply formatter to recent 200+ commits. (#5830)	6 years ago
threadpool_imp.h	Support lowering CPU priority of background threads	8 years ago
timer_queue.h	Move test related files under util/ to test_util/ (#5377)	6 years ago
timer_queue_test.cc	Change RocksDB License	8 years ago
user_comparator_wrapper.h	Fix perf_context.user_key_comparison_count for range scan (#5098)	7 years ago
util.h	Add GCC 8 to Travis (#3433)	7 years ago
vector_iterator.h	Make clang-analyzer happy (#5821)	6 years ago
xxhash.cc	Add copyright headers per FB open-source checkup tool. (#5199)	7 years ago
xxhash.h	Add copyright headers per FB open-source checkup tool. (#5199)	7 years ago