rocksdb

fork of https://github.com/oxigraph/rocksdb and https://github.com/facebook/rocksdb for nextgraph and oxigraph

History

Peter Dillinger 239d17a19c Support optimize_filters_for_memory for Ribbon filter (#7774 ) Summary: Primarily this change refactors the optimize_filters_for_memory code for Bloom filters, based on malloc_usable_size, to also work for Ribbon filters. This change also replaces the somewhat slow but general BuiltinFilterBitsBuilder::ApproximateNumEntries with implementation-specific versions for Ribbon (new) and Legacy Bloom (based on a recently deleted version). The reason is to emphasize speed in ApproximateNumEntries rather than 100% accuracy. Justification: ApproximateNumEntries (formerly CalculateNumEntry) is only used by RocksDB for range-partitioned filters, called each time we start to construct one. (In theory, it should be possible to reuse the estimate, but the abstractions provided by FilterPolicy don't really make that workable.) But this is only used as a heuristic estimate for hitting a desired partitioned filter size because of alignment to data blocks, which have various numbers of unique keys or prefixes. The two factors lead us to prioritize reasonable speed over 100% accuracy. optimize_filters_for_memory adds extra complication, because precisely calculating num_entries for some allowed number of bytes depends on state with optimize_filters_for_memory enabled. And the allocator-agnostic implementation of optimize_filters_for_memory, using malloc_usable_size, means we would have to actually allocate memory, many times, just to precisely determine how many entries (keys) could be added and stay below some size budget, for the current state. (In a draft, I got this working, and then realized the balance of speed vs. accuracy was all wrong.) So related to that, I have made CalculateSpace, an internal-only API only used for testing, non-authoritative also if optimize_filters_for_memory is enabled. This simplifies some code. Pull Request resolved: https://github.com/facebook/rocksdb/pull/7774 Test Plan: unit test updated, and for FilterSize test, range of tested values is greatly expanded (still super fast) Also tested `db_bench -benchmarks=fillrandom,stats -bloom_bits=10 -num=1000000 -partition_index_and_filters -format_version=5 [-optimize_filters_for_memory] [-use_ribbon_filter]` with temporary debug output of generated filter sizes. Bloom+optimize_filters_for_memory: 1 Filter size: 197 (224 in memory) 134 Filter size: 3525 (3584 in memory) 107 Filter size: 4037 (4096 in memory) Total on disk: 904,506 Total in memory: 918,752 Ribbon+optimize_filters_for_memory: 1 Filter size: 3061 (3072 in memory) 110 Filter size: 3573 (3584 in memory) 58 Filter size: 4085 (4096 in memory) Total on disk: 633,021 (-30.0%) Total in memory: 634,880 (-30.9%) Bloom (no offm): 1 Filter size: 261 (320 in memory) 1 Filter size: 3333 (3584 in memory) 240 Filter size: 3717 (4096 in memory) Total on disk: 895,674 (-1% on disk vs. +offm; known tolerable overhead of offm) Total in memory: 986,944 (+7.4% vs. +offm) Ribbon (no offm): 1 Filter size: 2949 (3072 in memory) 1 Filter size: 3381 (3584 in memory) 167 Filter size: 3701 (4096 in memory) Total on disk: 624,397 (-30.3% vs. Bloom) Total in memory: 690,688 (-30.0% vs. Bloom) Note that optimize_filters_for_memory is even more effective for Ribbon filter than for cache-local Bloom, because it can close the unused memory gap even tighter than Bloom filter, because of 16 byte increments for Ribbon vs. 64 byte increments for Bloom. Reviewed By: jay-zhuang Differential Revision: D25592970 Pulled By: pdillinger fbshipit-source-id: 606fdaa025bb790d7e9c21601e8ea86e10541912		4 years ago
..
binary_search_index_reader.cc	Separate internal and user key comparators in `BlockIter` (#6944 )	5 years ago
binary_search_index_reader.h	Extend Get/MultiGet deadline support to table open (#6982 )	5 years ago
block.cc	Remove unused includes (#7604 )	4 years ago
block.h	Add EnvTestWithParam::OptionsTest to the ASSERT_STATUS_CHECKED passes (#7283 )	5 years ago
block_based_filter_block.cc	Exclude timestamp from prefix extractor (#7668 )	4 years ago
block_based_filter_block.h	Exclude timestamp from prefix extractor (#7668 )	4 years ago
block_based_filter_block_test.cc	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
block_based_table_builder.cc	Use size_t for filter APIs, protect against overflow (#7726 )	4 years ago
block_based_table_builder.h	Make parallel compression optimization code tidier (#6888 )	4 years ago
block_based_table_factory.cc	Fix the logic of setting read_amp_bytes_per_bit from OPTIONS file (#7680 )	4 years ago
block_based_table_factory.h	Create a Customizable class to load classes and configurations (#6590 )	4 years ago
block_based_table_iterator.cc	Clean up InternalIterator upper bound logic a little bit (#7200 )	5 years ago
block_based_table_iterator.h	Exclude timestamp from prefix extractor (#7668 )	4 years ago
block_based_table_reader.cc	Ensure that MultiGet works properly with compressed cache (#7756 )	4 years ago
block_based_table_reader.h	Add sst_file_dumper status check (#7315 )	5 years ago
block_based_table_reader_impl.h	Divide block_based_table_reader.cc (#6527 )	5 years ago
block_based_table_reader_test.cc	Fix many tests to run with MEM_ENV and ENCRYPTED_ENV; Introduce a MemoryFileSystem class (#7566 )	4 years ago
block_builder.cc	Add pipelined & parallel compression optimization (#6262 )	5 years ago
block_builder.h	Add pipelined & parallel compression optimization (#6262 )	5 years ago
block_prefetcher.cc	Add buffer prefetch support for non directIO usecase (#7312 )	5 years ago
block_prefetcher.h	De-template block based table iterator (#6531 )	5 years ago
block_prefix_index.cc	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
block_prefix_index.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
block_test.cc	More Makefile Cleanup (#7097 )	5 years ago
block_type.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
cachable_entry.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
data_block_footer.cc	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
data_block_footer.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
data_block_hash_index.cc	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
data_block_hash_index.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
data_block_hash_index_test.cc	More Makefile Cleanup (#7097 )	5 years ago
filter_block.h	Exclude timestamp from prefix extractor (#7668 )	4 years ago
filter_block_reader_common.cc	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
filter_block_reader_common.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
filter_policy.cc	Support optimize_filters_for_memory for Ribbon filter (#7774 )	4 years ago
filter_policy_internal.h	Support optimize_filters_for_memory for Ribbon filter (#7774 )	4 years ago
flush_block_policy.cc	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
flush_block_policy.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
full_filter_block.cc	Exclude timestamp from prefix extractor (#7668 )	4 years ago
full_filter_block.h	Exclude timestamp from prefix extractor (#7668 )	4 years ago
full_filter_block_test.cc	Use size_t for filter APIs, protect against overflow (#7726 )	4 years ago
hash_index_reader.cc	Separate internal and user key comparators in `BlockIter` (#6944 )	5 years ago
hash_index_reader.h	Extend Get/MultiGet deadline support to table open (#6982 )	5 years ago
index_builder.cc	Move break into block (#7468 )	5 years ago
index_builder.h	Make db_basic_test pass assert status checked (#7452 )	5 years ago
index_reader_common.cc	Divide block_based_table_reader.cc (#6527 )	5 years ago
index_reader_common.h	Divide block_based_table_reader.cc (#6527 )	5 years ago
mock_block_based_table.h	For ApproximateSizes, pro-rate table metadata size over data blocks (#6784 )	5 years ago
parsed_full_filter_block.cc	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
parsed_full_filter_block.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
partitioned_filter_block.cc	Use size_t for filter APIs, protect against overflow (#7726 )	4 years ago
partitioned_filter_block.h	Return error if Get/Multi() fails in Prefetching Filter blocks (#7543 )	5 years ago
partitioned_filter_block_test.cc	Remove unused includes (#7604 )	4 years ago
partitioned_index_iterator.cc	Fix misspelling of PartitionedIndexIterator (#7450 )	5 years ago
partitioned_index_iterator.h	Fix misspelling of PartitionedIndexIterator (#7450 )	5 years ago
partitioned_index_reader.cc	Redesign block cache pinning API (#7520 )	5 years ago
partitioned_index_reader.h	Get() to fail with underlying failures in PartitionIndexReader::CacheDependencies() (#7297 )	5 years ago
reader_common.cc	Fix block checksum for >=4GB, refactor (#6978 )	5 years ago
reader_common.h	Bring the Configurable options together (#5753 )	5 years ago
uncompression_dict_reader.cc	Extend Get/MultiGet deadline support to table open (#6982 )	5 years ago
uncompression_dict_reader.h	Extend Get/MultiGet deadline support to table open (#6982 )	5 years ago