rocksdb

fork of https://github.com/oxigraph/rocksdb and https://github.com/facebook/rocksdb for nextgraph and oxigraph

History

Peter Dillinger f9db0c6e9c Refactor block cache tracing w/improved MultiGet (#11339 ) Summary: After https://github.com/facebook/rocksdb/issues/11301, I wasn't sure whether I had regressed block cache tracing with MultiGet. Demo PR https://github.com/facebook/rocksdb/issues/11330 shows the flawed state of tracing MultiGet before my change, and based on the unit test, there was essentially no change in tracing behavior with https://github.com/facebook/rocksdb/issues/11301. This change is to leave that code and behavior better than I found it. This change is not intended to change any production behaviors except when block cache tracing is active, though might improve general read path efficiency by disabling some related tracking when such tracing is disabled. More detail on production code: * Refactoring to consolidate the construction of BlockCacheTraceRecord, and other related functionality, in block-based table reader, though it's somewhat awkward to preserve an optimization to avoid copying Slices into temporary strings in BlockCacheLookupContext. * Accurately track cache hits and misses (etc.) for each data block accessed by a MultiGet(). (Previously reported hits as misses.) * Reduced repeated checking of `block_cache_tracer_` state (by creating lookup_context only when active) for efficiency and to reduce the risk of corner case bugs where tracing is enabled or disabled for different parts of a read op. (See a TODO below) * Improved estimate calculation for num_keys_in_block (see code comment) Possible follow-up: * `XXX:` use_cache=true means double cache query? (possible double-query of block cache when allow_mmap_reads=true) * `TODO:` need more than one lookup_context here to track individual filter and index partition hits and misses * `TODO:` optimize more state checks of `block_cache_tracer_` down to `lookup_context != nullptr` * Pre-existing `XXX:` There appear to be 'break' statements above that bypass this writing of the block cache trace record * Expand test coverage (see below) Pull Request resolved: https://github.com/facebook/rocksdb/pull/11339 Test Plan: * Added a basic unit test for block cache tracing MultiGet, for now just covering one data block with two keys. * Added HitMissCountingCache to independently verify that the actual block cache trace and expected block cache trace also agree with the actual number of cache hits / misses (nothing missing or mislabeled). For now only used with MultiGet test. * Better testing of num_keys_in_block, for now just with MultiGet * Misc improvements to table_test to improve clarity, such as making it clear that certain keys are auto-inserted at the start of every test. Performance test: Testing multireadrandom as in https://github.com/facebook/rocksdb/issues/11301, except averaging over distinct runs rather than [-X30] which doesn't seem to sufficiently reset after each run to work as an independent test run. Base with revert of 11301: 3148926 ops/sec Base: 3019146 ops/sec New: 2999529 ops/sec Possibly a tiny MultiGet CPU regression with this change. We are now always allocating an additional vector for the LookupContexts. I'm still contemplating options to try to correct the regression in https://github.com/facebook/rocksdb/issues/11301. Testing readrandom: Base with revert of 11301: 2311988 Base: 2281726 New: 2299722 Possibly a tiny Get CPU improvement with this change. We are now avoiding some unnecessary LookupContext population. Reviewed By: akankshamahajan15 Differential Revision: D44557845 Pulled By: pdillinger fbshipit-source-id: b841691799d2a48fb59cc8880dc7cbb1e107ae3d		2 years ago
..
binary_search_index_reader.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
binary_search_index_reader.h	Extend Get/MultiGet deadline support to table open (#6982 )	5 years ago
block.cc	Better support for merge operation with data block hash index (#11356 )	2 years ago
block.h	Put Cache and CacheWrapper in new public header (#11192 )	2 years ago
block_based_table_builder.cc	Remove compressed block cache (#11117 )	2 years ago
block_based_table_builder.h	Major Cache refactoring, CPU efficiency improvement (#10975 )	2 years ago
block_based_table_factory.cc	Change default block cache from 8MB to 32MB (#11350 )	2 years ago
block_based_table_factory.h	Remove RocksDB LITE (#11147 )	2 years ago
block_based_table_iterator.cc	Provide support for direct_reads with async_io (#10197 )	3 years ago
block_based_table_iterator.h	Format files under table/ by clang-format (#10852 )	3 years ago
block_based_table_reader.cc	Refactor block cache tracing w/improved MultiGet (#11339 )	2 years ago
block_based_table_reader.h	Refactor block cache tracing w/improved MultiGet (#11339 )	2 years ago
block_based_table_reader_impl.h	HyperClockCache support for SecondaryCache, with refactoring (#11301 )	2 years ago
block_based_table_reader_sync_and_async.h	Refactor block cache tracing w/improved MultiGet (#11339 )	2 years ago
block_based_table_reader_test.cc	Add a new MultiGetEntity API (#11222 )	2 years ago
block_builder.cc	Format files under table/ by clang-format (#10852 )	3 years ago
block_builder.h	Format files under table/ by clang-format (#10852 )	3 years ago
block_cache.cc	Simplify tracking entries already in SecondaryCache (#11299 )	2 years ago
block_cache.h	Remove compressed block cache (#11117 )	2 years ago
block_prefetcher.cc	Fix stress test failure for async_io (#10660 )	3 years ago
block_prefetcher.h	Provide support for direct_reads with async_io (#10197 )	3 years ago
block_prefix_index.cc	Fix bug with kHashSearch and changing prefix_extractor with SetOptions (#10128 )	3 years ago
block_prefix_index.h	Fix bug with kHashSearch and changing prefix_extractor with SetOptions (#10128 )	3 years ago
block_test.cc	Print stack traces on frozen tests in CI (#10828 )	3 years ago
block_type.h	Remove deprecated block-based filter (#10184 )	3 years ago
cachable_entry.h	HyperClockCache support for SecondaryCache, with refactoring (#11301 )	2 years ago
data_block_footer.cc	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
data_block_footer.h	Replace namespace name "rocksdb" with ROCKSDB_NAMESPACE (#6433 )	5 years ago
data_block_hash_index.cc	Format files under table/ by clang-format (#10852 )	3 years ago
data_block_hash_index.h	Fix build with gcc 13 by including <cstdint> (#11118 )	2 years ago
data_block_hash_index_test.cc	Format files under table/ by clang-format (#10852 )	3 years ago
filter_block.h	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
filter_block_reader_common.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
filter_block_reader_common.h	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
filter_policy.cc	Remove RocksDB LITE (#11147 )	2 years ago
filter_policy_internal.h	Remove deprecated block-based filter (#10184 )	3 years ago
flush_block_policy.cc	Remove FactoryFunc from LoadXXXObject (#11203 )	2 years ago
flush_block_policy.h	Make FlushBlockPolicyFactory into a Customizable class (#8432 )	4 years ago
full_filter_block.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
full_filter_block.h	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
full_filter_block_test.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
hash_index_reader.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
hash_index_reader.h	Extend Get/MultiGet deadline support to table open (#6982 )	5 years ago
index_builder.cc	Make InternalKeyComparator not configurable (#10342 )	3 years ago
index_builder.h	Format files under table/ by clang-format (#10852 )	3 years ago
index_reader_common.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
index_reader_common.h	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
mock_block_based_table.h	Remove deprecated block-based filter (#10184 )	3 years ago
parsed_full_filter_block.cc	Hide FilterBits{Builder,Reader} from public API (#9592 )	3 years ago
parsed_full_filter_block.h	Major Cache refactoring, CPU efficiency improvement (#10975 )	2 years ago
partitioned_filter_block.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
partitioned_filter_block.h	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
partitioned_filter_block_test.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
partitioned_index_iterator.cc	Provide support for direct_reads with async_io (#10197 )	3 years ago
partitioned_index_iterator.h	Format files under table/ by clang-format (#10852 )	3 years ago
partitioned_index_reader.cc	Use user-provided ReadOptions for metadata block reads more often (#11208 )	2 years ago
partitioned_index_reader.h	Meta-internal folly integration with F14FastMap (#9546 )	3 years ago
reader_common.cc	Remove own ToString() (#9955 )	3 years ago
reader_common.h	Put Cache and CacheWrapper in new public header (#11192 )	2 years ago
uncompression_dict_reader.cc	HyperClockCache support for SecondaryCache, with refactoring (#11301 )	2 years ago
uncompression_dict_reader.h	Format files under table/ by clang-format (#10852 )	3 years ago