rocksdb

fork of https://github.com/oxigraph/rocksdb and https://github.com/facebook/rocksdb for nextgraph and oxigraph

History

Peter Dillinger 01a2e20299 Account for DB ID in stress testing block cache keys (#10388 ) Summary: I recently discovered that block cache keys are slightly lower quality than previously thought, because my stress testing tool failed to simulate the effect of DB ID differences. This change updates the tool and gives us data to guide future developments. (No changes to production code here and now.) Nevertheless, the following promise still holds ``` // In fact, if our SST files are all < 4TB (see // BlockBasedTable::kMaxFileSizeStandardEncoding), then SST files generated // in a single process are guaranteed to have unique cache keys, unless/until // number session ids * max file number = 2**86 ... ``` because although different DB IDs could cause collision in file number and offset data, that would have to be using the same DB session (lower) to cause a block cache key collision, which is not possible in the same process. (A session is associated with only one DB ID.) This change fixes cache_bench -stress_cache_key to set and reset DB IDs in a parameterized way to evaluate the effect. Previous results assumed to be representative (using -sck_keep_bits=43): ``` 15 collisions after 15 x 90 days, est 90 days between (1.03763e+20 corrected) ``` or expected collision on a single machine every 104 billion billion days (see "corrected" value). After accounting for DB IDs, test never really changing, intermediate, and very frequently changing (using default -sck_db_count=100): ``` -sck_newdb_nreopen=1000000000: 15 collisions after 2 x 90 days, est 12 days between (1.38351e+19 corrected) -sck_newdb_nreopen=10000: 17 collisions after 2 x 90 days, est 10.5882 days between (1.22074e+19 corrected) -sck_newdb_nreopen=100: 19 collisions after 2 x 90 days, est 9.47368 days between (1.09224e+19 corrected) ``` or roughly 10x more often than previously thought (still extremely if not impossibly rare), and better than random base cache keys (with -sck_randomize), though < 10x better than random: ``` 31 collisions after 1 x 90 days, est 2.90323 days between (3.34719e+18 corrected) ``` If we simply fixed this by ignoring DB ID for cache keys, we would potentially have a shortage of entropy for some cases, such as small file numbers and offsets (e.g. many short-lived processes each using SstFileWriter to create a small file), because existing DB session IDs only provide ~103 bits of entropy. We could upgrade the entropy in DB session IDs to accommodate, but it's not known what all would be affected by changing from 20 digit session IDs to something larger. Instead, my plan is to 1) Move to block cache keys derived from SST unique IDs (so that we can derive block cache keys from manifest data without reading file on storage), and show no significant regression in expected collision rate. 2) Generate better SST unique IDs in format_version=6 (https://github.com/facebook/rocksdb/issues/9058), which should have ~100x lower expected/predicted collision rate based on simulations with this stress test: ``` ./cache_bench -stress_cache_key -sck_keep_bits=39 -sck_newdb_nreopen=100 -sck_footer_unique_id ... 15 collisions after 19 x 90 days, est 114 days between (2.10293e+21 corrected) ``` Pull Request resolved: https://github.com/facebook/rocksdb/pull/10388 Test Plan: no production changes Reviewed By: jay-zhuang Differential Revision: D37986714 Pulled By: pdillinger fbshipit-source-id: e759b2469e3365cb01c6661a69e0ab849ef4c3df		3 years ago
..
cache.cc	Enable SecondaryCache::CreateFromString to create sec cache based on the uri for CompressedSecondaryCache (#10132 )	3 years ago
cache_bench.cc	Add (& fix) some simple source code checks (#8821 )	4 years ago
cache_bench_tool.cc	Account for DB ID in stress testing block cache keys (#10388 )	3 years ago
cache_entry_roles.cc	Charge blob cache usage against the global memory limit (#10321 )	3 years ago
cache_entry_roles.h	Expose `CacheEntryRole` and map keys for block cache stat collections (#9838 )	3 years ago
cache_entry_stats.h	New stable, fixed-length cache keys (#9126 )	4 years ago
cache_helpers.h	Eliminate the copying of blobs when serving reads from the cache (#10297 )	3 years ago
cache_key.cc	Enhance new cache key testing & comments (#9329 )	3 years ago
cache_key.h	Fix key size in cache_bench (#10234 )	3 years ago
cache_reservation_manager.cc	Charge blob cache usage against the global memory limit (#10321 )	3 years ago
cache_reservation_manager.h	Account memory of FileMetaData in global memory limit (#9924 )	3 years ago
cache_reservation_manager_test.cc	Have Cache use Status::MemoryLimit (#10262 )	3 years ago
cache_test.cc	Temporarily return a LRUCache from NewClockCache (#10351 )	3 years ago
charged_cache.cc	Charge blob cache usage against the global memory limit (#10321 )	3 years ago
charged_cache.h	Charge blob cache usage against the global memory limit (#10321 )	3 years ago
clock_cache.cc	Lock-free ClockCache (#10390 )	3 years ago
clock_cache.h	Lock-free ClockCache (#10390 )	3 years ago
compressed_secondary_cache.cc	Add the secondary cache information into LRUCache:: GetPrintableOptions (#10346 )	3 years ago
compressed_secondary_cache.h	Prevent double caching in the compressed secondary cache (#9747 )	3 years ago
compressed_secondary_cache_test.cc	Enable SecondaryCache::CreateFromString to create sec cache based on the uri for CompressedSecondaryCache (#10132 )	3 years ago
fast_lru_cache.cc	Lock-free Lookup and Release in ClockCache (#10347 )	3 years ago
fast_lru_cache.h	Clock cache (#10273 )	3 years ago
lru_cache.cc	Support using secondary cache with the blob cache (#10349 )	3 years ago
lru_cache.h	Charge blob cache usage against the global memory limit (#10321 )	3 years ago
lru_cache_test.cc	Charge blob cache usage against the global memory limit (#10321 )	3 years ago
sharded_cache.cc	Update Cache::Release param from force_erase to erase_if_last_ref (#9728 )	3 years ago
sharded_cache.h	Charge blob cache usage against the global memory limit (#10321 )	3 years ago