问题
I'm trying to figure out if it's possible to build a conformant, efficient implementation of modern C++'s std::unordered_map using techniques like Cuckoo Hashing, Hopscotch Hashing, and Robin Hood Hashing that allow for very compact tables, high load factors, and maintain high performance. What's special about these techniques is that they involve potentially moving some elements around to make room for others, rather than just chaining, or probing until an open slot is found (as in linear or quadratic probing) or.
From insert http://www.cplusplus.com/reference/unordered_map/unordered_map/insert/
Iterator validity On most cases, all iterators in the container remain valid after the insertion. The only exception being when the growth of the container forces a rehash. In this case, all iterators in the container are invalidated.
A rehash is forced if the new container size after the insertion operation would increase above its capacity threshold (calculated as the container's bucket_count multiplied by its max_load_factor).
References to elements in the unordered_map container remain valid in all cases, even after a rehash.
And for erase http://www.cplusplus.com/reference/unordered_map/unordered_map/erase/
Only the iterators and references to the elements removed are invalidated.
The rest are unaffected.
[c++14 only] The relative order of iteration of the elements not removed by the operation is preserved.
The requirement that other references generally remain valid in both operations would seem to require a probing scheme involving evictions to work on a table structure that allocated nodes separate from the array and pointed at them instead. Perhaps the implementation could keep a separate array of elements that entries in the table itself could index into, to avoid extra dynamic allocation, but that still adds extra indirection. Is there a more efficient way to address that requirement?
The requirement that element references remain valid across insert even after a rehash seems to imply that either dynamic node allocation or something like the above indirection array is required even for a chaining or non-moving open addressing design. Is that right?
In general, do the requirement placed on unordered_map by the standard enforce indirection or some other sort of inefficiency in hash table implementation?
来源:https://stackoverflow.com/questions/37428119/does-unordered-map-iterator-reference-invalidation-allow-for-cuckoo-hopscotch