It looks like GVMLookupForSlot has the same properties as casting - what is cached is relatively fast, so the cache needs to be faster, on the other hand it can have certain tradeoffs like simple/lossy resizing and expiration policies, while providing faster lookups and lock-free add.
It would make sense to try using Casting cache for GVMLookupForSlot.