
Implement the 64-bit variant of xxHash. #17656


Closed · wants to merge 1 commit

Conversation

@ghost commented Sep 30, 2014

Matches C in speed, but it looks like SipHash was not the only bottleneck for #11783.

Tests include the official data and the relevant ones from SipHash.
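
For reference, a hedged sketch of what such a test can look like (`xxh64` as a one-shot helper is hypothetical, not necessarily this PR's API; the expected value is the widely published xxHash64 digest of the empty input with seed 0):

#[test]
fn matches_official_empty_vector() {
    // XXH64("", seed = 0) per the reference implementation's published vectors
    assert_eq!(xxh64(b"", 0), 0xEF46_DB37_51D8_E999);
}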

@rust-highfive (Contributor)

Warning

  • These commits modify unsafe code. Please review it carefully!

@thestinger (Contributor)

> For security, I'd say it could replace SipHash wholesale, as per this post.

That post doesn't go into any in-depth cryptanalysis. Security against DoS attacks requires a lot more than just good results on statistical tests and avalanche effect resistance.

@ghost (Author) commented Sep 30, 2014

Oh well. At least it's fast :)

@thestinger (Contributor)

Can this replace the FNV implementation that's currently used in rustc?

@ghost (Author) commented Sep 30, 2014

@thestinger: If used with large enough chunks (>32 bytes), it can beat anything. With small chunks it degrades the same way SipHash does, but I think that's inevitable for a decent hash.

    state.digest()
}

pub struct XXState {
Contributor (review comment):

nit: I think the convention is XxState

@pczarn (Contributor) commented Sep 30, 2014

This implementation has quite a bit of unsafe code. SipHash's doesn't have any.

Have you checked the C implementation's speed? I think it should be faster than FNV for 8-byte or larger chunks when inlined. I've measured my implementation:

test xxhash::rust::chunks_8_xxh64   ... bench:     76874 ns/iter (+/- 603) = 852 MB/s
test xxhash::rust::chunks_15_xxh64  ... bench:     56387 ns/iter (+/- 681) = 1162 MB/s
test xxhash::rust::chunks_32_xxh64  ... bench:     26776 ns/iter (+/- 121) = 2447 MB/s
test xxhash::rust::chunks_128_xxh64 ... bench:     14792 ns/iter (+/- 263) = 4430 MB/s
test xxhash::rust::chunks_256_xxh64 ... bench:     12603 ns/iter (+/- 354) = 5200 MB/s

test xxhash::c::chunks_8_xxh64      ... bench:     62287 ns/iter (+/- 441) = 1052 MB/s
test xxhash::c::chunks_15_xxh64     ... bench:     43615 ns/iter (+/- 459) = 1502 MB/s
test xxhash::c::chunks_64_xxh64     ... bench:     11531 ns/iter (+/- 109) = 5683 MB/s
test xxhash::c::chunks_128_xxh64    ... bench:      9555 ns/iter (+/- 98) = 6858 MB/s
test xxhash::c::chunks_256_xxh64    ... bench:      8516 ns/iter (+/- 57) = 7695 MB/s
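
For context, here is a hedged sketch of how benchmarks like these are typically written with Rust's nightly test crate; setting b.bytes is what turns ns/iter into the MB/s figures above. The xxh64 body below is a throwaway stand-in, not this PR's implementation:

#![feature(test)]
extern crate test;

use test::{black_box, Bencher};

// Stand-in for the hash under test; swap in the real xxh64.
fn xxh64(data: &[u8], seed: u64) -> u64 {
    data.iter().fold(seed, |h, &b| (h ^ b as u64).wrapping_mul(0x100_0000_01b3))
}

#[bench]
fn chunks_64_xxh64(b: &mut Bencher) {
    let buf = vec![0u8; 64 * 1024];
    b.bytes = buf.len() as u64; // lets libtest report throughput in MB/s
    b.iter(|| {
        // hash the buffer in fixed-size chunks, as the benchmark names suggest
        for chunk in buf.chunks(64) {
            black_box(xxh64(chunk, 0));
        }
    });
}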

What makes you think a hash can't be fast? xxHash has two parts at its core:

  • the inner loop that consumes 32 bytes at a time (sketched after this list)
  • the finalizer
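
A hedged sketch of that inner-loop round, assuming the standard xxHash64 constants and round function (the names here are illustrative, not this PR's):

const PRIME1: u64 = 0x9E37_79B1_85EB_CA87;
const PRIME2: u64 = 0xC2B2_AE3D_27D4_EB4F;

// One round: fold an 8-byte lane into an accumulator.
fn round(acc: u64, lane: u64) -> u64 {
    acc.wrapping_add(lane.wrapping_mul(PRIME2))
        .rotate_left(31)
        .wrapping_mul(PRIME1)
}

// The inner loop keeps four independent accumulators, one per 8-byte lane
// of each 32-byte stripe, so the multiplies can pipeline.
fn consume_stripe(v: &mut [u64; 4], stripe: &[u8; 32]) {
    for (i, lane) in stripe.chunks_exact(8).enumerate() {
        v[i] = round(v[i], u64::from_le_bytes(lane.try_into().unwrap()));
    }
}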

For 8-byte chunks, it does only the following:

// small-input path for an 8-byte chunk: one mixing round, then the finalizer
let total_len: u64 = 8;
let mut h64 = self.seed.wrapping_add(PRIME5).wrapping_add(total_len);

let mut k1: u64 = source.wrapping_mul(PRIME2);
k1 = rotl64(k1, 31);
k1 = k1.wrapping_mul(PRIME1);
h64 ^= k1;
h64 = rotl64(h64, 27).wrapping_mul(PRIME1).wrapping_add(PRIME4);

// the avalanche finalizer
h64 ^= h64 >> 33;
h64 = h64.wrapping_mul(PRIME2);
h64 ^= h64 >> 29;
h64 = h64.wrapping_mul(PRIME3);
h64 ^= h64 >> 32;
return h64;

However, the finalizer alone is responsible for avalanche properties and can be used with chunks that have exactly 8 bytes:

let mut hash = source;
hash ^= hash >> 33;
hash = hash.wrapping_mul(PRIME2);
hash ^= hash >> 29;
hash = hash.wrapping_mul(PRIME3);
hash ^= hash >> 32;
return hash;
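
For concreteness, here is a self-contained version of that finalizer with the standard xxHash64 prime constants filled in, plus a quick avalanche check (a rough sketch, not part of this PR):

const PRIME2: u64 = 0xC2B2_AE3D_27D4_EB4F;
const PRIME3: u64 = 0x1656_67B1_9E37_79F9;

fn avalanche(mut hash: u64) -> u64 {
    hash ^= hash >> 33;
    hash = hash.wrapping_mul(PRIME2);
    hash ^= hash >> 29;
    hash = hash.wrapping_mul(PRIME3);
    hash ^= hash >> 32;
    hash
}

fn main() {
    // inputs differing in one bit should differ in ~32 of 64 output bits
    let (a, b) = (avalanche(0x1), avalanche(0x3));
    println!("{} output bits differ", (a ^ b).count_ones());
}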


impl XXHasher {
    pub fn new() -> XXHasher { #![inline]
        XXHasher::new_with_seed(18446744073709551557u64)
Contributor (review comment):

Where did this seed come from?

Author (reply):

It's a large prime. This should be #[cfg(test)] I think, but I wanted to be consistent with SipHash which has new_with_keys(0, 0) here.

For real uses, the seeds should be randomized, which is done with RandomSipHasher in libstd.
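
A hypothetical sketch of that randomized seeding, mirroring what RandomSipHasher does for SipHash: draw the seed once from a thread-local RNG instead of hard-coding it. XXHasher and new_with_seed are from this PR; the rand API shown is the modern crate, not 2014's std::rand:

use rand::Rng;

pub struct RandomXxHasher {
    seed: u64,
}

impl RandomXxHasher {
    pub fn new() -> RandomXxHasher {
        // one random seed per hasher factory
        RandomXxHasher { seed: rand::thread_rng().gen() }
    }

    pub fn hasher(&self) -> XXHasher {
        XXHasher::new_with_seed(self.seed)
    }
}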

@ghost (Author) commented Oct 1, 2014

@pczarn: It's not the hash that's slow, but Rust's handling of it. Hash is inefficient because it needs to be stable (i.e. a.hash() == a.hash(), regardless of where it was computed), and that forces slow paths far too often. I've opened a topic on this.
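
To illustrate the slow-path point (my sketch, not from the thread): a derived Hash impl feeds the hasher one small write per field, so a stripe-oriented hash like xxHash64 rarely sees a full 32-byte chunk and keeps taking its small-input tail:

use std::hash::{Hash, Hasher};

#[derive(Hash)]
struct Key {
    id: u32,    // the derived impl emits one 4-byte write for this field
    stamp: u64, // and a separate 8-byte write for this one
}

// A hasher that just counts writes makes the fragmentation visible.
struct CountingHasher(usize);

impl Hasher for CountingHasher {
    fn write(&mut self, bytes: &[u8]) {
        self.0 += 1;
        println!("write of {} bytes", bytes.len());
    }
    fn finish(&self) -> u64 { 0 }
}

fn main() {
    let mut h = CountingHasher(0);
    Key { id: 1, stamp: 2 }.hash(&mut h);
    println!("{} separate writes, never a 32-byte stripe", h.0);
}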

@arthurprs (Contributor)

I've expressed this concern a couple of times already. I think we're taking security too far in the hashmap implementation. Most programming languages use a per-process seed (for SipHash or otherwise); Rust has a seed per HashMap AND a slow, overly-secure hash as the default.
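
A sketch of the per-process alternative described here: compute one random seed at first use and share it across all maps, instead of drawing a fresh seed per HashMap. OnceLock is the modern std tool for this (a 2014 equivalent would have been a lazy static), and the time-based value is only an illustrative stand-in for an OS entropy source:

use std::sync::OnceLock;
use std::time::{SystemTime, UNIX_EPOCH};

fn process_seed() -> u64 {
    static SEED: OnceLock<u64> = OnceLock::new();
    *SEED.get_or_init(|| {
        // illustrative stand-in for an OS entropy source
        SystemTime::now().duration_since(UNIX_EPOCH).unwrap().subsec_nanos() as u64
    })
}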

@vks (Contributor) commented Oct 3, 2014

> a slow overly-secure hash as the default

This is a better default than an insecure but fast hash. If you care about performance and not about security, you can opt in (see the sketch below). If you don't care about performance, you should use the secure version.
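
A sketch of that opt-in: std's HashMap takes a pluggable BuildHasher, so a program that doesn't need DoS resistance can swap in a faster hash. The FnvHasher here is a minimal FNV-1a written inline for illustration, not a published crate API:

use std::collections::HashMap;
use std::hash::{BuildHasherDefault, Hasher};

struct FnvHasher(u64);

impl Default for FnvHasher {
    // FNV-1a 64-bit offset basis
    fn default() -> FnvHasher { FnvHasher(0xcbf2_9ce4_8422_2325) }
}

impl Hasher for FnvHasher {
    fn write(&mut self, bytes: &[u8]) {
        for &b in bytes {
            self.0 = (self.0 ^ b as u64).wrapping_mul(0x100_0000_01b3);
        }
    }
    fn finish(&self) -> u64 { self.0 }
}

fn main() {
    // opt in by naming a different BuildHasher in the map's type
    let mut map: HashMap<&str, u32, BuildHasherDefault<FnvHasher>> = HashMap::default();
    map.insert("fast", 1);
    assert_eq!(map["fast"], 1);
}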

@arthurprs (Contributor)

Did you guys see http://discuss.rust-lang.org/t/unstable-hash-architecture/578 ? It's something worth discussing.

@alexcrichton (Member)

Unfortunately, deleting the source repository causes bors to get stuck. @Jurily, would you mind reopening? Thanks!

lnicola pushed a commit to lnicola/rust that referenced this pull request Jul 28, 2024
fix: Allow flyimport to import primitive shadowing modules

Fixes rust-lang/rust-analyzer#16371
RalfJung pushed a commit to RalfJung/rust that referenced this pull request Aug 1, 2024
fix: Allow flyimport to import primitive shadowing modules

Fixes rust-lang/rust-analyzer#16371