t1ha

Fast Positive Hash, aka "Позитивный Хэш" by Positive Technologies.

The Future will Positive. Всё будет хорошо.

Briefly, it is a portable 64-bit hash function:

Intended for 64-bit little-endian platforms, predominantly for Elbrus and x86_64, but portable and without penalties it can run on any 64-bit CPU.
In most cases up to 15% faster than City, xxHash, mum-hash, metro-hash, etc. and all others portable hash-functions (which do not use specific hardware tricks).
Provides a set of terraced hash functions.
Currently not suitable for cryptography.
Licensed under Zlib License.

Also pay attention to Erlang and Golang implementations.

Usage

The t1ha library provides several terraced hash functions with the dissimilar properties and for a different cases. These functions briefly described below, see t1ha.h for more API details.

To use in your own project you may link with the t1ha-library, or just add to your project corresponding source files from /src directory.

Please, feel free to fill an issue or make pull request.

`t1ha0` = 64 bits, "Just Only Faster"

Provides fast-as-possible hashing for current CPU, including 32-bit systems and engaging the available hardware acceleration. You can rest assured that t1ha0 faster than all other fast hashes (with comparable quality) so, otherwise we will extend and refine it time-to-time.

On the other hand, without warranty that the hash result will be same for particular key on another machine or another version. Moreover, is deliberately known that the result will be different for systems with different bitness or endianness. Briefly, such hash-results and their derivatives, should be used only in runtime, but should not be persist or transferred over a network.

Also should be noted, the quality of t1ha0() hashing is a subject for tradeoffs with performance. Therefore the quality and strength of t1ha0() may be lower than t1ha1() and t1ha2(), especially on 32-bit targets, but then much faster. However, guaranteed that it passes all SMHasher tests.

Internally t1ha0() selects most faster implementation for current CPU, for now these are includes:

Implementation	Platform/CPU
`t1ha0_ia32aes_avx()`	x86 with AES-NI and AVX extensions
`t1ha0_ia32aes_avx2()`	x86 with AES-NI and AVX2 extensions
`t1ha0_ia32aes_noavx()`	x86 with AES-NI without AVX extensions
`t1ha0_32le()`	32-bit little-endian
`t1h0a_32be()`	32-bit big-endian
`t1ha1_le()`	64-bit little-endian
`t1ha1_be()`	64-bit big-endian

`t1ha1` = 64 bits, baseline fast portable hash

The first version of "Fast Positive Hash" with reasonable quality for checksum, hash tables and thin fingerprinting. It is stable, e.g. returns same result on all architectures and CPUs.

Speed with the reasonable quality of hashing.
Efficiency on modern 64-bit CPUs, but not in a hardware.
Strong as possible, until no penalties on performance.

Unfortunatelly, Yves Orton discovered that t1ha1() fails the strict avalanche criteria in some cases. This flaw is insignificant for the t1ha1() purposes and imperceptible from a practical point of view. However, nowadays this issue has resolved in the next t1ha2() function, that was initially planned to providing a bit more quality.

The basic version of t1ha1() intends for little-endian systems and will run slowly on big-endian. Therefore a dedicated big-endian version is also provided, but returns the different result than the basic version.

`t1ha2` = 64 and 128 bits, slightly more attention for quality and strength

The recommended version of "Fast Positive Hash" with good quality for checksum, hash tables and fingerprinting. It is stable, e.g. returns same result on all architectures and CPUs.

Portable and extremely efficiency on modern 64-bit CPUs.
Great quality of hashing and still faster than other non-t1ha hashes.
Provides streaming mode and 128-bit result.

The t1ha2() is intended for little-endian systems and will run slightly slowly on big-endian systems.

`t1ha3` = 128 and 256 bits, fast non-cryptographic fingerprinting

The next-step version of "Fast Positive Hash", but not yet finished and therefore not available.

Planned: `t1ha4` = 128 bits, fast insecure fingerprinting

Planned: `t1ha5` = 256 bits, fast Cryptographic, but with some limitations

Planned: `t1ha6` = 256 and 512 bits, Cryptographic with reasonable resistance to acceleration on GPU and FPGA.

Planned: `t1ha7` = 256, 512 and 1024 bits, Cryptographic, Strong Post-Quantum

Requirements and Portability:

t1ha designed for modern 64-bit architectures. But on the other hand, t1ha doesn't require instructions specific to a particular architecture:
- therefore t1ha could be used on any CPU for which compiler provides support 64-bit arithmetic.
- but unfortunately t1ha could be dramatically slowly on architectures without native 64-bit operations.
This implementation of t1ha requires modern GNU C compatible compiler, including Clang/LLVM, or Visual Studio 2013/2015/2017. For proper performance please use one of: GNU C 5.5 or later, CLANG 5.0 or later, Microsoft Visual Studio 2017 15.6 or later.

Acknowledgement:

The t1ha was originally developed by Leonid Yuriev (Леонид Юрьев) for The 1Hippeus project - zerocopy messaging in the spirit of Sparta!

Benchmarking and Testing

Current version of t1ha library includes tool for basic testing and benchmarking. Just try make check from t1ha directory.

To comparison benchmark also includes 32- and 64-bit versions of xxhash() function. For example:

$ CC=gcc-7 CXX=g++-7 make all && sudo make check
...
Preparing to benchmarking...
 - suggest enable rdpmc for usermode (echo 2 | sudo tee /sys/devices/cpu/rdpmc)
 - running on CPU#3
 - use RDPMC_perf as clock source for benchmarking
 - assume it cheap and stable
 - measure granularity and overhead: 53 cycle, 0.0188679 iteration/cycle

Bench for tiny keys (7 bytes):
t1ha2_atonce            :     18.109 cycle/hash,  2.587 cycle/byte,  0.387 byte/cycle,  1.160 Gb/s @3GHz
t1ha2_atonce128*        :     36.406 cycle/hash,  5.201 cycle/byte,  0.192 byte/cycle,  0.577 Gb/s @3GHz
t1ha2_stream*           :     84.938 cycle/hash, 12.134 cycle/byte,  0.082 byte/cycle,  0.247 Gb/s @3GHz
t1ha2_stream128*        :    104.062 cycle/hash, 14.866 cycle/byte,  0.067 byte/cycle,  0.202 Gb/s @3GHz
t1ha1_64le              :     19.109 cycle/hash,  2.730 cycle/byte,  0.366 byte/cycle,  1.099 Gb/s @3GHz
t1ha0                   :     15.039 cycle/hash,  2.148 cycle/byte,  0.465 byte/cycle,  1.396 Gb/s @3GHz
xxhash32                :     18.016 cycle/hash,  2.574 cycle/byte,  0.389 byte/cycle,  1.166 Gb/s @3GHz
xxhash64                :     26.094 cycle/hash,  3.728 cycle/byte,  0.268 byte/cycle,  0.805 Gb/s @3GHz
HighwayHash64_pure_c    :    513.000 cycle/hash, 73.286 cycle/byte,  0.014 byte/cycle,  0.041 Gb/s @3GHz
HighwayHash64_portable  :    498.771 cycle/hash, 71.253 cycle/byte,  0.014 byte/cycle,  0.042 Gb/s @3GHz
HighwayHash64_sse41     :     67.062 cycle/hash,  9.580 cycle/byte,  0.104 byte/cycle,  0.313 Gb/s @3GHz
HighwayHash64_avx2      :     59.375 cycle/hash,  8.482 cycle/byte,  0.118 byte/cycle,  0.354 Gb/s @3GHz

Bench for large keys (16384 bytes):
t1ha2_atonce            :   3555.000 cycle/hash,  0.217 cycle/byte,  4.609 byte/cycle, 13.826 Gb/s @3GHz
t1ha2_atonce128*        :   3577.000 cycle/hash,  0.218 cycle/byte,  4.580 byte/cycle, 13.741 Gb/s @3GHz
t1ha2_stream*           :   3716.000 cycle/hash,  0.227 cycle/byte,  4.409 byte/cycle, 13.227 Gb/s @3GHz
t1ha2_stream128*        :   3731.000 cycle/hash,  0.228 cycle/byte,  4.391 byte/cycle, 13.174 Gb/s @3GHz
t1ha1_64le              :   3542.000 cycle/hash,  0.216 cycle/byte,  4.626 byte/cycle, 13.877 Gb/s @3GHz
t1ha0                   :   1306.000 cycle/hash,  0.080 cycle/byte, 12.545 byte/cycle, 37.636 Gb/s @3GHz
xxhash32                :   8201.000 cycle/hash,  0.501 cycle/byte,  1.998 byte/cycle,  5.993 Gb/s @3GHz
xxhash64                :   4118.000 cycle/hash,  0.251 cycle/byte,  3.979 byte/cycle, 11.936 Gb/s @3GHz
HighwayHash64_pure_c    :  49079.201 cycle/hash,  2.996 cycle/byte,  0.334 byte/cycle,  1.001 Gb/s @3GHz
HighwayHash64_portable  :  44486.000 cycle/hash,  2.715 cycle/byte,  0.368 byte/cycle,  1.105 Gb/s @3GHz
HighwayHash64_sse41     :   6419.000 cycle/hash,  0.392 cycle/byte,  2.552 byte/cycle,  7.657 Gb/s @3GHz
HighwayHash64_avx2      :   4265.000 cycle/hash,  0.260 cycle/byte,  3.842 byte/cycle, 11.525 Gb/s @3GHz

The test tool support a set of command line options to selecting functions and size of keys for benchmarking. For more info please run ./test --help.

The `--hash-stdin-strings` option

One noteable option is --hash-stdin-strings, it intended to estimate hash collisions on your custom data. With this option test tool will hash each line from standard input and print its hash to standard output.

For instance, you could count collisions for lines from some words.list file by bash's command:

  ./t1ha/test --hash-stdin-strings < words.list | sort | uniq -c -d | wc -l

More complex example - count xxhash() collisions for lines from words.list and 0...10000 numbers, with distinction only in 32 bit of hash values:

  (cat words.list && seq 0 10000) | \
     ./t1ha/test --xxhash --hash-stdin-strings | \
     cut --bytes=-8 | sort | uniq -c -d | wc -l

SMHasher

SMHasher is a wellknown test suite designed to test the distribution, collision, and performance properties of non-cryptographic hash functions.

Reini Urban provides extended version/fork of SMHasher which integrates a lot of modern hash functions, including t1ha.

So, the quality and speed of t1ha can be easily checked with the following scenario:

git clone https://github.com/rurban/smhasher
cd smhasher
cmake .
make
./SMHasher City64
./SMHasher metrohash64_1
./SMHasher xxHash64
...
./SMHasher t1ha

For properly performance please use at least GCC 5.5, Clang 6.0 or Visual Studio 2017.

Scores

Please take in account that the results is significantly depend on actual CPU, compiler version and CFLAGS. The results below were obtained in 2016 on:

CPU: Intel(R) Core(TM) i7-6700K CPU;
Compiler: gcc version 5.4.0 20160609 (Ubuntu 5.4.0-6ubuntu1~16.04.4);
CFLAGS: -march=native -O3 -fPIC;

The SMALL KEYS case

Order by average Cycles per Hash for 1..31 bytes (less is better).

Function	MiB/Second	Cycles/Hash	Notes (quality, portability)
donothing	15747227.36	6.00	not a hash (just for reference)
sumhash32	43317.86	16.69	not a hash (just for reference)
FNV1a_YoshimitsuTRIAD	13000.49	24.96	poor (100% bias, collisions, distrib)
crc64_hw	7308.06	28.37	poor (insecure, 100% bias, collisions, distrib), non-portable (SSE4.2)
crc32_hw	5577.64	29.10	poor (insecure, 100% bias, collisions, distrib), non-portable (SSE4.2)
NOP_OAAT_read64	1991.31	30.46	poor (100% bias, 2.17x collisions)
Crap8	2743.80	32.50	poor (2.42% bias, collisions, 2% distrib)
t1ha_aes	34636.42	33.03	non-portable (AES-NI)
t1ha	12228.80	35.55
MUM	10246.20	37.25	non-portable (different result, machine specific)
Murmur2	2789.89	38.37	poor (1.7% bias, 81x coll, 1.7% distrib)
t1ha_32le	5958.54	38.54	alien (designed for 32-bit CPU)
t1ha_64be	9321.23	38.29	alien (designed for big-endian CPU)
lookup3	1817.11	39.30	poor (28% bias, collisions, 30% distrib)
t1ha_32be	5873.45	39.81	alien (designed for 32-bit big-endian CPU)
Murmur2C	3655.60	42.68	poor (91% bias, collisions, distrib)
fasthash64	5578.06	43.42
Murmur2A	2789.85	43.38	poor (12.7% bias)
xxHash32	5513.55	43.72
Murmur2B	5578.21	44.13	weak (1.8% bias, collisions, distrib)
fasthash32	5381.46	45.50
cmetrohash64_1_optshort	11808.92	46.33	seems weak (likely cyclic collisions)
metrohash64_2	12113.12	46.88	seems weak (likely cyclic collisions)
cmetrohash64_1	12081.32	47.28	seems weak (likely cyclic collisions)
metrohash64_1	12024.68	47.21	seems weak (likely cyclic collisions)
Murmur3F	5473.62	47.37
superfast	1860.25	47.45	poor (91% bias, 5273.01x collisions, 37% distrib)
cmetrohash64_2	12052.58	48.66
Murmur3A	2232.00	48.16
City32	5014.33	51.13	far to perfect (2 minor collisions)
City64	11041.72	51.77
metrohash64crc_2	20582.76	51.39	seems weak (likely cyclic collisions), non-portable (SSE4.2)
sumhash	9668.13	51.31	not a hash (just for reference)
metrohash64crc_1	21319.23	52.36	weak (cyclic collisions), non-portable (SSE4.2)
PMurHash32	2232.26	53.18
Murmur3C	3719.22	54.05
bernstein	921.43	55.17	poor (100% bias, collisions, distrib)
xxHash64	11123.15	56.17
Spooky32	11464.20	59.45
City128	12551.54	60.93
FarmHash64	12145.36	60.12	non-portable (SSE4.2)
Spooky128	11735.99	60.45	weak (collisions with 4bit diff)
Spooky64	11820.20	60.39
CityCrc128	14821.82	62.38	non-portable (SSE4.2)
MicroOAAT	826.32	62.06	poor (100% bias, distrib)
metrohash128_1	11063.78	66.58	seems weak (likely cyclic collisions)
metrohash128_2	11465.18	66.72	weak (cyclic collisions)
GoodOAAT	930.18	68.24
metrohash128crc_1	21322.80	70.33	seems weak (likely cyclic collisions), non-portable (SSE4.2)
metrohash128crc_2	20990.70	70.40	seems weak (likely cyclic collisions), non-portable (SSE4.2)
farmhash64_c	12033.13	71.30	non-portable (SSE4.2)
sdbm	695.29	71.76	poor (100% bias, collisions, distrib)
FNV1a	684.17	72.75	poor (zeros, 100% bias, collisions, distrib)
FNV64	697.67	72.70	poor (100% bias, collisions, distrib)
FarmHash128	12515.98	77.43	non-portable (SSE4.2)
hasshe2	2587.39	81.23	poor (insecure, 100% bias, collisions, distrib), non-portable (SSE2)
BadHash	558.14	87.87	not a hash (just for reference)
x17	551.99	89.24	poor (99.98% bias, collisions, distrib)
JenkinsOOAT_perl	558.14	95.26	poor (1.5-11.5% bias, 7.2x collisions)
farmhash128_c	12709.06	96.42	non-portable (SSE4.1)
MurmurOAAT	465.12	107.61	poor (collisions, 99.99% distrib)
JenkinsOOAT	558.13	116.75	poor (53.5% bias, collisions, distrib)
falkhash	8909.54	124.48	non-portable (AES-NI)
crc32	342.27	142.06	poor (insecure, 8589.93x collisions, distrib)
SipHash	962.35	147.36
md5_32a	433.03	508.98
sha1_32a	531.44	1222.44

The LARGE KEYS case

Order by hashing speed in Mi-bytes (2^20 = 1048576) per second for 262144-byte block (more is better).

Function	MiB/Second	Cycles/Hash	Notes (quality, portability)
donothing	15747227.36	6.00	not a hash (just for reference)
sumhash32	43317.86	16.69	not a hash (just for reference)
t1ha_aes	34636.42	33.03	non-portable (AES-NI)
metrohash128crc_1	21322.80	70.33	seems weak (likely cyclic collisions), non-portable (SSE4.2)
metrohash64crc_1	21319.23	52.36	seems weak (cyclic collisions), non-portable (SSE4.2)
metrohash128crc_2	20990.70	70.40	seems weak (likely cyclic collisions), non-portable (SSE4.2)
metrohash64crc_2	20582.76	51.39	seems weak (likely cyclic collisions), non-portable (SSE4.2)
CityCrc128	14821.82	62.38	non-portable (SSE4.2)
FNV1a_YoshimitsuTRIAD	13000.49	24.96	poor (100% bias, collisions, distrib)
farmhash128_c	12709.06	96.42	non-portable (SSE4.1)
City128	12551.54	60.93
FarmHash128	12515.98	77.43	non-portable (SSE4.2)
t1ha	12228.80	35.55
FarmHash64	12145.36	60.12	non-portable (SSE4.2)
metrohash64_2	12113.12	46.88	seems weak (likely cyclic collisions)
cmetrohash64_1	12081.32	47.28	seems weak (likely cyclic collisions)
cmetrohash64_2	12052.58	48.66	seems weak (likely cyclic collisions)
farmhash64_c	12033.13	71.30	non-portable (SSE4.2)
metrohash64_1	12024.68	47.21	seems weak (likely cyclic collisions)
Spooky64	11820.20	60.39
cmetrohash64_1_optshort	11808.92	46.33	seems weak (likely cyclic collisions)
Spooky128	11735.99	60.45	weak (collisions with 4-bit diff)
metrohash128_2	11465.18	66.72	weak (cyclic collisions)
Spooky32	11464.20	59.45
xxHash64	11123.15	56.17
metrohash128_1	11063.78	66.58	seems weak (likely cyclic collisions)
City64	11041.72	51.77
MUM	10246.20	37.25	non-portable (different result, machine specific)
sumhash	9668.13	51.31	not a hash (just for reference)
t1ha_64be	9321.23	38.29	alien (designed for big-endian CPU)
falkhash	8909.54	124.48	non-portable (AES-NI)
crc64_hw	7308.06	28.37	poor (insecure, 100% bias, collisions, distrib), non-portable (SSE4.2)
t1ha_32le	5958.54	38.54	alien (designed for 32-bit CPU)
t1ha_32be	5873.45	39.81	alien (designed for 32-bit big-endian CPU)
fasthash64	5578.06	43.42
Murmur2B	5578.21	44.13	weak (1.8% bias, collisions, distrib)
crc32_hw	5577.64	29.10	poor (insecure, 100% bias, collisions, distrib), non-portable (SSE4.2)
xxHash32	5513.55	43.72
Murmur3F	5473.62	47.37
fasthash32	5381.46	45.50
City32	5014.33	51.13	far to perfect (2 minor collisions)
Murmur3C	3719.22	54.05
Murmur2C	3655.60	42.68	poor (91% bias, collisions, distrib)
Murmur2	2789.89	38.37	poor (1.7% bias, 81x coll, 1.7% distrib)
Murmur2A	2789.85	43.38	poor (12.7% bias)
Crap8	2743.80	32.50	poor (2.42% bias, collisions, 2% distrib)
hasshe2	2587.39	81.23	poor (insecure, 100% bias, collisions, distrib), non-portable (SSE2)
Murmur3A	2232.00	48.16
PMurHash32	2232.26	53.18
NOP_OAAT_read64	1991.31	30.46	poor (100% bias, 2.17x collisions)
superfast	1860.25	47.45	poor (91% bias, 5273.01x collisions, 37% distrib)
lookup3	1817.11	39.30	poor (28% bias, collisions, 30% distrib)
SipHash	962.35	147.36
GoodOAAT	930.18	68.24
bernstein	921.43	55.17	poor (100% bias, collisions, distrib)
MicroOAAT	826.32	62.06	poor (100% bias, distrib)
FNV64	697.67	72.70	poor (100% bias, collisions, distrib)
sdbm	695.29	71.76	poor (100% bias, collisions, distrib)
FNV1a	684.17	72.75	poor (zeros, 100% bias, collisions, distrib)
BadHash	558.14	87.87	not a hash (just for reference)
JenkinsOOAT	558.13	116.75	poor (53.5% bias, collisions, distrib)
JenkinsOOAT_perl	558.14	95.26	poor (1.5-11.5% bias, 7.2x collisions)
x17	551.99	89.24	poor (99.98% bias, collisions, distrib)
sha1_32a	531.44	1222.44
MurmurOAAT	465.12	107.61	poor (collisions, 99.99% distrib)
md5_32a	433.03	508.98
crc32	342.27	142.06	poor (insecure, 8589.93x collisions, distrib)

inste / t1ha Goto Github PK

t1ha's Introduction

t1ha

Briefly, it is a portable 64-bit hash function:

Usage

t1ha0 = 64 bits, "Just Only Faster"

t1ha1 = 64 bits, baseline fast portable hash

t1ha2 = 64 and 128 bits, slightly more attention for quality and strength

t1ha3 = 128 and 256 bits, fast non-cryptographic fingerprinting

Planned: t1ha4 = 128 bits, fast insecure fingerprinting

Planned: t1ha5 = 256 bits, fast Cryptographic, but with some limitations

Planned: t1ha6 = 256 and 512 bits, Cryptographic with reasonable resistance to acceleration on GPU and FPGA.

Planned: t1ha7 = 256, 512 and 1024 bits, Cryptographic, Strong Post-Quantum

Requirements and Portability:

Acknowledgement:

Benchmarking and Testing

The --hash-stdin-strings option

SMHasher

Scores

The SMALL KEYS case

The LARGE KEYS case

t1ha's People

Contributors

Watchers

Forkers

Recommend Projects

Recommend Topics

Recommend Org