summaryrefslogtreecommitdiff
AgeCommit message (Collapse)AuthorFilesLines
2024-05-04sha3.c: neon: add tests, improve performance (still too slow, see full ↵Paul Duncan1-198/+217
commit message) scalar (odroid n2l): pabs@pizza:~/git/sha3/tests/bench> ./bench 1000 info: cpucycles: version=20240318 implementation=arm64-vct persecond=1800000000 info: backend=scalar num_trials=1000 src_lens=64,256,1024,4096,16384 dst_lens=32 function,dst_len,64,256,1024,4096,16384 sha3_224,28,34.0,16.4,15.5,14.0,13.7 sha3_256,32,34.0,16.1,15.4,14.8,14.4 sha3_384,48,34.0,23.4,19.0,18.8,18.6 sha3_512,64,34.0,30.8,28.1,26.5,26.5 shake128,32,34.0,16.1,13.6,12.1,11.8 shake256,32,34.0,16.1,15.5,14.8,14.4 neon (odroid n2l): pabs@pizza:~/git/sha3/tests/bench> ./bench 1000 info: cpucycles: version=20240318 implementation=arm64-vct persecond=1800000000 info: backend=neon num_trials=1000 src_lens=64,256,1024,4096,16384 dst_lens=32 function,dst_len,64,256,1024,4096,16384 sha3_224,28,65.6,32.5,31.8,28.7,28.2 sha3_256,32,65.6,32.5,31.9,30.8,30.0 sha3_384,48,65.6,48.0,39.7,39.5,39.0 sha3_512,64,68.0,63.9,59.1,56.0,55.9 shake128,32,65.6,32.5,28.4,25.4,24.8 shake256,32,65.6,32.5,31.6,30.5,29.7
2024-05-04tests/bench/Makefile: add commented CFLAGS with scalar backendPaul Duncan1-0/+1
2024-05-03sha3.c: add missing RHO_IDSPaul Duncan1-0/+9
2024-05-03sha3.c: add neon backendPaul Duncan1-9/+305
2024-05-03add tests/neonPaul Duncan3-0/+989
2024-05-03sha3.c: refactor backends so they only implement permute_n()Paul Duncan1-40/+21
i verified that (gcc, at least) does constant propagation and inlines permute_n_<backend> and that this change does not affect performance. bench results, before: pabs@flex:~/git/sha3/tests/bench> ./bench info: cpucycles: version=20240318 implementation=amd64-pmc persecond=4800000000 info: backend=avx512 num_trials=100000 src_lens=64,256,1024,4096,16384 dst_lens=32 function,dst_len,64,256,1024,4096,16384 sha3_224,28,15.4,7.8,7.8,7.1,7.0 sha3_256,32,15.5,7.8,7.8,7.6,7.4 sha3_384,48,15.5,11.7,9.8,9.8,9.7 sha3_512,64,15.6,15.5,14.6,13.9,13.9 shake128,32,15.5,7.8,6.9,6.2,6.1 shake256,32,15.5,7.8,7.9,7.6,7.4 bench results, after change: pabs@flex:~/git/sha3/tests/bench> ./bench info: cpucycles: version=20240318 implementation=amd64-pmc persecond=4800000000 info: backend=avx512 num_trials=100000 src_lens=64,256,1024,4096,16384 dst_lens=32 function,dst_len,64,256,1024,4096,16384 sha3_224,28,15.4,7.8,7.8,7.1,7.0 sha3_256,32,15.6,7.8,7.8,7.6,7.4 sha3_384,48,15.6,11.7,9.8,9.8,9.7 sha3_512,64,15.6,15.5,14.6,13.8,13.8 shake128,32,15.6,7.9,6.9,6.2,6.1 shake256,32,15.7,7.9,7.9,7.6,7.4
2024-05-01sha3.h: remove extraneous exclamation pointsPaul Duncan1-2/+2
2024-05-01sha3.c: improve internal documentation, s/SHA3_BACKEND_/BACKEND_/Paul Duncan1-78/+134
2024-04-29sha3.c: add permute_n_{scalar,avx512}() and refactor ↵Paul Duncan1-198/+23
permute{,12}_{scalar,avx512}() to use them
2024-04-29sha3.c: s/ifdef/if/ in a few placesPaul Duncan1-2/+2
2024-04-29tests/bench: add backend to metadataPaul Duncan2-3/+4
2024-04-29.gitignore: add all-fnsPaul Duncan1-0/+1
2024-04-29sha3.[hc]: add sha3_backend()Paul Duncan2-1/+26
2024-04-29examples/06-all/all-fns.c: add sha3_backend() examplePaul Duncan1-0/+11
2024-04-29sha3.c: add/use SHA3_BACKENDPaul Duncan1-11/+26
2024-04-29tests/bench/README.md: add n2l examplePaul Duncan1-2/+8
2024-04-29tests/bench: refactor so bench prints a cpb table to stdoutPaul Duncan2-93/+129
2024-04-29tests/bench/README.md: remove mean_cpb, add "cycles per byte" link, misc ↵Paul Duncan1-4/+5
cleanups
2024-04-29tests/bench/bench.c: fix bench function memory allocation, remove mean_cpb ↵Paul Duncan1-20/+24
from output, use defines for src/dst lengths
2024-04-29sha3.c: permute{,12}_avx512(): optimize, update header commentPaul Duncan1-367/+295
2024-04-29sha3.c: build permute12_scalar() during testing, remove old commentPaul Duncan1-7/+1
2024-04-29sha3.c: clean up whitespace in rho(), pi(), and chi()Paul Duncan1-38/+50
2024-04-29add tests/benchPaul Duncan7-0/+354
2024-04-29add rand-bytes.hPaul Duncan1-0/+20
2024-04-29Makefile: add -march=native -mtune=native to TEST_CFLAGSPaul Duncan1-1/+1
2024-04-29sha3.c: use shared RCS, rename scalar and avx512 permute() to ↵Paul Duncan1-78/+547
permute_{scalar,axv512}(), hard-code num_rounds to 24 in permute_{scalar,avx512}(), add permute12_{scalar,avx512}(), absorb12(), and xof12_{init,absorb,raw,absorb,squeeze_raw,squeeze,once}(), update turboshake to use xof12_*(), move permute tests to PERMUTE_TESTS static array, rename test_permute() to test_permute_scalar(), add test_permute_avx512(), add PERMUTE12_TESTS and test_permute12_{scalar,avx512}()
2024-03-02LICENSE.txt: update copyright yearPaul Duncan1-1/+1
2024-03-02README.md: move hmac down in list of algorithms, add note about recommended ↵v0.6Paul Duncan1-1/+4
compiler flags
2024-03-02sha3.c: add fail_test(), use it in all testsPaul Duncan1-211/+67
2024-03-02sha3.c: rename HASH_RATE() to RATE(), document RATE() and use it everywhere, ↵Paul Duncan1-34/+71
improve comments
2024-03-02sha3.c: fix commentPaul Duncan1-1/+1
2024-03-02sha3.c: parameterize turboshakePaul Duncan1-50/+34
2024-03-02sha3.c: parameterize parallelhashPaul Duncan1-271/+143
2024-03-02sha3.c: use __func__ in testsPaul Duncan1-49/+49
2024-03-02sha3.c: parameterize tuplehashPaul Duncan1-121/+68
2024-03-02sha3.c: parameterize kmac functionsPaul Duncan1-235/+122
2024-03-02sha3.c: move kmac128(), add missing kmac commentsPaul Duncan1-53/+63
2024-03-02sha3.c: parameterize cshakePaul Duncan1-158/+86
2024-03-02sha3.c: parameterize rate handling, simplify hash, shake, hmac, and cshake ↵Paul Duncan1-72/+58
definitions
2024-03-02sha3.c: remove keccak(), refactor xof_absorb_raw() into absorb(), update ↵Paul Duncan1-92/+54
hash_once(), hash_absorb(), and xof_absorb_raw() to use absorb()
2024-03-02sha3.h: move shake prototypes above hmacPaul Duncan1-181/+182
2024-03-02sha3.h: s/calculat/calculate/Paul Duncan1-1/+1
2024-03-02sha3.[hc]: update commentsPaul Duncan2-3/+5
2024-03-02sha3.c: move hmac below shakePaul Duncan1-71/+71
2024-03-02sha3.c: add DEF_HASH(), DEF_HMAC(), and DEF_SHAKE(), remove explicit hash, ↵Paul Duncan1-376/+137
hmac, and shake implementations
2024-03-02sha3.c: xof_absorb_raw_large(): rename to xof_absorb_raw(), do not copy ↵Paul Duncan1-61/+11
state, check message alignment to elide unnecessary copies, remove xof_absorb_raw_small()
2024-02-26rename SHA3_*_CAPACITY to SHA3_*_LEN, document SHA3_*_LEN and ↵Paul Duncan1-28/+47
{SHA3,SHAKE*}_*_RATE
2024-02-24sha3.c: s/sha3(/hash_once(/, s/sha3_(init|absorb|final)/hash_$1/Paul Duncan1-21/+21
2024-02-24sha3.c: improve internal documentationPaul Duncan1-1/+29
2024-02-24sha3.c: s/xof_absorb_raw_bulk/xof_absorb_raw_large/, ↵Paul Duncan1-10/+18
s/xof_absorb_raw_simple/xof_absorb_raw_small/, add documentation to xof absorb functions