Age | Commit message | Author | Files | Lines
2024-05-27 | sha3.c: permute_n_avx2(): rho: use ROLI instead of ROLV for hi | Paul Duncan | 1 | -11/+10
2024-05-27 | sha3.c: permute_n_avx2(): remove K64 | Paul Duncan | 1 | -1/+1
2024-05-27 | sha3.c: permute_n_avx2(): clean up comments | Paul Duncan | 1 | -17/+12
2024-05-27 | sha3.c: permute_n_avx2(): add CHI() | Paul Duncan | 1 | -59/+25
2024-05-26 | sha3.c: avx2: s/loadu_epi64/loadu_si256 | Paul Duncan | 1 | -10/+10
2024-05-26 | sha3.c: permute_n_avx2(): chi: explicit andnot | Paul Duncan | 1 | -5/+5
2024-05-26 | sha3.c: permute_n_avx2(): chi: use blend instead of permute where possible, r... | Paul Duncan | 1 | -16/+19
2024-05-26 | sha3.c: permute_n_avx2(): refactor lane masks | Paul Duncan | 1 | -44/+34
2024-05-26 | sha3.c: permute_n_avx2(): pi: remove gather (better, but still slower than sc... | Paul Duncan | 1 | -29/+46
2024-05-26 | sha3.c: permute_n_avx2(): rename theta permute ids | Paul Duncan | 1 | -3/+9
2024-05-26 | tests/avx2: use permutes for pi | Paul Duncan | 1 | -13/+50
2024-05-26 | sha3.c: add initial avx2 backend (slow) | Paul Duncan | 1 | -0/+249
2024-05-26 | add tests/avx2 | Paul Duncan | 5 | -0/+477
2024-05-26 | add tests/diagrams/avx512.svg | Paul Duncan | 1 | -0/+61
2024-05-26 | add tests/diagrams | Paul Duncan | 2 | -0/+26
2024-05-17 | sha3.c: permute_n_avx512(): simplify num_rounds logic | Paul Duncan | 1 | -5/+2
2024-05-17 | sha3.c: permute_n_scalar(): simplify num_rounds logic | Paul Duncan | 1 | -2/+2
2024-05-09 | examples/*/Makefile, tests/*/Makefile: add BACKEND support, mark clean target... [v0.8] | Paul Duncan | 9 | -16/+40
2024-05-09 | sha3.c: document how to add a new backend | Paul Duncan | 1 | -3/+30
2024-05-09 | Makefile: remove -fsanitize=leak (not supported in macos) | Paul Duncan | 1 | -1/+10
2024-05-09 | sha3.[hc]: add sha3_xof12_t, refactor xor12 and turboshake functions to use it | Paul Duncan | 2 | -12/+28
2024-05-09 | sha3.h: remove extraneous newline | Paul Duncan | 1 | -1/+0
2024-05-09 | sha3.c: add internal documentation, s/absorb12/absorb_12/ | Paul Duncan | 1 | -39/+227
2024-05-08 | sha3.c: update internal documentation | Paul Duncan | 1 | -16/+50
2024-05-08 | rand-bytes.h: use getentropy() instead of getrandom() to support macos | Paul Duncan | 1 | -8/+24
2024-05-08 | sha3.c, Makefile: s/SHA3_TEST/TEST_SHA3/ | Paul Duncan | 2 | -5/+5
2024-05-08 | .gitignore: add tests/neon/neon | Paul Duncan | 1 | -0/+1
2024-05-08 | s/permute/permute_24/, s/permute12/permute_12/, misc comment cleanup | Paul Duncan | 1 | -99/+89
2024-05-08 | sha3.c: rename hybrid-neon to hybrid, fix comments | Paul Duncan | 1 | -30/+26
2024-05-08 | sha3.c: hybrid: do not mix neon and non-neon instructions (still slow) | Paul Duncan | 1 | -15/+26
2024-05-08 | s/SHA3_BACKEND/BACKEND/g | Paul Duncan | 5 | -66/+66
2024-05-08 | sha3.c: s/call permute_n_.*(/call permute_n(/ in test comments | Paul Duncan | 1 | -4/+4
2024-05-08 | sha3.c: add hybrid-neon backend (slow) | Paul Duncan | 1 | -1/+202
2024-05-08 | sha3.c: diet-neon: misc fixes. still too slow | Paul Duncan | 1 | -46/+36
2024-05-08 | sha3.c: neon, diet-neon: use replace vorrq(vshlq, vshrq) and vorr(vshl,vshr) ... | Paul Duncan | 1 | -3/+4
2024-05-08 | sha3.c: move INTERNAL before includes | Paul Duncan | 1 | -2/+2
2024-05-08 | sha3.h: fix typo in sha3_backend() documentation | Paul Duncan | 1 | -1/+1
2024-05-08 | README.md: add median link | Paul Duncan | 1 | -0/+2
2024-05-08 | sha3.c: prefer scalar backend to neon for now [v0.7] | Paul Duncan | 1 | -2/+2
2024-05-08 | README.md: add Backends and Benchmarks sections | Paul Duncan | 1 | -1/+71
2024-05-08 | tests/bench/README.md: fix cpb links, update default trial count, add pi5 res... | Paul Duncan | 1 | -21/+103
2024-05-08 | tests/bench/bench.c: reduce defailt number of trails from 100k to 2k | Paul Duncan | 1 | -1/+1
2024-05-08 | sha3.c: neon: refactor, add documentation | Paul Duncan | 1 | -115/+157
2024-05-08 | tests/neon/neon.c: port changes back from sha3.c | Paul Duncan | 1 | -302/+405
2024-05-07 | tests/neon/Makefile: add all target | Paul Duncan | 1 | -2/+5
2024-05-07 | tests/bench/Makefile: add test target | Paul Duncan | 1 | -0/+3
2024-05-07 | sha3.c: s/union/struct/ (yeesh) | Paul Duncan | 1 | -7/+2
2024-05-06 | sha3.c: neon backend now twice the speed of scalar backend (~50% fewer cyles,... | Paul Duncan | 1 | -153/+137
2024-05-05 | sha3.c: diet-neon: s/permute_n_neon/permute_n_diet_neon/ | Paul Duncan | 1 | -2/+2
2024-05-05 | sha3.c: add diet-neon backend (even slower, see commit message) | Paul Duncan | 1 | -0/+329