From 80d046db7a5cafe05c0a4872087d9d6d549a1145 Mon Sep 17 00:00:00 2001 From: John Tromp Date: Thu, 27 Oct 2016 16:08:25 -0400 Subject: [PATCH] tweak targets, fix README --- README.md | 15 +++++++++------ equi_miner.cpp | 2 +- 2 files changed, 10 insertions(+), 7 deletions(-) diff --git a/README.md b/README.md index 2291a7a..9e8babe 100644 --- a/README.md +++ b/README.md @@ -36,7 +36,7 @@ reducing waste by about 7%. My solver now needs only 144MB compared to xenoncat' Seeing that my solver was spending 45% of runtime on hashing, I asked xenoncat if (s)he could make their assembly blake2b implementation available through a C binding, which s(he) -very generously did. +very generously did. My solver executables using this are called dev1/dev. Zooko had earlier suggested looking at Samuel Neves' blake2bp implemention for faster hashing. After initially rejecting this approach due to different blake2bp semantics, I came back to @@ -53,10 +53,13 @@ Performance summary (on 4GHz i7-4790K and NVidia GTX980): - equi1: 4.6 Sol/s - eqavx21: 5.9 Sol/s -- equi -t 8: 4.6 Sol/s -- eqavx2 -t 8: TBA Sol/s -- 8 x equi1: 20.3 Sol/s -- dev1: 6.5 Sol/s (xenoncat's blake) -- 8 x dev1: 20.6 Sol/s +- dev1: 6.5 Sol/s + +- equi -t 8: 13.7 Sol/s +- eqavx2 -t 8: 16.7 Sol/s - dev -t 8: 17.2 Sol/s + +- 8 x eqavx21: 20.3 Sol/s +- 8 x dev1: 20.6 Sol/s + - eqcuda: 23.6 Sol/s diff --git a/equi_miner.cpp b/equi_miner.cpp index 31d0917..0da3829 100644 --- a/equi_miner.cpp +++ b/equi_miner.cpp @@ -51,7 +51,7 @@ int main(int argc, char **argv) { #ifdef USE_AVX2 printf(" and AVX2 intrinsics to compute 4-way blake2b\n"); #else - printf("; no AVX2 detected\n"); + printf(" and no AVX2\n"); #endif u32 sumnsols = 0; char headernonce[HEADERNONCELEN];