SChernykh
eae3a62345
Auto-detect the fastest code for dataset init
2020-12-19 13:59:28 +01:00
SChernykh
aaf1ed1c99
Dataset initialization with AVX2 (WIP)
2020-12-18 14:53:54 +01:00
XMRig
25a9fb2ab4
Fixed secure JIT on Linux and code cleanup.
2020-12-12 19:18:47 +07:00
XMRig
84443a4e8b
Added generic secure JIT support for RandomX.
2020-12-11 23:17:54 +07:00
SChernykh
83d152f557
More static analysis fixes
2020-12-08 16:05:58 +01:00
SChernykh
d822ffef36
Fixed Debug build in Visual Studio
2020-10-27 14:08:36 +01:00
SChernykh
1865d31d0c
RandomX: added huge-pages-jit config parameter
...
Set to false by default, gives 0.2% boost on Ryzen 7 3700X with 16 threads, but hashrate might be unstable on Ryzen between launches. Use with caution.
2020-10-07 17:42:55 +02:00
SChernykh
b8d604bec4
RandomX improved performance of GCC compiled binaries
...
JIT compilator was slower compared to MSVC compiled binary. Up to +0.1% speedup on rx/wow in Linux.
2020-09-22 13:48:11 +02:00
SChernykh
1e38eb3f4f
RandomX refactoring, moved more stuff to compile time
...
Small x86 JIT compiler speedup.
2020-09-18 20:51:25 +02:00
SChernykh
fe96036607
RandomX JIT refactoring
...
- Smaller memory footprint
- A bit faster overall
2020-04-09 14:24:54 +02:00
SChernykh
c6e9e2308d
Optimized CFROUND
...
Shorter version using BMI2 instructionns
2020-02-21 19:00:58 +01:00
SChernykh
c7b4a89832
Tuned JIT compiler
...
0.3-0.4% speedup depending on CPU.
2020-02-20 20:59:22 +01:00
SChernykh
eaaf157997
Added support for BMI2 instructions
2020-01-21 19:44:56 +01:00
SChernykh
efdbda40f0
Optimizations for AMD Bulldozer
...
- Added support for XOP instructions
- Enabled Ryzen code for Bulldozer because it's faster there too
2020-01-15 13:04:26 +01:00
SChernykh
07e36a6cf0
Refactor Ryzen fix to fix compilation issues
2019-12-31 11:55:07 +02:00
SChernykh
4237ed6fad
Add vzeroupper for processors with AVX
...
To avoid false dependencies on upper 128 bits of YMM registers.
2019-12-18 09:12:25 +01:00
SChernykh
7e724aa70d
More optimizations for Ryzen
2019-12-09 20:29:05 +01:00
SChernykh
2ed2629653
Optimized dataset read for Ryzen CPUs
...
Removed register dependency in dataset read, +0.8% speedup on average.
2019-12-08 16:14:02 +01:00
SChernykh
4b0f3fab83
RandomX fixes
...
Intel JCC erratum fix and various other improvements, see more here: https://www.phoronix.com/scan.php?page=article&item=intel-jcc-microcode&num=1
2019-12-01 08:46:35 +01:00
SChernykh
9314205de2
Refactored JIT compiler for x86, small RandomX speedup
2019-10-05 21:40:21 +02:00
XMRig
a3e66e94cf
Fix compile warning, mostly struct/class inconsistency.
2019-09-13 18:21:05 +07:00
SChernykh
5fffeed646
RandomX: optimized loading from scratchpad
...
Prefetches scratchpad data as soon as possible to calculate data address for the next load.
Up to ~1.4% speedup on Ryzen 7 3700X @ 4.1 GHz, RAM 3200 MHz 14-14-14-28 with optimized sub-timings:
Variant|Before H/S|After H/S
-------|----------|---------
rx/0|8663|8777
rx/wow|9867|10009
rx/loki|8652|8731
2019-09-11 19:10:01 +02:00
SChernykh
d8f78c1d19
Optimized RandomX JIT compiler
...
Hashrate improved by 0.5-1.5% depending on RandomX version and CPU.
2019-08-27 20:18:56 +02:00
XMRig
54994d361b
Resolved conflict between argon2 implementations.
2019-08-16 03:18:35 +07:00
SChernykh
7d10b6b71f
Integrated RandomX, added RandomXL (Loki)
2019-07-01 20:11:51 +02:00