Commit Graph

60 Commits

Author SHA1 Message Date
SChernykh
efdbda40f0 Optimizations for AMD Bulldozer
- Added support for XOP instructions
- Enabled Ryzen code for Bulldozer because it's faster there too
2020-01-15 13:04:26 +01:00
SChernykh
716df54ac0 JIT compiler: removed unnecessary memcpy from generateProgram() 2020-01-13 18:00:41 +01:00
SChernykh
a976379f19 JIT compiler tweaks 2020-01-06 13:57:48 +01:00
SChernykh
07e36a6cf0 Refactor Ryzen fix to fix compilation issues 2019-12-31 11:55:07 +02:00
XMRig
82d4a1b3c9 Fix build. 2019-12-28 02:00:08 +07:00
SChernykh
a1f223dcb0 Fix for 1st-gen Ryzen crashes 2019-12-27 12:40:38 +02:00
XMRig
a46c4f02e8 Removed "rx/v" algorithm. 2019-12-26 22:34:19 +07:00
XMRig
e7c4b1d0c2 Fixed memory allocation checks. 2019-12-25 04:39:21 +07:00
Tony Butler
4c4b29d45c Add MoneroV (rx/v) algorithm [based on MoneroOcean/master] 2019-12-18 16:17:22 -07:00
SChernykh
18ceec0022 Fixed crash with GCC compiler 2019-12-18 17:32:57 +01:00
SChernykh
3bb6b332db Fixed AVX detection 2019-12-18 12:20:21 +01:00
SChernykh
5492d9270f Update jit_compiler_x86_static.S 2019-12-18 09:13:21 +01:00
SChernykh
4237ed6fad Add vzeroupper for processors with AVX
To avoid false dependencies on upper 128 bits of YMM registers.
2019-12-18 09:12:25 +01:00
SChernykh
9b5ccb55a5 RandomSFX (Safex Cash variant) support 2019-12-16 19:36:29 +01:00
SChernykh
9629001fe4 Update jit_compiler_x86_static.S 2019-12-09 20:30:37 +01:00
SChernykh
7e724aa70d More optimizations for Ryzen 2019-12-09 20:29:05 +01:00
XMRig
280618a9eb Memory allocation refactoring. 2019-12-08 23:17:39 +07:00
SChernykh
08eef1bf9a Fix GCC compilation 2019-12-08 16:51:37 +01:00
SChernykh
8a1679ec89 Fixed indentation 2019-12-08 16:20:46 +01:00
SChernykh
2ed2629653 Optimized dataset read for Ryzen CPUs
Removed register dependency in dataset read, +0.8% speedup on average.
2019-12-08 16:14:02 +01:00
SChernykh
0a5c12295d Added 1GB hugepages support for Linux 2019-12-05 19:39:47 +01:00
SChernykh
4b0f3fab83 RandomX fixes
Intel JCC erratum fix and various other improvements, see more here: https://www.phoronix.com/scan.php?page=article&item=intel-jcc-microcode&num=1
2019-12-01 08:46:35 +01:00
SChernykh
6e96a38a87 Use XMRIG_ARMv8 macro 2019-11-15 16:12:26 +01:00
SChernykh
d13c77ddb9 Fix for 32-bit ARM compilation 2019-11-15 16:00:48 +01:00
SChernykh
74d6a35f1a Fix function names for clang on Apple 2019-11-12 14:42:21 +01:00
SChernykh
e8c1a8bdd0 Prefer sys_icache_invalidate on iOS
Also break compilation with error if clear cache is not available
2019-10-18 18:17:57 +02:00
SChernykh
c4ce4baa69 Fixed __builtin___clear_cache detection 2019-10-18 18:04:13 +02:00
SChernykh
c67f2c2f51 Update jit_compiler_a64.cpp 2019-10-18 16:36:50 +02:00
SChernykh
c31f2bb92b Update jit_compiler_a64.cpp 2019-10-18 16:32:01 +02:00
SChernykh
5af02671ee Update jit_compiler_a64.cpp 2019-10-18 16:28:49 +02:00
SChernykh
ab1175a1a7 Fixed code cache cleanup on iOS/Darwin 2019-10-18 16:26:15 +02:00
XMRig
2d7c99cc47 Fix copy/paste typo. 2019-10-18 21:26:15 +07:00
SChernykh
0640950d44 Fix ARM64 code alignemtn 2019-10-18 16:18:45 +02:00
XMRig
84e0b5d0ca #1246 Fixed build on iOS. 2019-10-18 12:02:10 +07:00
SChernykh
e7f137bd23 Sync with latest RandomX code
Fix a possible out-of-bounds access in superscalar generator
2019-10-13 22:13:29 +02:00
SChernykh
e766e11dfa RandomX (Arqma variant) support 2019-10-08 19:00:19 +02:00
SChernykh
9314205de2 Refactored JIT compiler for x86, small RandomX speedup 2019-10-05 21:40:21 +02:00
SChernykh
1b33b073f3 Set scratchpad pointer to null by default
To avoid freeing random blocks of memory in some cases.
2019-09-24 08:53:00 +02:00
SChernykh
d346994fbd Workaround for a bug in binutils-2.32-1 on ARM
ldr/madd instruction sequence makes compiled binary crash, so separate them.
2019-09-23 23:12:40 +02:00
XMRig
7303be4c46 Revert instructions_portable.cpp to avoid warning on gcc compilers. 2019-09-22 00:59:53 +07:00
SChernykh
46ef04a4cf Added JIT compiler for RandomX on ARMv8 2019-09-21 10:10:52 +02:00
XMRig
a3e66e94cf Fix compile warning, mostly struct/class inconsistency. 2019-09-13 18:21:05 +07:00
SChernykh
5fffeed646 RandomX: optimized loading from scratchpad
Prefetches scratchpad data as soon as possible to calculate data address for the next load.

Up to ~1.4% speedup on Ryzen 7 3700X @ 4.1 GHz, RAM 3200 MHz 14-14-14-28 with optimized sub-timings:
Variant|Before H/S|After H/S
-------|----------|---------
rx/0|8663|8777
rx/wow|9867|10009
rx/loki|8652|8731
2019-09-11 19:10:01 +02:00
SChernykh
e2e4fe5f8c Optimized CFROUND
One less micro-op
2019-09-04 20:47:47 +02:00
SChernykh
fffe3c2e10 RandomX optimizations
- Optimized soft AES code, up to +30% hashrate on CPU without AES support
- Added prefetch for the first dataset access, up to +0.1% hashrate
2019-09-04 19:24:12 +02:00
Matt Smith
8f4aa6871a Fix linker marking entire executable as executable stack
See: https://wiki.ubuntu.com/SecurityTeam/Roadmap/ExecutableStacks
See: https://wiki.gentoo.org/wiki/Hardened/GNU_stack_quickstart
2019-08-29 14:12:43 +01:00
SChernykh
ab5d1a6cc9 Reverted intrin_portable.h 2019-08-28 07:20:01 +02:00
SChernykh
d8f78c1d19 Optimized RandomX JIT compiler
Hashrate improved by 0.5-1.5% depending on RandomX version and CPU.
2019-08-27 20:18:56 +02:00
SChernykh
fc670b2a58 Updated RandomX 2019-08-27 16:12:13 +02:00
XMRig
3660216ebc Fix prefix. 2019-08-16 04:16:29 +07:00