So I came across 4 Fermi M2090, installed CentOS-6.9, the toolchain, …, and was able to build gpuPlotGenerator-4.0.4
#bin/gpuPlotGenerator.exe listDevices 0
Id: 3
Type: GPU
Name: Tesla M2090
Vendor: NVIDIA Corporation
Version: OpenCL 1.1 CUDA
Driver version: 375.66
Max clock frequency: 1301MHz
Max compute units: 16
Global memory size: 5GB 946MB 704KB
Max memory allocation size: 1GB 492MB 688KB
Max work group size: 1024
Local memory size: 48KB
Max work-item sizes: (1024, 1024, 64)
BUT: Plotting in buffer mode just hangs at the last "buffer block" - up to that point all is good, GPUs are busy, the output file grows. Then the last piece is never written, GPUs and plotter process are idle.
This command line produces a 14 GiB file and omits the last 2 GiB:
#bin/gpuPlotGenerator.exe generate buffer /tmp/12345678901234567890_738197504_65536_8192
bin/gpuPlotGenerator.exe: /usr/local/cuda-8.0/targets/x86_64-linux/lib/libOpenCL.so.1: no version information available (required by bin/gpuPlotGenerator.exe)
-------------------------
GPU plot generator v4.0.4
-------------------------
Author: Cryo
Bitcoin: 138gMBhCrNkbaiTCmUhP9HLU9xwn5QKZgD
Burst: BURST-YA29-QCEW-QXC3-BKXDL
----
Loading platforms...
Loading devices...
Loading devices configurations...
Initializing generation devices...
[0] Device: Tesla M2090 (OpenCL 1.1 CUDA)
[0] Device memory: 512MB
[0] CPU memory: 512MB
Initializing generation contexts...
[0] Path: /tmp/12345678901234567890_738197504_65536_8192
[0] Nonces: 738197504 to 738263039 (16GB 0MB)
[0] CPU memory: 2GB 0MB
----
Devices number: 1
Plots files number: 1
Total nonces number: 65536
CPU memory: 2GB 512MB
----
Generating nonces...
Oh, and I played around with devices.txt, single+multiple GPU, various localWorkSize + hashesNumber, to no avail;
0 0 2048 128 8192
0 0 2048 256 8192
0 0 4096 512 8192
...
globalWorkSize (corresponding to RAM on GPU) MUST be under 4 GiB, although these M2090 have 6. -?-
Yes, /tmp has 32 GiB of free space.
Ideas, anyone ?