log in |
Message boards : Number crunching : Compiling for AVX-512
1 · 2 · 3 · Next
Author | Message |
---|---|
Has any consideration been given to compiling for AVX-512? | |
ID: 2029 · Reply Quote | |
A free compiler may be available: | |
ID: 2031 · Reply Quote | |
So few processors are capable of AVX-512, I wouldn't think it worth the effort. | |
ID: 2035 · Reply Quote | |
So few processors are capable of AVX-512, I wouldn't think it worth the effort. I'll make it worth someone's while to compile for AVX-512. All Cascade Lake and Skylake CPUs are AVX-512 capable: https://en.wikipedia.org/wiki/Skylake_(microarchitecture)#High-end_desktop_processors_(Skylake-X) https://ark.intel.com/content/www/us/en/ark/products/codename/124664/cascade-lake.html Even more than I thought: https://en.wikipedia.org/wiki/AVX-512#:~:text=AVX%2D512%20are%20512%2Dbit,i5%2D7640X%20and%20Core%20i7%2D I'm running these now and all my future CPU upgrades will be AVX-512 capable: i9-10980XE, i9-9980XE, i9-7980XE & i9-9960X. They're vastly underutilized without AVX-512. | |
ID: 2037 · Reply Quote | |
Hmmm, whatever. | |
ID: 2039 · Reply Quote | |
Like he said AVX-512 is for HPC and that's what we're doing. The proof's in the pudding. Until someone compiles for AVX-512 and we can test it we won't know. | |
ID: 2040 · Reply Quote | |
The code in BOINC already probes for the host's cpu capabilities, so it knows whether the processor supports AVX-512 instructions. | |
ID: 2042 · Reply Quote | |
I found 236 AVX-512 capable CPUs plus 2 of mine that haven't been included yet: | |
ID: 2043 · Reply Quote | |
What's a User? It's a unique CPID (so, yes, aggregators are counted as a single user) About AVX-512, I do not have AVX-512 capable computers so it would very difficult for me to build and test a new application. Also please keep in mind that the benefits are unknown, see: AVX-512 slower than AVX. For instance FMA against AVX only provided a very small speed gain, that's one of the reasons I do not distribute FMA for Windows, also at the very beginning FMA was broken for Ryzen. Anyway, if someone would like to build up a AVX-512 application I can provide some assistance. | |
ID: 2044 · Reply Quote | |
...benefits are unknown...Exactly and we won't know until someone tries it. see: AVX-512 slower than AVX.I tried to read this thread but it got very tedious. The only thing I learned was Intel made a major mistake and produced some CPUs with singlets instead of pairs so I updated my list. if someone would like to build up a AVX-512 application I can provide some assistance.And if anyone does try it I'd be glad to test it :-) Intel Core i9-7900X CPU @ 3.30GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 Intel Core i9-7960X CPU @ 2.80GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 Intel Core i9-7980XE CPU @ 2.60GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 i9-9960X 2 x AVX-512 i9-10980XE 2 x AVX-512 Intel Xeon CPU 8164 @ 2.00GHz [Family 6 Model 85 Stepping 3] 2 x AVX-512 Intel Xeon Gold 6130 CPU @ 2.10GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 Intel Xeon Gold 6140 CPU @ 2.30GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 Intel Xeon Gold 6148 CPU @ 2.40GHz [Family 6 Model 26 Stepping 4] 2 x AVX-512 Intel Xeon Platinum 8124M CPU @ 3.00GHz [Family 6 Model 85 Stepping 3] 2 x AVX-512 Intel Xeon Platinum 8124M CPU @ 3.00GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 Intel Xeon Platinum 8167M CPU @ 2.00GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 Intel Xeon Platinum 8168 CPU @ 2.70GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 Intel Xeon Platinum 8171M CPU @ 2.60GHz [Family 6 Model 85 Stepping 4] 2 x AVX-512 Intel Xeon Silver 4110 CPU @ 2.10GHz [Family 6 Model 85 Stepping 4] 1 x AVX-512 Intel Xeon Gold 5118 CPU @ 2.30GHz [Family 6 Model 85 Stepping 4] 1 x AVX-512 | |
ID: 2063 · Reply Quote | |
FMA against AVX only provided a very small speed gain, that's one of the reasons I do not distribute FMA for Windows... Look at the Applications page. There are many Windoze crunchers here than Linux. FMA WUs require both wingmen to be Linux users. Be more more efficient if FMA compiled for Windoze as well. ____________ | |
ID: 2081 · Reply Quote | |
FMA against AVX only provided a very small speed gain, that's one of the reasons I do not distribute FMA for Windows... Not convinced by this statement. Almost all my WUs are fma and if I look at the wingmen the very often receive avx or sse2 equivalents - see Workunit 25706025 or 25705750 as examples. | |
ID: 2082 · Reply Quote | |
Almost all my WUs are fma and if I look at the wingmen the very often receive avx or sse2 equivalents - see Workunit 25706025 or 25705750 as examples. I have never paid any attention to this, but now that you mention it, it seems that my Ubuntu 20.04.1 machine is being validated by everyone and everything. http://gene.disi.unitn.it/test/results.php?hostid=55080&offset=0&show_names=0&state=4&appid= | |
ID: 2083 · Reply Quote | |
The validator of the science application is agnostic about who or what produced the output file. | |
ID: 2084 · Reply Quote | |
It only cares if there is consensus reached between the two samples of matching or closed matched results. Sure. But it is frequently the case on some projects that "similar" outputs don't match well enough for validation. I suppose it depends on the science, and how the validator is calibrated. | |
ID: 2085 · Reply Quote | |
Oh. I thought the WU needed to match its program since all WUs say what instruction set they're for. | |
ID: 2086 · Reply Quote | |
Oh. I thought the WU needed to match its program since all WUs say what instruction set they're for. Diagnostic for troubleshooting when things go wrong. | |
ID: 2088 · Reply Quote | |
Some time ago, when starting to deploy applications for different architectures we made sure (and this wasn't easy) that the output file matched our "gold" one, which we defined to be the one got using plain Linux x64. Therefore, a Xeon FMA Linux output exactly matches any other one output (like ARM vfpv3 or MacOs). This way both the distribution and the validation of workunits are much easier to handle. | |
ID: 2089 · Reply Quote | |
Oh. I thought the WU needed to match its program since all WUs say what instruction set they're for. All the instruction set in the name is doing is showing how the source code was compiled. Same source code, just using a different set of compilation parameters for using different hardware. Nothing more. One instruction set works better or worse on whatever hardware architecture and age of cpu you are using. My Ryzen and Treadripper perform the fastest on the FMA app. | |
ID: 2090 · Reply Quote | |
BTW, as I explained before, I cannot build and test a AVX512 application (I don't have any AVX512 capable computer here). Anyway, it is not difficult to compile the source code (gcc on Linux x64), the tricky thing is to compile the needed BOINC libraries but, if requested, I can provide them.I haven't compiled anything since the nineties. Maybe I can get a free compiler from Intel for open source use. Is the gene_pcim code open source??? | |
ID: 2091 · Reply Quote | |
Message boards :
Number crunching :
Compiling for AVX-512