Posts by Beyond
1) Message boards : Number crunching : No work for newly attached machines? (Message 1206)
Posted 16 days ago by Profile Beyond
One of my teammates is reporting:

"Having a heck of a time getting any work on the machines I added here, they request new tasks and event log just says the request was completed, got 0 new tasks. No error, no out of work, just nothing. Machines that were attached several weeks ago seem to work fine."

What's going on?
2) Message boards : Wish List : Future requests (Message 1202)
Posted 23 days ago by Profile Beyond
Francesco (one of our collaborators) is making it, he told me it would probably be ready at the beginning of the next week.

Any update on this? It's a shame to be unnecessarily wasting our CPU time.
3) Message boards : Wish List : Future requests (Message 1201)
Posted 29 days ago by Profile Beyond
Francesco (one of our collaborators) is making it, he told me it would probably be ready at the beginning of the next week.

Cool! Thanks, might bug you again next week :-)
4) Message boards : Number crunching : Optimization (Message 1198)
Posted 16 Nov 2017 by Profile Beyond
Hi

Any news regarding the Mac OS version?

Any news yet? valterc is waiting for the Mac version before he makes the new optimized app the default (have no idea why that's a condition).
5) Message boards : Wish List : Future requests (Message 1197)
Posted 15 Nov 2017 by Profile Beyond
Can we possibly make the latest apps from Daniel the official versions? Haven't they been out for about 6 months?

That's something I wanted to do after a system upgrade (operating system). I still haven't found the time to do this (for instance, right now we are focusing on the work generator)... I'll try my best, maybe next week...

Thanks, might bug you again next week :-)

Next week is here :-P

Thanks ;-). Just came to do the reminder too!

Yep, this is starting to become 'my apologies thread'.... Notice the 'maybe' in the former sentence ;) I'll wait until we have all the needed versions (the one missing is the one for MacOS)

>> I'll wait until we have all the needed versions (the one missing is the one for MacOS)

Daniel?
6) Message boards : Wish List : Future requests (Message 1193)
Posted 14 Nov 2017 by Profile Beyond
Can we possibly make the latest apps from Daniel the official versions? Haven't they been out for about 6 months?

That's something I wanted to do after a system upgrade (operating system). I still haven't found the time to do this (for instance, right now we are focusing on the work generator)... I'll try my best, maybe next week...

Thanks, might bug you again next week :-)

Next week is here :-P

Thanks ;-). Just came to do the reminder too!
7) Message boards : Wish List : Future requests (Message 1185)
Posted 2 Nov 2017 by Profile Beyond
Can we possibly make the latest apps from Daniel the official versions? Haven't they been out for about 6 months?

That's something I wanted to do after a system upgrade (operating system). I still haven't found the time to do this (for instance, right now we are focusing on the work generator)... I'll try my best, maybe next week...

Thanks, might bug you again next week :-)
8) Message boards : Wish List : Future requests (Message 1179)
Posted 1 Nov 2017 by Profile Beyond
Can we possibly make the latest apps from Daniel the official versions? Haven't they been out for about 6 months?
9) Message boards : Number crunching : File size too big error (Message 883)
Posted 8 Feb 2017 by Profile Beyond
The Ec workunits were easy to manage: relatively long computation time and a small output file. The opposite happens with Vv. Ec (Escherichia coli) has only ~4000 genes, while Vv (Vitis vinifera) has more than 28000. So we are trying to balance this in order to achieve the optimal workunit size. (The upload error was obviously not expected. Our fault, sorry.)

Could you make a choice in project preferences to choose the WU type? Some of us (read: 1/2 of the USA) have stone age broadband (a misnomer) speeds and can't handle these large uploads for short runtime WUs. I've had to cut way back because of this.
10) Message boards : Number crunching : Optimization (Message 824)
Posted 25 Jan 2017 by Profile Beyond
Impressive work. Thanks! It looks like the sse2+fma is only 0.659% faster. Is that even worth having another version?

I am going to modify the code a bit to use AVX for div/sqrt calculations and SSE for the rest. This should improve performance a bit, so in the end it should be a bit faster than this SSE+FMA version. We will see how much faster once I have it ready. Also keep in mind that the WUs sent by the server are now 100 times longer, and we can expect them to become 200 times longer, so the actual time reduction per WU will not be so tiny.

Looking at the user reporting for his AMD X8, his results show that the new fma app is actually running around 11% faster than the sse2 version. This is also what I'm seeing on my four AMD X8 CPUs. A useful increase. Once again, THANKS!
11) Message boards : Number crunching : Optimization (Message 806)
Posted 24 Jan 2017 by Profile Beyond
I found a Haswell Xeon machine where I could perform some tests. Here are the results of running the app on test data, averaged over 10 runs:

SSE2: 20.600
SSE2+FMA: 20.465

Impressive work. Thanks! It looks like the sse2+fma is only 0.659% faster. Is that even worth having another version?
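The 0.659% figure can be reproduced from the two averages quoted above. This is only a minimal sketch: the function name is made up for illustration, and the time unit is not stated in the post, so the values are treated as plain numbers.

```python
def percent_faster(baseline_time: float, candidate_time: float) -> float:
    """Percent by which the candidate beats the baseline in throughput terms,
    i.e. (baseline / candidate - 1) * 100."""
    return (baseline_time / candidate_time - 1.0) * 100.0


# Averages over 10 runs, as quoted in the post (unit not stated there).
sse2_time = 20.600
sse2_fma_time = 20.465

gain = percent_faster(sse2_time, sse2_fma_time)
print(f"SSE2+FMA vs SSE2: {gain:.3f}% faster")
```

Dividing by the baseline time instead gives a slightly smaller number (about 0.655%), so the quoted figure appears to use the throughput form shown here.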
12) Message boards : Number crunching : Optimization (Message 803)
Posted 24 Jan 2017 by Profile Beyond
This app was limited by memory speed, so the SSE version may be faster. The older version executed more slow calculations (square roots, divisions), and the AVX loops ran fewer iterations because of the longer vectors, so AVX and FMA were faster. Now, with fewer of these slow calculations and with unrolled loops, SSE may be faster. I could test only on Sandy Bridge CPUs, which have slow AVX division and square roots, so the AVX app was slower there too. On newer CPUs with faster AVX and memory, things may work differently and the AVX/FMA versions may be faster. We will see; I hope people will post their results for the new apps here.

I am seeing a little less than 34 min/workunit on Win 7 64 with the newest AVX version, on an i3-4330 (Haswell) running two instances.

Would it be possible to test the newest sse2 app on your machine? I bet it's faster.
13) Message boards : Number crunching : Optimization (Message 797)
Posted 23 Jan 2017 by Profile Beyond
The new sse2 app is faster than the fma app on this machine (AMD FX-8320E), a reversal from the earlier optimized app.

Newest optimized sse2: 51:08 to 53:32
Newest optimized fma: 56:10 to 58:12

The sse2 app is about 9% faster on this CPU, while the older fma app was faster. Again, strange...

This app was limited by memory speed, so the SSE version may be faster. The older version executed more slow calculations (square roots, divisions), and the AVX loops ran fewer iterations because of the longer vectors, so AVX and FMA were faster. Now, with fewer of these slow calculations and with unrolled loops, SSE may be faster. I could test only on Sandy Bridge CPUs, which have slow AVX division and square roots, so the AVX app was slower there too. On newer CPUs with faster AVX and memory, things may work differently and the AVX/FMA versions may be faster. We will see; I hope people will post their results for the new apps here.

Thanks for the great explanation about what's probably going on. I was scratching my head over this one and it was starting to hurt. ;-)
14) Message boards : Number crunching : Optimization (Message 791)
Posted 23 Jan 2017 by Profile Beyond
I'm draining the queue on one of my AMD FX-8320E machines now. Will install the fixed fma version and test.

So far the new fma version is running but seems to be slower on this CPU than the sse2 app.
On the previous optimized app the fma was a little faster than the sse2 on this CPU. Strange.

Edit: The new sse2 app is faster than the fma app on this machine, a reversal from the earlier optimized app.

Newest optimized sse2: 51:08 to 53:32
Newest optimized fma: 56:10 to 58:12

The sse2 app is about 9% faster on this CPU, while the older fma app was faster. Again, strange...
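For anyone wanting to check the "about 9%" figure, here is a rough sketch that compares the midpoints of the two runtime ranges above. It assumes the times are minutes:seconds (the ratio comes out the same if they are hours:minutes), and the helper names are invented for illustration.

```python
def to_seconds(mmss: str) -> int:
    """Convert an 'MM:SS' string to seconds (assumed format)."""
    minutes, seconds = mmss.split(":")
    return int(minutes) * 60 + int(seconds)


def midpoint_seconds(low: str, high: str) -> float:
    """Midpoint of a runtime range given as two 'MM:SS' strings."""
    return (to_seconds(low) + to_seconds(high)) / 2


# Runtime ranges as posted for the AMD FX-8320E.
sse2_mid = midpoint_seconds("51:08", "53:32")
fma_mid = midpoint_seconds("56:10", "58:12")

# How much longer the fma app takes relative to the sse2 app, in percent.
fma_extra_percent = (fma_mid / sse2_mid - 1.0) * 100.0
print(f"sse2 midpoint: {sse2_mid:.0f} s, fma midpoint: {fma_mid:.0f} s")
print(f"fma takes {fma_extra_percent:.1f}% longer than sse2")
```

Using midpoints is a simplification; with only a range per app there is no way to recover a true average from the post.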
15) Message boards : Number crunching : Optimization (Message 788)
Posted 23 Jan 2017 by Profile Beyond
I found why the FMA version crashed. I compiled the Windows FMA version with the target CPU architecture set to Haswell, and gcc enabled AVX2, which is not supported by AMD Bulldozer CPUs, so the app crashed with the message "Illegal Instruction". But the SSE2 version crash is surprising; it is compiled with the same options (target architecture: core2). Could you check again to make sure that it crashes, and provide me a link to the failed task? I would like to check the error message.

I have recompiled and uploaded the FMA Windows version; it no longer uses AVX2, so it should work fine. I also uploaded separate AVX2 versions for Windows and Linux 64-bit. Could someone with a sufficiently new CPU run some benchmarks with test data on the AVX2 and FMA versions? I wonder if there is any performance difference between the AVX2 and FMA versions.

I'm draining the queue on one of my AMD FX-8320E machines now. Will install the fixed fma version and test.
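The "Illegal Instruction" crash described above happens when a binary uses an instruction set the CPU lacks. A rough sketch of checking which app variants a machine could run before installing one is below. The variant names and the one-flag-per-variant mapping are assumptions for illustration, not the project's actual build requirements, and the flag reader only works on Linux, where /proc/cpuinfo lists a "flags" line.

```python
# Hypothetical mapping: app variant -> CPU flags it is assumed to need.
VARIANT_REQUIREMENTS = {
    "sse2": {"sse2"},
    "avx": {"avx"},
    "fma": {"fma"},
    "avx2": {"avx2"},
}


def read_linux_cpu_flags(path: str = "/proc/cpuinfo") -> set:
    """Return the CPU feature flags from the first 'flags' line (Linux only)."""
    with open(path) as cpuinfo:
        for line in cpuinfo:
            if line.startswith("flags"):
                return set(line.split(":", 1)[1].split())
    return set()


def runnable_variants(cpu_flags: set) -> list:
    """List the variants whose assumed requirements the CPU satisfies."""
    return [
        name
        for name, needed in VARIANT_REQUIREMENTS.items()
        if needed <= cpu_flags
    ]


# Illustrative flag set for a CPU with AVX and FMA but no AVX2.
example_flags = {"sse2", "avx", "fma"}
print(runnable_variants(example_flags))
```

This only says which variants should start without an illegal-instruction error; as the timings in this thread show, the most advanced runnable variant is not necessarily the fastest one.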
16) Message boards : Number crunching : Optimization (Message 785)
Posted 23 Jan 2017 by Profile Beyond
But I'm getting compute errors with the new SSE2 and FMA apps. Only AVX is working very well on my FX 8320 (Win 10 64 bit). Keep up your great work!

On my four AMD FX-8320E and 8310 machines the older fma app worked but not the new version. However the newest sse2 version is working fine (error free and validating properly) on all of those boxes and also on my various AMD Phenom II X6 CPUs, the AMD 5350 APU and the Intel Celeron 1037U. Haven't tried the newest avx as the previous avx version didn't test faster for my machines.
17) Message boards : Number crunching : Optimization (Message 783)
Posted 23 Jan 2017 by Profile Beyond
Thanks Daniel!

I tried the newest fma version and it crashed on a machine that worked with the old fma optimized app.
The new sse2 version worked on all my various CPUs and was over twice as fast as the previous optimized app.
To be clear, the new sse2 app is more than 2x faster on every one of my machines, even the ones that ran the old fma version.
18) Message boards : Number crunching : Optimization (Message 757)
Posted 14 Jan 2017 by Profile Beyond

Immediate failure. I tested on 12/25 and those WUs are out of the database. On most machines the avx app ran but wasn't any faster (often slower) than the sse2 app. In fact I found no machines on which the avx app was superior. The fma app was a little bit faster than the other 2 versions on my AMD X8 boxes.

Be aware that you need a capable CPU to run those applications. Looking at your computers, only the AMD FX machines have both avx and fma; the Athlon 5350 has only avx. However, they are not known to be excellent in these capabilities :)

Thanks. Pretty much to the point I was making. Looks like you're in the same boat. What CPUs actually use avx effectively? Comparative times?
19) Message boards : Number crunching : Optimization (Message 755)
Posted 14 Jan 2017 by Profile Beyond
You mentioned errors with the AVX app. Are they problems with starting/running the app, or validation errors? If with starting/running, could you provide me a link to an example failed WU? I would like to check if there are some details which may be helpful.

Immediate failure. I tested on 12/25 and those WUs are out of the database. On most machines the avx app ran but wasn't any faster (often slower) than the sse2 app. In fact I found no machines on which the avx app was superior. The fma app was a little bit faster than the other 2 versions on my AMD X8 boxes.
20) Message boards : Number crunching : Optimization (Message 752)
Posted 13 Jan 2017 by Profile Beyond
I checked your computers and found a few validation errors, but no error reported by the app or the BOINC client that something was wrong. Maybe these are caused by that mysterious error mentioned by valterc before? I also saw this a few times, but on the SSE version. That was on a machine running 24/7 with BOINC configured with a long task switch time, so these WUs were crunched from start to end without interruptions. One of them was also crunched by someone else with my app, and for that person it validated successfully, so it is even more interesting.

The only 3 validation errors I had were all at once on 1 machine when there was a power glitch. The problem I had with the non sse2 versions were on certain CPU types. They errored all WUs immediately, but it was a while ago when I was testing and the WUs are no longer in the database. The sse2 version runs fine on every machine I've tried so far (15-20 boxes for myself and friends).




Copyright © 2017 CNR-TN & UniTN