Help with invalid tasks and computation errors?

Message boards : Number crunching : Help with invalid tasks and computation errors?

autouzi
Joined: 14 Jan 20
Posts: 3
Credit: 0
RAC: 0
United States
Message 1665 - Posted: 14 Jan 2020, 3:39:48 UTC

Would anybody who knows the terminology mind taking a look at my invalid tasks and errors to see why they are happening? I use GRC Pool, so you will need to use the link to my PC at the bottom of this post.

Primarily, I am interested in the computation errors and why they are happening. I have not been able to reproduce any instability in other tests. I am also confused about how a task can be invalid without being a computation error.

Any help is appreciated!
Link to computer 56280
http://gene.disi.unitn.it/test/results.php?hostid=56280&offset=0&show_names=0&state=0&appid=

Profile valterc
Project administrator
Project tester
Joined: 30 Oct 13
Posts: 452
Credit: 22,345,282
RAC: 9,748
Italy
Message 1667 - Posted: 14 Jan 2020, 10:05:49 UTC - in response to Message 1665.

Sometimes, for many different reasons, the computation ends 'correctly' but the results are not correct. This is why we implement the 'redundancy' feature: a result is marked correct only if it is bit-wise identical to another one.
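
As a rough illustration of the redundancy idea (not the project's actual validator), the sketch below marks a result valid only when at least one other host returned exactly the same bytes. The function name, the host labels, and the in-memory dictionary are all hypothetical; a real validator would read the uploaded output files.

```python
from collections import Counter


def validate_by_redundancy(outputs):
    """Return {host: True/False} for a single workunit.

    `outputs` maps a (hypothetical) host label to the raw bytes of the
    result file that host returned. A result counts as valid only if at
    least one other host produced a bit-wise identical file.
    """
    # bytes objects compare and hash by content, so identical files
    # fall into the same Counter bucket.
    copies = Counter(outputs.values())
    return {host: copies[data] >= 2 for host, data in outputs.items()}


if __name__ == "__main__":
    example = {
        "host_a": b"\x01\x02\x03",
        "host_b": b"\x01\x02\x03",
        "host_c": b"\x01\x02\x04",  # finished without error, but differs
    }
    print(validate_by_redundancy(example))
```

In this example host_c finished without any computation error, yet its output matches nobody else's, which is how a task can end up 'invalid' rather than 'error'.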

We also know we have a small bug in our code (very infrequent, and one we have not been able to catch). In some cases, when the computation of a task is stopped at the very beginning, before the first checkpoint, the output file becomes 'inconsistent', and the computation then produces an 'invalid' result (this can be the case if you see 'Start from checkpoint: 1' in the log). Keeping a small workunit queue (which keeps BOINC from going into 'rush' mode) will mitigate this problem.
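
To check a task's log for that line without reading it by eye, something like the following minimal sketch could be used. It assumes you have already copied the task's stderr output into a string; only the quoted text 'Start from checkpoint: 1' comes from the log itself, and the function name is made up for illustration.

```python
import re

# Match "Start from checkpoint: 1" but not "Start from checkpoint: 12".
_FIRST_CHECKPOINT = re.compile(r"Start from checkpoint: 1(?!\d)")


def restarted_from_first_checkpoint(stderr_text):
    """Return True if the log shows a restart from checkpoint 1."""
    return _FIRST_CHECKPOINT.search(stderr_text) is not None


if __name__ == "__main__":
    sample_log = "some line\nStart from checkpoint: 1\nanother line\n"
    print(restarted_from_first_checkpoint(sample_log))
```

A match only tells you the task was restarted at that point; it does not by itself prove the result was affected by the bug.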

Error 194 is sometimes an effect of the computer being unresponsive because of too much load; see http://wuprop.boinc-af.org/forum_thread.php?id=402

autouzi
Joined: 14 Jan 20
Posts: 3
Credit: 0
RAC: 0
United States
Message 1668 - Posted: 15 Jan 2020, 1:22:31 UTC - in response to Message 1667.

So fairly normal. Thank you for your response and all you do for this project! This is my favorite project available on GRC Pool because of the potential to help us better understand the complex subject of genetics.

Timber
Joined: 20 Jan 20
Posts: 5
Credit: 540,906
RAC: 0
Canada
Message 1672 - Posted: 21 Jan 2020, 17:57:16 UTC

3 failed (and errored) tasks so far on a Ryzen 7 1800x, running Windows 10.
An example of one of the errored tasks:
https://gene.disi.unitn.it/test/result.php?resultid=46767050
the machine this is happening on:
https://gene.disi.unitn.it/test/show_host_detail.php?hostid=56399
At least it's not a day of work lost. None of the tasks have shown as invalid, yet.

Jim1348
Joined: 29 Dec 16
Posts: 41
Credit: 6,038,899
RAC: 9,486
United States
Message 1673 - Posted: 21 Jan 2020, 20:36:37 UTC - in response to Message 1672.

3 failed (and errored) tasks so far on a Ryzen 7 1800x, running Windows 10.

It could be the segfault error. I see them on my Ryzen 1700 occasionally, but not on my Ryzen 2700. (And my Ryzen 1700 is one of the "fixed" versions, produced after they introduced the fix.)




Copyright © 2020 CNR-TN & UniTN