log in |
Message boards : Number crunching : OUT of tasks
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next
Author | Message |
---|---|
There are 239 tasks ready to send at the moment.I just modified wus*core from 8 to 6 (don't remember if I also need to restart the server), also reduced the deadline from 5 to 4 days.Thanks. We'll see if it resolves the situation or not. There are 35 tasks ready to send at the moment. I think in two days it will steadily decrease again to 0, as the now overfilled hosts will fall below th 6 wus/core treshold, and begin to replenish their queue faster than the generator could fill them (the hosts didn't get slower because their queue got shorter). So to ultimately resolve this issue you'll have to increase the number of chunks in the workunits to make the hosts run out of tasks slower than the workunit generator could feed them. We'll see. | |
ID: 3207 · Reply Quote | |
We are at 0 (Zero) available tasks again. | |
ID: 3208 · Reply Quote | |
We are at 0 (Zero) available tasks again. This would be good but we don't have the resources. Only one instance of the work generator can be run at the same time (it needs a rewrite). Also our current hardware would not be able to support it. We need new hardware and a (slightly new) work generator, we are working on it, but it is not that easy (finding money) | |
ID: 3209 · Reply Quote | |
Maybe try generating 2x bigger work units, that take 2x more CPU time to process ? +deadline must be 2x longer, obviously. | |
ID: 3210 · Reply Quote | |
That's what I would suggest as well, and that's what Retvari Zoltan wrote: ...to double workunit lenght and halve the maximum number of cached workunits per core. Of course, you could then vary the max cache and/or the deadline to hold the time to finish constant. | |
ID: 3211 · Reply Quote | |
There are 600 chunks in the "normal" workunits (1-76), the last one has 400 chunks (judging by the ratio of their runtime). 46000|2
23000|2
11500|2
5750|2
2875|5
575|5
115|5
23|23
1| It's not practical to double the number of chunks, as in this case the last workunit will be the half of the present ones.I suggest to have 1000 chunks in a workunit, as in this case the workunits will be 2/3 longer than the present ones, and the last one will be the same length. 1000*46 (2*2*2*5*5*5)*(2*23)
920*50 (2*2*2*5*23)*(2*5*5) The other possibility is to have 920 chunks, in this case every gene expansion would made up by 50 workunits. | |
ID: 3212 · Reply Quote | |
Since I have some extra Linux server capacity, I can definitely accept more BOINC work. So whatever is easier for work-generator to generate ... | |
ID: 3213 · Reply Quote | |
There are 600 chunks in the "normal" workunits (1-76), the last one has 400 chunks (judging by the ratio of their runtime). I could support this option as it would also reduce the download/upload activity on the server as well (fewer WUs to manage). | |
ID: 3214 · Reply Quote | |
But if we do make bigger Work-units, please make sure to adjust "deadline" too. | |
ID: 3215 · Reply Quote | |
I propose to make work units made of 1150 chunks x 40 WUs per gene. | |
ID: 3216 · Reply Quote | |
I might try this solution but I don't think it will work, I will try to explain why. | |
ID: 3217 · Reply Quote | |
Now I understand the work generator process. | |
ID: 3218 · Reply Quote | |
Right now a single run of the work generator builds 77 workunits (154 tasks because of the validation requisites). Say that, for example, the result is computed, on a ideal computer, in one hour.I'm glad that I have a better than ideal computer, as my i9-12900F can do one workunit in 35 minutes :) So this will keep busy that ideal computer for 77 hours. If I, theoretically, will pack all the tiles into a single workunit this will keep that computer busy for 77 hoursI would crunch such workunits. It would take 45 hours for my better than ideal computer. ...exactly the same time but increasing the risks of computational errors.That's probably true for "average" computers, but there are quite a few dedicated crunching boxes, so there could be a queue for those. | |
ID: 3219 · Reply Quote | |
As we enter summer in the northern hemisphere and TOU peak rates it does not seem like a donor-friendly thing to make the WUs run longer. Once my queue got filled it remained loaded so I don't see a problem. | |
ID: 3220 · Reply Quote | |
Since I have some extra Linux server capacity, I can definitely accept more BOINC work. So whatever is easier for work-generator to generate ... It appears that you're running on dual-CPU server motherboards. Many of your CPUs are Intel Xeon E5-2680 v4 @ 2.40GHz (56 processors). An E5-2680 v4 is a 14c/28t CPU. Do you actually run 56 WUs on a single computer? My CPUs are 18c/36t and they now have from 71 to 176 WUs Ready to Start in addition to those running. I suspect a shortage of WUs is uniquely your issue. Do you never get over 1,032 WUs per computer? Do you ever get over 256 WUs per server? If you have idle threads you could help with cancer research by running MCM and SCC at WCG. Just a thought. | |
ID: 3221 · Reply Quote | |
Maybe I can answer some of that: I suspect a shortage of WUs is uniquely your issue. As long as tasks ready to send are not rising, someone's cache is not satisfied, the client is crunching faster than the cache can fill, ergo they run out of TN tasks sooner or later. The bottleneck is therefore at the moment the work generator, not the power-sum of our computers. Of course users should let them work on multiple projects if they desire to prevent idle cores. Source Do you never get over 1,032 WUs per computer? In general, the Boinc client has a hard limit of 1000 tasks, above that it refuses to request more. | |
ID: 3222 · Reply Quote | |
I'm glad that I have a better than ideal computer, as my i9-12900F can do one workunit in 35 minutes :) Nice time for a work unit. I have a Ryzen 3900 X and it takes over an hour to do a work unit. Perhaps this project prefers Intel. Using only your performance cores assuming you can get in case you can return 13.76 results in hour. Maths to get that is: 8×1.72 | |
ID: 3223 · Reply Quote | |
I dedicate 50% threads to BOINC (WCG/SiDock/Rosetta/TN-Grid) and 50% to Stockfish Fishtest chess. | |
ID: 3224 · Reply Quote | |
AMD Zen 2 cores are an equivalent of older Intel Skylake cores. (6th Gen up to 10th Gen Core i7 chips). They are significantly slower than newer Intel 12th and 13th gen Core i7 chips. | |
ID: 3225 · Reply Quote | |
I'm glad that I have a better than ideal computer, as my i9-12900F can do one workunit in 35 minutes :) I don't know about Intel vs AMD but it does prefer Linux. 10-15% faster IIRC. ____________ | |
ID: 3226 · Reply Quote | |
Message boards :
Number crunching :
OUT of tasks