Posts by Aurum
1) Message boards : Number crunching : OUT of tasks (Message 3221)
Posted 27 May 2023 by Aurum
Since I have some extra Linux server capacity, I can definitely accept more BOINC work. So whatever is easier for work-generator to generate ...


It appears that you're running on dual-CPU server motherboards. Many of your CPUs are Intel Xeon E5-2680 v4 @ 2.40GHz (56 processors). An E5-2680 v4 is a 14c/28t CPU. Do you actually run 56 WUs on a single computer?
My CPUs are 18c/36t and they now have from 71 to 176 WUs Ready to Start in addition to those running.
I suspect a shortage of WUs is uniquely your issue. Do you never get over 1,032 WUs per computer? Do you ever get over 256 WUs per server?
If you have idle threads you could help with cancer research by running MCM and SCC at WCG. Just a thought.
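The thread arithmetic behind that question can be sketched in a few lines of Python. The function name and the two-socket assumption are illustrative only; the 14c/28t and 18c/36t figures come from the post above.

```python
def logical_threads(sockets: int, cores_per_socket: int, threads_per_core: int = 2) -> int:
    """Logical processors the OS would report for a multi-socket machine."""
    return sockets * cores_per_socket * threads_per_core

# Two E5-2680 v4 CPUs (14c/28t each), assuming a dual-socket board.
dual_e5_2680_v4 = logical_threads(sockets=2, cores_per_socket=14)

# One 18c/36t CPU, as described in the reply.
single_18c = logical_threads(sockets=1, cores_per_socket=18)

print(dual_e5_2680_v4, single_18c)
```

If the first figure matches the "56 processors" shown for the host, running 56 WUs at once means one WU per logical thread.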
2) Message boards : Number crunching : OUT of tasks (Message 3220)
Posted 27 May 2023 by Aurum
As we enter summer in the northern hemisphere and TOU (time-of-use) peak rates, it does not seem like a donor-friendly thing to make the WUs run longer. Once my queue got filled it remained loaded so I don't see a problem.
My electric utility just blind-sided us by adding June to July-September and more hours. TOU season requires more babysitting. I'm contemplating shutting down until October.
3) Message boards : Number crunching : SSE2 AVX FMA ? (Message 3203)
Posted 23 May 2023 by Aurum
Theoretically AVX2+FMA should be most energy efficient

Can you cite the research that proves that claim?

Here's why I say SSE is the most energy efficient:
Thermal design power and vectorized instructions behavior, Amina Guermouche & Anne-Cécile Orgerie, CONCURRENCY & COMPUTATION: PRACTICE & EXPERIENCE, Feb 2021.
https://hal.archives-ouvertes.fr/hal-03185821/document
4) Message boards : Number crunching : Curious (Message 3189)
Posted 22 May 2023 by Aurum
Are the HS WUs grape or human? Just curious.
E.g., 236831_Hs_T198489-LAGE3_wu-134_1684763098446
5) Message boards : Number crunching : SSE2 AVX FMA ? (Message 3188)
Posted 22 May 2023 by Aurum
SSE is the most energy efficient.
6) Message boards : News : Maintenance (filesystem and o.s.) (Message 3154)
Posted 28 Apr 2023 by Aurum
Outstanding! Will we be back on the grape again?
7) Message boards : Number crunching : SSE2 with app_info? (Message 2862)
Posted 2 Sep 2022 by Aurum
It's astounding that no one has even read this paper. I think TN-GRID should stop issuing AVX and FMA WUs.
Depending on the program, SSE could use as little as 60% of the power used by other instruction sets. For an i9-7980XE with a 165 W TDP that's a savings of up to 66 W per computer.
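The 66 W figure follows from simple arithmetic, sketched here in Python. It assumes the CPU draws its full TDP under the other instruction sets, so it is an upper bound and not a measurement; the function name is made up for illustration.

```python
def max_power_saving_watts(tdp_watts: float, sse_power_ratio: float) -> float:
    """Upper-bound saving if SSE draws sse_power_ratio of a full-TDP load."""
    return tdp_watts * (1.0 - sse_power_ratio)

# 165 W TDP (i9-7980XE) and the best-case 60% ratio quoted in the post.
saving = max_power_saving_watts(tdp_watts=165.0, sse_power_ratio=0.60)
print(f"Up to {saving:.0f} W per computer")
```

Any ratio closer to 1.0 shrinks the saving proportionally, so the real number depends on the workload.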
8) Message boards : Number crunching : SSE2 with app_info? (Message 2815)
Posted 7 Aug 2022 by Aurum
My point is energy efficiency, not run time.
9) Message boards : Number crunching : OUT of tasks (Message 2812)
Posted 6 Aug 2022 by Aurum
I suppose it can be a pain in the butt and not ideal since you'd expect the full 5 days to complete and report the work unit.

I personally only set 4 days of work for my BOINC clients on this project. I rarely get a cancelled unit, and if I do it's one I haven't started yet.


I never set a 5 day cache. Where'd you get that idea?
I always use either 0.5/0.01 days or 1.0/0.01 days. I prefer RZM, but lately there's been so much demand for WUs that it lets me run dry. Maybe the races are over and I should go back to RZM.
I'm not the one that waited until near the deadline. I'm one of the ones that got the replacement WUs. Then within a couple of hours of the deadline for the original WUs they submit them at the last minute and the replacement WUs get Server Aborted. If the replacement WUs were not sent out until after the deadline then this should never happen.
10) Message boards : Number crunching : OUT of tasks (Message 2803)
Posted 2 Aug 2022 by Aurum
Does the Work Generator need a little adjustment? I've been getting a lot of Server Aborts. It appears that shortly before the Deadline another pair of WUs is sent out to new computers. Then, when the original WUs actually get submitted before the Deadline, the duplicates get Server Aborted. E.g.,
http://gene.disi.unitn.it/test/workunit.php?wuid=36782082
http://gene.disi.unitn.it/test/workunit.php?wuid=36765727
http://gene.disi.unitn.it/test/workunit.php?wuid=36669384
Waiting until after the actual Deadline to send out replacements might be more efficient.
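The suggestion can be expressed as a small rule, sketched here in Python. This only illustrates the proposal; it is not BOINC or TN-Grid server code, and the function name and dates are hypothetical.

```python
from datetime import datetime, timedelta

def should_send_replacement(now: datetime, deadline: datetime, reported: bool) -> bool:
    """Send a replacement only if the original is unreported and overdue."""
    return (not reported) and now > deadline

# Hypothetical deadline, used only to exercise the rule.
deadline = datetime(2022, 8, 2, 12, 0)
two_hours_before = deadline - timedelta(hours=2)
one_minute_after = deadline + timedelta(minutes=1)

early = should_send_replacement(two_hours_before, deadline, reported=False)
late = should_send_replacement(one_minute_after, deadline, reported=False)
done = should_send_replacement(one_minute_after, deadline, reported=True)
print(early, late, done)
```

Under this rule a result returned shortly before the Deadline never triggers a duplicate, so there is nothing to Server Abort.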
11) Message boards : Number crunching : SSE2 with app_info? (Message 2792)
Posted 30 Jul 2022 by Aurum
We may need to give up the Louisiana Purchase.

Well there goes the farm belt :-)

But does anyone agree with the authors that SSE uses less CPU Wattage than other instruction sets?
12) Message boards : Number crunching : SSE2 with app_info? (Message 2779)
Posted 22 Jul 2022 by Aurum
With the Northern Hemisphere on fire I'm surprised no one has taken any interest in this paper I posted about reducing CPU power consumption.

I switched all computers to SSE2.
13) Message boards : Number crunching : SSE2 with app_info? (Message 2763)
Posted 8 Jul 2022 by Aurum
Still curious about which instruction set is the most energy efficient, or at least runs the CPU at the coolest temperature under full workload.
Found this interesting looking paper:
Thermal design power and vectorized instructions behavior, Amina Guermouche & Anne-Cécile Orgerie, CONCURRENCY & COMPUTATION: PRACTICE & EXPERIENCE, Feb 2021.
https://hal.archives-ouvertes.fr/hal-03185821/document

I haven't read the whole thing yet but it seems to imply SSE has the lowest power ratio for both CPU operation and DRAM. They also test AVX512.
"AVX2 extension adds fused multiply add instructions (FMA)."
Does this mean what we label as FMA is also AVX2?
They use the term memory-bound which I confess I don't understand. E.g., https://www.intel.com/content/www/us/en/develop/documentation/vtune-help/top/reference/cpu-metrics-reference/memory-bound.html
Are TN-Grid WUs memory-bound?
Do TN-Grid WUs ever trigger Intel CPU Turbo Boost?
14) Message boards : Number crunching : SSE2 with app_info? (Message 2745)
Posted 27 Jun 2022 by Aurum
I'm not sure I can tell the difference. I have an older Ensupra Energy Monitor. The newer ones have a graph. I thought it had an averaging feature but I can't find the instructions. The range I saw has a big overlap and may be confounded by GPU or RAM fluctuations.
15) Message boards : Number crunching : SSE2 with app_info? (Message 2744)
Posted 24 Jun 2022 by Aurum
5 Watts on a 16c/32t CPU is well worth the effort. Think I'll get out the watt meter and run a test. Thanks for the tip.
16) Message boards : Number crunching : SSE2 with app_info? (Message 2739)
Posted 24 Jun 2022 by Aurum
... also do a "chmod a+x"

Thanks. That is what I was missing. I had done the other stuff.

I'm lazy, I make a copy of my app_config and rename it, comes with permissions preloaded :-)
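For anyone scripting the setup, the effect of "chmod a+x" can be reproduced in Python as in this minimal sketch. The helper name is made up, and the demonstration uses a throwaway temporary file because the actual application file name is not given in this thread.

```python
import os
import stat
import tempfile

def add_execute_bits(path: str) -> int:
    """Equivalent of `chmod a+x path`: add execute for user, group, and other."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    new_mode = mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH
    os.chmod(path, new_mode)
    return new_mode

# Demonstration on a temporary file standing in for the application binary.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    demo_path = tmp.name
demo_mode = add_execute_bits(demo_path)
is_executable = os.access(demo_path, os.X_OK)
os.remove(demo_path)
print(oct(demo_mode), is_executable)
```

Copying a file that already has the right permissions, as described above, achieves the same result without any of this.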
17) Message boards : Number crunching : SSE2 with app_info? (Message 2738)
Posted 24 Jun 2022 by Aurum
I would like to run the SSE2 version rather than the fma version on a Linux (Ubuntu 20.04.4) machine for reduced heat production.

We just started another heat wave here. Do you have a feel for the percent Wattage reduction?

Warning: For an app_info.xml to take effect you must restart BOINC. When you do it will delete every TN-Grid WU you have and DL only SSE2 WUs and start fresh.
18) Message boards : Number crunching : Curious (Message 2714)
Posted 8 Jun 2022 by Aurum
Nice work Valter! Mine have all ULed and back to normal :-)
19) Message boards : Number crunching : Curious (Message 2711)
Posted 7 Jun 2022 by Aurum
I have two finished tasks that I haven't been able to get uploaded to the server all day for some reason.

Tried all the tricks that I know, but nothing has worked. Just keep getting retried. Counts are 11 and 14 attempts so far.

I also have two tasks from different computers that refuse to upload. Is there a way to rectify this, or should they just be aborted?

Don't abort them. They can be aborted from the server if that becomes necessary. They do not interfere with new WUs DLing so we can keep on crunching.
20) Message boards : Number crunching : Curious (Message 2701)
Posted 5 Jun 2022 by Aurum
On the server status page there is a higher than usual number of "tasks in progress". Will check tomorrow, while back in the office, if there is something strange on the server.
That does seem high. Does it include the Ready To Start WUs as well?
Every few days my Ready To Start WUs accumulate to almost 300 and I switch preferences to Resource Zero Mode. TN-GRID works really well in RZM and never seems to give me more than one extra WU waiting in the wings. But at Resource 100% it does not seem to honor the BOINC preference for how much work to buffer. All my computers are set to either 0.5 or 1.0 days but you send more than that. I believe some projects limit the maximum number of WUs to twice the number of CPU threads, and GPUgrid limits it to twice the number of GPUs.
It's been a good while since I've noticed the server running out of available WUs. Nice work tuning it up.
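The interaction between a day-based buffer and a per-thread cap can be sketched in Python as below. This is an illustration of the idea only, not how the BOINC scheduler or any project actually computes it; the function names and the 2-hour run time per WU are assumptions.

```python
import math

def buffered_tasks(threads: int, buffer_days: float, hours_per_task: float) -> int:
    """WUs needed to keep every thread busy for buffer_days."""
    return math.ceil(threads * buffer_days * 24.0 / hours_per_task)

def capped_tasks(requested: int, threads: int, per_thread_limit: int = 2) -> int:
    """Apply a cap of per_thread_limit WUs per CPU thread."""
    return min(requested, per_thread_limit * threads)

# 36 threads and a 0.5 day buffer as in the post; 2 h per WU is hypothetical.
requested = buffered_tasks(threads=36, buffer_days=0.5, hours_per_task=2.0)
allowed = capped_tasks(requested, threads=36)
print(requested, allowed)
```

With these assumed numbers the day-based buffer asks for far more WUs than a twice-the-threads cap would allow, which is the kind of limit the post describes.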




Copyright © 2024 CNR-TN & UniTN