Optimization

Message boards : Number crunching : Optimization

valterc
Project administrator
Project tester
Joined: 30 Oct 13
Posts: 620
Credit: 34,596,417
RAC: 4,991
Italy
Message 1052 - Posted: 13 Apr 2017, 9:40:07 UTC - in response to Message 1046.

Daniel did great work again; the modifications he made are both efficient and elegant. Thanks a lot.

As usual, I will wait for some time before making the new optimized application 'official'. This will probably happen at the end of May (I'll be quite busy until then).

sorcrosc
Volunteer developer
Joined: 19 Dec 13
Posts: 26
Credit: 3,866,632
RAC: 0
Italy
Message 1054 - Posted: 16 Apr 2017, 23:10:52 UTC

This is a test run on my FX8320 with a real vv workunit kept from the project, the same for every app. Best time of 3 runs.

-> gene_pcim_v1.03_linux64__sse2     user 36m 4s
-> gene_pcim_v1.03_linux64__avx      user 35m 44s
-> gene_pcim_v1.03_linux64__fma      user 34m 47s
-> TN-Grid.linux-x86-64-sse2-v1.2    user 32m 41s
-> TN-Grid.linux-x86-64-avx-v1.2     user 32m 15s
-> TN-Grid.linux-x86-64-fma-v1.2     user 31m 40s
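To put those numbers in perspective, here is a small sketch that turns the timings above into the share of CPU time saved per flavor. The labels "stock" (gene_pcim v1.03) and "optimized" (TN-Grid v1.2) and the function names are only illustrative; the timings are copied from the list above.

```python
# "user" CPU times from the post above (best of 3 runs on the FX8320).
TIMINGS = {
    "sse2": {"stock": "36m 4s", "optimized": "32m 41s"},
    "avx": {"stock": "35m 44s", "optimized": "32m 15s"},
    "fma": {"stock": "34m 47s", "optimized": "31m 40s"},
}


def to_seconds(text: str) -> int:
    """Convert a string such as '36m 4s' to a number of seconds."""
    minutes, seconds = text.split()
    return int(minutes.rstrip("m")) * 60 + int(seconds.rstrip("s"))


def time_saved_percent(stock: str, optimized: str) -> float:
    """Percentage of CPU time saved by the optimized build."""
    old, new = to_seconds(stock), to_seconds(optimized)
    return 100.0 * (old - new) / old


for flavor, t in TIMINGS.items():
    saved = time_saved_percent(t["stock"], t["optimized"])
    print(f"{flavor}: {saved:.1f}% less CPU time")
```

On this single workunit the saving works out to roughly 9 to 10% for each flavor, so the optimization helps about equally across SSE2, AVX and FMA on this CPU.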

NxtGenCowboy
Joined: 26 Jan 17
Posts: 5
Credit: 432,072
RAC: 0
United States
Message 1056 - Posted: 26 Apr 2017, 6:07:51 UTC
Last modified: 26 Apr 2017, 6:08:20 UTC

Could someone recommend which version I should be using with Intel L7555 CPUs? (4 of them)

Looks like 58 minutes using 1.03 SSE2 (Linux).

mmonnin
Joined: 24 Oct 16
Posts: 14
Credit: 4,507,347
RAC: 432
United States
Message 1057 - Posted: 28 Apr 2017, 20:44:23 UTC - in response to Message 1056.

> Could someone recommend which version I should be using with Intel L7555 CPUs? (4 of them)
>
> Looks like 58 minutes using 1.03 SSE2 (Linux).

Looks like that's the best app version for you; the L7555 does not support AVX or FMA:
http://www.cpu-world.com/CPUs/Xeon/Intel-Xeon%20MP%20L7555%20-%20AT80604004875AA.html
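If you would rather check on the machine itself than look up a spec page, something like the sketch below could help on Linux. It assumes the kernel exposes a "flags" line in /proc/cpuinfo (true on x86 Linux) and that the flag names sse2, avx and fma map onto the three app flavors; the function names are made up for this example.

```python
from typing import Optional

# Preference order discussed in this thread: FMA, then AVX, then SSE2.
PREFERENCE = ("fma", "avx", "sse2")


def read_linux_cpu_flags(path: str = "/proc/cpuinfo") -> set:
    """Return the CPU feature flags listed by the Linux kernel, or an empty set."""
    try:
        with open(path) as cpuinfo:
            for line in cpuinfo:
                if line.startswith("flags"):
                    return set(line.split(":", 1)[1].split())
    except OSError:
        pass  # not Linux, or the file is not readable
    return set()


def best_app_flavor(cpu_flags) -> Optional[str]:
    """Pick the most advanced flavor the CPU claims to support, if any."""
    for flavor in PREFERENCE:
        if flavor in cpu_flags:
            return flavor
    return None


print("Suggested flavor:", best_app_flavor(read_linux_cpu_flags()))
```

Treat the result as a starting point only: it tells you which builds can run, not which one is fastest, so timing each supported build on your own hardware is still worthwhile.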

NxtGenCowboy
Joined: 26 Jan 17
Posts: 5
Credit: 432,072
RAC: 0
United States
Message 1058 - Posted: 1 May 2017, 18:00:06 UTC - in response to Message 1057.

Thanks!

Buro87 [Lombardia]
Joined: 23 Nov 16
Posts: 100
Credit: 4,000,541
RAC: 0
Italy
Message 1059 - Posted: 4 May 2017, 11:18:10 UTC - in response to Message 1058.

Hi Daniel, I have a question for you.

On my i5-6400 (Win7 64-bit) I receive both AVX and SSE2 WUs.
If I want to install your optimization v1.2, which version do I need to copy into the project folder?
- TN-Grid.windows-x86-64-avx-v1.2
- TN-Grid.windows-x86-64-sse2-v1.2
Can I install both?

[B@P] Daniel
Volunteer developer
Joined: 19 Oct 16
Posts: 90
Credit: 2,205,103
RAC: 0
Poland
Message 1060 - Posted: 4 May 2017, 15:34:19 UTC - in response to Message 1059.
Last modified: 4 May 2017, 15:40:02 UTC

> Hi Daniel, I have a question for you.
>
> On my i5-6400 (Win7 64-bit) I receive both AVX and SSE2 WUs.
> If I want to install your optimization v1.2, which version do I need to copy into the project folder?
> - TN-Grid.windows-x86-64-avx-v1.2
> - TN-Grid.windows-x86-64-sse2-v1.2
> Can I install both?

Your CPU also supports FMA instructions, so you can also try the FMA app version: http://www.cpu-world.com/CPUs/Core_i5/Intel-Core%20i5-6400.html
In general the FMA app version should be faster than the AVX one, which in turn is faster than the SSE one. However, on some CPUs the FMA version is for some reason a bit slower than the AVX one, so please try both.

It is possible to install several versions, but you would have to rename the pc.exe files and modify app_info.xml to specify all app versions with the proper plan classes. The files I prepared are configured to run a single app version only.
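A minimal sketch of what such a multi-version app_info.xml might contain, generated here with a short Python script. The element names follow BOINC's anonymous platform format, but everything else is an assumption made for illustration: the app name gene_pcim and the plan class names are guessed from the stock executable names, the version number is a placeholder, and the renamed pc_*.exe file names are hypothetical. Copy the real values from the app_info.xml shipped in the archives.

```python
import xml.etree.ElementTree as ET

APP_NAME = "gene_pcim"   # assumption: guessed from the stock executable names
VERSION_NUM = "102"      # assumption: placeholder, use the value from the shipped app_info.xml
# Hypothetical renamed executables, keyed by (assumed) plan class.
BUILDS = {"sse2": "pc_sse2.exe", "avx": "pc_avx.exe", "fma": "pc_fma.exe"}


def build_app_info(app_name: str, version_num: str, builds: dict) -> ET.Element:
    """Build an <app_info> tree with one <app_version> per plan class."""
    root = ET.Element("app_info")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name

    # Every executable must be declared once as a file.
    for file_name in builds.values():
        file_info = ET.SubElement(root, "file_info")
        ET.SubElement(file_info, "name").text = file_name
        ET.SubElement(file_info, "executable")

    # One app version per plan class, each pointing at its own executable.
    for plan_class, file_name in builds.items():
        version = ET.SubElement(root, "app_version")
        ET.SubElement(version, "app_name").text = app_name
        ET.SubElement(version, "version_num").text = version_num
        ET.SubElement(version, "plan_class").text = plan_class
        file_ref = ET.SubElement(version, "file_ref")
        ET.SubElement(file_ref, "file_name").text = file_name
        ET.SubElement(file_ref, "main_program")
    return root


root = build_app_info(APP_NAME, VERSION_NUM, BUILDS)
ET.indent(root)  # pretty-printing helper, available from Python 3.9
print(ET.tostring(root, encoding="unicode"))
```

The point of the sketch is only the shape of the file: one file_info per renamed executable and one app_version per plan class. The BOINC client reads app_info.xml at startup, so it needs a restart after any change.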

Buro87 [Lombardia]
Joined: 23 Nov 16
Posts: 100
Credit: 4,000,541
RAC: 0
Italy
Message 1061 - Posted: 4 May 2017, 17:39:23 UTC - in response to Message 1060.

>> Hi Daniel, I have a question for you.
>>
>> On my i5-6400 (Win7 64-bit) I receive both AVX and SSE2 WUs.
>> If I want to install your optimization v1.2, which version do I need to copy into the project folder?
>> - TN-Grid.windows-x86-64-avx-v1.2
>> - TN-Grid.windows-x86-64-sse2-v1.2
>> Can I install both?
>
> Your CPU also supports FMA instructions, so you can also try the FMA app version: http://www.cpu-world.com/CPUs/Core_i5/Intel-Core%20i5-6400.html
> In general the FMA app version should be faster than the AVX one, which in turn is faster than the SSE one. However, on some CPUs the FMA version is for some reason a bit slower than the AVX one, so please try both.
>
> It is possible to install several versions, but you would have to rename the pc.exe files and modify app_info.xml to specify all app versions with the proper plan classes. The files I prepared are configured to run a single app version only.


OK, I will try the FMA app too. I have been using the i5-6400 since yesterday (60 WUs completed) and I haven't received any FMA WUs.
I will try all 3 apps starting tomorrow.

For now, both the SSE2 and AVX standard apps take between 1220 and 1290 s of CPU time.

Buro87 [Lombardia]
Joined: 23 Nov 16
Posts: 100
Credit: 4,000,541
RAC: 0
Italy
Message 1063 - Posted: 12 May 2017, 22:40:08 UTC - in response to Message 1061.

Hi Daniel, I have a problem installing the app on the i5-6400 PC.
In 10 days (around 500 WUs completed) I have received an equal number of SSE and AVX WUs, and no FMA WUs.
Both the SSE and AVX standard apps take around 1250 s, so I will try SSE first.

I copied both pc.exe and app_info.xml into C:\Users\All Users\BOINC\projects\gene.disi.unitn.it_test\ but it doesn't work.
Do I have to delete some existing files first?

koschi
Joined: 22 Oct 16
Posts: 25
Credit: 17,949,386
RAC: 1,367
Germany
Message 1064 - Posted: 15 May 2017, 8:18:50 UTC

Did you restart the BOINC Client after placing the files? Otherwise these won't be recognized...

Misho
Joined: 7 Dec 16
Posts: 3
Credit: 104,201
RAC: 0
Bulgaria
Message 1067 - Posted: 20 May 2017, 7:40:29 UTC

Hello from me too.
I am new to this project and want to ask one thing: how can I limit which apps are downloaded to my BOINC client? I want only FMA or only AVX, but the client starts downloading all the apps: AVX, SSE2 and FMA. What kind of app_config file must I use to get only the FMA app?

koschi
Joined: 22 Oct 16
Posts: 25
Credit: 17,949,386
RAC: 1,367
Germany
Message 1068 - Posted: 20 May 2017, 19:47:45 UTC
Last modified: 20 May 2017, 19:48:46 UTC

The server initially sends you all 3 flavors. After you return some results for each, it decides which one is best for you and will usually send you WUs tagged for just one app version.

Please note, those are the default applications, not the latest ones updated by Daniel. Check back at http://gene.disi.unitn.it/test/forum_thread.php?id=135&postid=1031#1031 and download a copy from his link. If you have AVX- or FMA-capable CPUs, head straight for those apps. Each archive contains an app_info.xml, so once it is activated you get only WUs for that application.

Misho
Joined: 7 Dec 16
Posts: 3
Credit: 104,201
RAC: 0
Bulgaria
Message 1069 - Posted: 21 May 2017, 2:10:04 UTC - in response to Message 1068.

So... what is the difference between the default app that I download automatically (for example gene_pcim_v1.02_win64__fma) and the same FMA one from the link and the archive? If I saw right, both are version 1.02? Which one must I use for better performance?

[B@P] Daniel
Volunteer developer
Joined: 19 Oct 16
Posts: 90
Credit: 2,205,103
RAC: 0
Poland
Message 1070 - Posted: 21 May 2017, 19:44:42 UTC - in response to Message 1069.

I asked rattorosso [Marche] to create apps for the remaining 3 ARM architectures: armv6_vfp, armv7_vfpv3 and aarch64. He sent them to me, and I uploaded them to the usual place: https://bitbucket.org/sirzooro/pc-boinc/downloads/. Feel free to download and test them too.

> So... what is the difference between the default app that I download automatically (for example gene_pcim_v1.02_win64__fma) and the same FMA one from the link and the archive? If I saw right, both are version 1.02? Which one must I use for better performance?

Both internally specify the same version 1.02, but the version from this thread has extra optimizations, so it runs faster than the official one. Valterc is going to take it and release it as a new version of the official one at the end of May.

Misho
Joined: 7 Dec 16
Posts: 3
Credit: 104,201
RAC: 0
Bulgaria
Message 1071 - Posted: 22 May 2017, 4:05:43 UTC - in response to Message 1070.

> Both internally specify the same version 1.02, but the version from this thread has extra optimizations, so it runs faster than the official one. Valterc is going to take it and release it as a new version of the official one at the end of May.


I tested on my PC and here is the result (for some reason the SSE2 app is the fastest one on my PC. CPU: i7 4790, 32 GB RAM @ 1600 MHz):
1. the official SSE2 app: Run time: 27 min 39 sec; CPU time: 27 min 30 sec;
2. the optimized SSE2 app: Run time: 28 min 17 sec; CPU time: 27 min 43 sec;

[AF>Le_Pommier] Jerome_C2005
Joined: 12 May 14
Posts: 8
Credit: 2,044,101
RAC: 0
Mexico
Message 1076 - Posted: 27 May 2017, 18:12:28 UTC

Hi,

Any news regarding the Mac OS version?

Partygott
Joined: 8 Apr 17
Posts: 3
Credit: 3,160,712
RAC: 0
Germany
Message 1077 - Posted: 29 May 2017, 18:01:26 UTC

I tested the same 16 WUs 3 times on a Ryzen 1700 @ 3.8 GHz with SMT on.

avx: 28:30 - 28:48
sse: 28:56 - 29:14
fma: 29:43 - 30:01
standard: ~35 min

i5-3570 (3.2 GHz) standard: ~28 min

So there is a difference of ~18% per WU on the Ryzen, which is pretty nice.
But I wonder why the Ryzen isn't faster than the old i5, even with the optimized app and 600 MHz more clock speed. I have seen this on other projects too.
It seems that most of the apps from all projects are optimized for Intel.
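As a quick check of the ~18% figure, here is a short sketch using the AVX range and the approximate 35 minute standard time quoted above. The helper names are invented for this example, and the baseline is only as precise as "~35 min".

```python
def mmss_to_seconds(text: str) -> int:
    """Convert a 'MM:SS' string to seconds."""
    minutes, seconds = text.split(":")
    return int(minutes) * 60 + int(seconds)


def percent_faster(baseline_s: int, new_s: int) -> float:
    """Share of the baseline run time that the new build saves, in percent."""
    return 100.0 * (baseline_s - new_s) / baseline_s


STANDARD_S = 35 * 60  # "~35 min" for the standard app on the Ryzen

for label, timing in (("best avx", "28:30"), ("worst avx", "28:48")):
    saved = percent_faster(STANDARD_S, mmss_to_seconds(timing))
    print(f"{label}: {saved:.1f}% shorter than standard")
```

Both ends of the AVX range land between 17% and 19%, which is consistent with the ~18% quoted.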

Jim1348
Joined: 29 Dec 16
Posts: 87
Credit: 21,013,002
RAC: 0
United States
Message 1078 - Posted: 30 May 2017, 0:58:15 UTC - in response to Message 1077.

> But I wonder why the Ryzen isn't faster than the old i5, even with the optimized app and 600 MHz more clock speed. I saw this also on other Projects.
> Seems to be that most of the apps from all projects are optimized for Intel.

It is more because with the i5-3570 you are using a full core, whereas with the Ryzen you are using a virtual core. So it is not bad at all, though you may be right that it is optimized for Intel.
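To illustrate the point about virtual cores, the sketch below compares whole-machine throughput instead of per-WU time. It assumes the Ryzen ran 16 WUs at once (one per SMT thread) and the i5-3570 ran 4 at once (one per physical core); the thread does not state the concurrency explicitly, so treat both counts as assumptions.

```python
def wus_per_hour(concurrent_tasks: int, minutes_per_wu: float) -> float:
    """Work units finished per hour when every task takes the same time."""
    return concurrent_tasks * 60.0 / minutes_per_wu


# Assumed concurrency: 16 SMT threads on the Ryzen 1700, 4 cores on the i5-3570.
ryzen = wus_per_hour(concurrent_tasks=16, minutes_per_wu=28.5)  # optimized AVX, 28:30
i5 = wus_per_hour(concurrent_tasks=4, minutes_per_wu=28.0)      # standard app, ~28 min

print(f"Ryzen 1700: {ryzen:.1f} WUs/hour")
print(f"i5-3570:    {i5:.1f} WUs/hour")
print(f"Ratio:      {ryzen / i5:.1f}x")
```

Under those assumptions the Ryzen finishes close to four times as many WUs per hour, even though each individual WU takes about as long as on the i5.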

europe64
Joined: 25 Oct 16
Posts: 1
Credit: 2,211,321
RAC: 0
Message 1082 - Posted: 1 Jun 2017, 12:40:06 UTC - in response to Message 1078.

Hi,
the FPU of the AMD processor is slow.

Jay
Joined: 2 Jun 17
Posts: 4
Credit: 373,931
RAC: 0
Message 1083 - Posted: 4 Jun 2017, 0:46:55 UTC - in response to Message 1068.

It appears that SSE2 was downloaded initially, and then it switched over to AVX when it realized that those run better (I have an i7-4770). However, it now seems to have switched back to SSE2? There also seems to be no sign of it running FMA...

Copyright © 2024 CNR-TN & UniTN