Completed work not uploading
log in

Advanced search

Message boards : Number crunching : Completed work not uploading

Author Message
arcturus
Send message
Joined: 18 May 22
Posts: 17
Credit: 5,806,368
RAC: 0
United States
Message 3035 - Posted: 20 Dec 2022, 15:42:14 UTC

Despite the upload server status showing 'running.' This is happening on a couple of computers.

pitngrid
Send message
Joined: 6 Oct 22
Posts: 1
Credit: 77,794
RAC: 0
United States
Message 3036 - Posted: 20 Dec 2022, 18:44:15 UTC

I have the same problem. BOINC Manager says the completed work unit is uploading, but it isn't.

This is on a Raspberry Pi, which takes several days to complete a work unit; the deadline is today, so I would like the work unit submitted soon to get credit for work completed.

Profile valterc
Project administrator
Project tester
Send message
Joined: 30 Oct 13
Posts: 623
Credit: 34,677,535
RAC: 1
Italy
Message 3037 - Posted: 20 Dec 2022, 23:23:42 UTC - in response to Message 3036.
Last modified: 20 Dec 2022, 23:24:08 UTC

https://gene.disi.unitn.it/test/forum_thread.php?id=355

Nothing I can do but wait for the problem to be solved by the University's staff.

Speedy
Send message
Joined: 13 Nov 21
Posts: 33
Credit: 1,020,742
RAC: 0
New Zealand
Message 3038 - Posted: 22 Dec 2022, 1:54:08 UTC

All of my work has uploaded yesterday I think I something like 33 results waiting

walli
Send message
Joined: 14 Jan 17
Posts: 2
Credit: 8,729,342
RAC: 0
Germany
Message 3039 - Posted: 22 Dec 2022, 3:37:39 UTC
Last modified: 22 Dec 2022, 3:40:38 UTC

My clients are still unable to upload files and show messages like:

[error] Error reported by file upload server: [224090_Hs_T129315-RYK_wu-269_1671451150189_2_0] locked by file_upload_handler PID=-1
or
[error] Error reported by file upload server: can't lock file /storage/boinc/upload//180/224124_Hs_T201713-RPS4Y2_wu-50_1671498997135_0_0

walli
Send message
Joined: 14 Jan 17
Posts: 2
Credit: 8,729,342
RAC: 0
Germany
Message 3042 - Posted: 23 Dec 2022, 2:57:31 UTC
Last modified: 23 Dec 2022, 3:00:27 UTC

This

... locked by file_upload_handler PID=-1

seems to have been fixed and affected tasks could be uploaded. The other problem

[error] Error reported by file upload server: can't lock file /storage/boinc/upload...

still persists. The deadline of all my remaining tasks is < 2 days. I hope this can be fixed in time as well :).

Retvari Zoltan
Send message
Joined: 31 Mar 20
Posts: 43
Credit: 51,206,467
RAC: 0
Hungary
Message 3082 - Posted: 18 Feb 2023, 1:24:44 UTC

This issue hit my hosts again...
Is it still the filesystem to blame?

Profile valterc
Project administrator
Project tester
Send message
Joined: 30 Oct 13
Posts: 623
Credit: 34,677,535
RAC: 1
Italy
Message 3083 - Posted: 18 Feb 2023, 10:47:01 UTC - in response to Message 3082.
Last modified: 18 Feb 2023, 10:47:40 UTC

Yes, now it's working. I will try to check the "unable to upload" problem as soon as possible. Unfortunately it's Saturday:)

Profile Keith Myers
Send message
Joined: 26 Jun 20
Posts: 64
Credit: 15,299,594
RAC: 0
United States
Message 3086 - Posted: 22 Feb 2023, 1:47:38 UTC - in response to Message 3083.

Yes, now it's working. I will try to check the "unable to upload" problem as soon as possible. Unfortunately it's Saturday:)

Still having an issue with not being able to return results.

Still going into multi-hour backoffs. Server status page shows all server processes running OK.
____________

A proud member of the OFA (Old Farts Association)

Umlauf
Send message
Joined: 18 May 20
Posts: 1
Credit: 1,410,510
RAC: 0
Germany
Message 3087 - Posted: 22 Feb 2023, 14:34:58 UTC

I still can´t up- or download any wu´s or results.

entity
Send message
Joined: 20 Jul 20
Posts: 20
Credit: 31,475,949
RAC: 0
United States
Message 3088 - Posted: 22 Feb 2023, 18:43:02 UTC - in response to Message 3087.

I have moved to Denis since they now have lots of work. Last time i looked they had about 62,000 WUs unsent

Profile Keith Myers
Send message
Joined: 26 Jun 20
Posts: 64
Credit: 15,299,594
RAC: 0
United States
Message 3089 - Posted: 28 Feb 2023, 17:57:56 UTC

The project has an endemic problem with uploading results. I'm going to stop crunching for them.
____________

A proud member of the OFA (Old Farts Association)

TLD
Send message
Joined: 16 Feb 22
Posts: 3
Credit: 9,252,686
RAC: 0
United States
Message 3090 - Posted: 28 Feb 2023, 18:40:24 UTC

Upload problem is back.

Profile adrianxw
Send message
Joined: 22 Dec 16
Posts: 36
Credit: 8,489,198
RAC: 0
Denmark
Message 3091 - Posted: 28 Feb 2023, 21:30:30 UTC
Last modified: 28 Feb 2023, 21:34:24 UTC

>>> The project has an endemic problem with uploading results. I'm going to stop crunching for them.

Why? It's been off most of the day, but my uploads have all cleared within the last couple of hours. BOINC handles this kind of thing fine.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

entity
Send message
Joined: 20 Jul 20
Posts: 20
Credit: 31,475,949
RAC: 0
United States
Message 3092 - Posted: 28 Feb 2023, 22:50:50 UTC - in response to Message 3091.

Not totally, I have 173 errors of which half are jobs cancelled by server and the other half are labeled Timed out -- No response. These are jobs that couldn't be uploaded in time due to either client back-off or due to files on the server that remain locked and cannot be opened when client tries the upload process again. The later can only be fixed by a server admin.

Profile adrianxw
Send message
Joined: 22 Dec 16
Posts: 36
Credit: 8,489,198
RAC: 0
Denmark
Message 3093 - Posted: 1 Mar 2023, 12:33:49 UTC - in response to Message 3092.

Cancelled by the server might go into the "errors" box, but really, it is not a client error, there is a limit on the number of boxes to put the figure. The server has been instructed to cancel remaining tasks already sent for a particular job, or class of jobs, if the criteria it is given are met. There are several, probably many, possible reasons for this. It might have been instructed to cancel jobs because an error has been found with them, alternatively, if the server has already received sufficient replies from other crunchers which are valid it cancels outstanding jobs to prevent crunchers wasting their time. You can get this message from all projects, I have received several of these messages from Rosetta this week for example.
Timed out is simply what it says, no reply has been successfully received for a job it has sent you before the given deadline. The deadline is used by the server to ensure that a task does, indeed, get processed. It allows it to resend the job to another cruncher so that the result can be assimilated.
Both of these "errors" can, and will be seen at all projects. A common cause is a client downloading excessive quantities of work units for the project.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

entity
Send message
Joined: 20 Jul 20
Posts: 20
Credit: 31,475,949
RAC: 0
United States
Message 3094 - Posted: 1 Mar 2023, 14:13:59 UTC - in response to Message 3093.

I understand fully what you are saying and you are correct. The server cancelled WUs are not a problem as they were the work that was in my queue and not started when quorum was established for the WU. The other work however was work that was stuck in upload and missed the deadline not because there was too much work queued (this project limits the amount of work queued to twice the number of threads) but because the client was in an extended backoff. If I micro managed the client, I would have noticed the extended backoff and forced an update which would have met the deadline. I just wish they would fix that filesystem issue once and for all.

Profile adrianxw
Send message
Joined: 22 Dec 16
Posts: 36
Credit: 8,489,198
RAC: 0
Denmark
Message 3098 - Posted: 2 Mar 2023, 11:15:40 UTC - in response to Message 3094.

Fair enough. I've never had that problem.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.


Post to thread

Message boards : Number crunching : Completed work not uploading


Main page · Your account · Message boards


Copyright © 2024 CNR-TN & UniTN