log in |
Message boards : Number crunching : Validation problem (host #81) Client version v6.
Author | Message |
---|---|
Host #81 http://gene.disi.unitn.it/test/results.php?hostid=81 is producing nothing but validation errors. Although stderr and runtime seems to indicate valid results the validator cannot even check the host's output files simply because they are not in the server. As an example WU 3866 has two results, one in progress and one returned by computer 81, WU name is 16_At_fos-lgn_wu-1099_1396713783839 and, server side I have: boincadm@gene:~/projects/test$ find . -name "16_At_fos-lgn_wu-1099_1396713783839*" I really don't know what is happening, but it is happening only with this host. One strange thing is that, server side, I cannot see the host's IP number. Also the cpu describes itself as GenuineIntel Genuine Intel(R) CPU 000 @ 3.20GHz, which sounds a little strange to me... Any hints? | |
ID: 366 · Reply Quote | |
I'm not entirely sure what is going on either, I see nothing obvious in all the files on the machine. The chip itself is an engineering sample (I7-920) so that's why it appears as GenuineIntel Genuine Intel(R) CPU 000 @ 3.20GHz. | |
ID: 367 · Reply Quote | |
Could it be the version of BOINC I'm using on that particular host ? 6.12.34 4/7/2014 7:05:53 AM | TN-Grid Test Platform | [task] Process for 16_At_fos-lgn_wu-1229_1396713946319_0 exited
4/7/2014 7:05:53 AM | TN-Grid Test Platform | [task] task_state=EXITED for 16_At_fos-lgn_wu-1229_1396713946319_0 from handle_exited_app
4/7/2014 7:05:53 AM | TN-Grid Test Platform | [task] process exited with status 0
4/7/2014 7:05:53 AM | TN-Grid Test Platform | Computation for task 16_At_fos-lgn_wu-1229_1396713946319_0 finished
4/7/2014 7:05:53 AM | TN-Grid Test Platform | [task] result state=FILES_UPLOADING for 16_At_fos-lgn_wu-1229_1396713946319_0 from CS::app_finished
4/7/2014 7:05:53 AM | TN-Grid Test Platform | [task] result state=FILES_UPLOADED for 16_At_fos-lgn_wu-1229_1396713946319_0 from CS::update_results
4/7/2014 7:05:53 AM | TN-Grid Test Platform | [task] ACTIVE_TASK::start(): forked process: pid 2201
</code> | |
ID: 368 · Reply Quote | |
That's getting weird... I just checked the file system for the file mentioned in your log and found nothing, in the server log files the only reference I got is: grep 16_At_fos-lgn_wu-1229_1396713946319_0 log_gene/*
log_gene/gene_network_validator.log:md5_file: can't open /home/boincadm/projects/test/upload/da/16_At_fos-lgn_wu-1229_1396713946319_0_0
log_gene/gene_network_validator.log:2014-04-07 17:00:34.5171 [CRITICAL] [RESULT#8247 16_At_fos-lgn_wu-1229_1396713946319_0] md5_file() failed for /home/boincadm/projects/test/upload/da/16_At_fos-lgn_wu-1229_1396713946319_0_0: fopen() failed
log_gene/gene_network_validator.log:2014-04-07 17:00:34.5171 [CRITICAL] check_set: init_result([RESULT#8247 16_At_fos-lgn_wu-1229_1396713946319_0]) failed: fopen() failed
log_gene/gene_network_validator.log:2014-04-07 17:00:34.5224 [RESULT#8247 16_At_fos-lgn_wu-1229_1396713946319_0] Invalid [HOST#81]
Which is 'normal', I guess, the validator couldn't open the file because it's not on the disk.... Host#81 requests work from IP 75.189.xxx.161 (from scheduler.log) but there is nothing in file_upload_handler.log neither in /var/log/apache2/* (the following gives nothing...) grep 75.189 /var/log/apache2/* |grep upload |grep "6.12.34"
From the hosts page of the administrative interface I see that this host as NO internal IP address (it's blank): Info [BOINC|6.12.34]
Total credit 0
Average credit 0
Average update time 11 Feb 2014, 19:00:20 UTC
IP address
(same the last 856 times)
External IP address 75.189.xxx.161
Domain name dbase2
...
% of time host connected -100 % So, from the server side it seems that your host neither has a proper ip address and never connected to the server. I really don't know on which side the error is. I don't think your client version is the problem. The only thing I can suggest is to detach/reattach and see if something changes... | |
ID: 369 · Reply Quote | |
yes, that is strange. | |
ID: 370 · Reply Quote | |
I got this from Slicker who runs the Collatz project. Perhaps it's relevant here | |
ID: 371 · Reply Quote | |
I've just checked our config.xml and both options are in... The thing that I really don't understand is that I cannot see any requests for an upload connection in my apache logs.... Just to be sure to check everything, do you use a proxy? I just found this https://boinc.berkeley.edu/dev/forum_thread.php?id=6940 | |
ID: 372 · Reply Quote | |
no proxy at all. | |
ID: 373 · Reply Quote | |
I just installed a 6.12.34 client on a virtualized XP. If I will be able to catch some work (we have planned to stop the work generator for a couple of days) maybe I can replicate the problem... otherwise I will write for help to the boinc project mailing list... | |
ID: 374 · Reply Quote | |
WTF! Got exactly your same errors..... Time to ask for help to the boinc mailing lists... | |
ID: 375 · Reply Quote | |
I just installed a 6.12.34 client on a virtualized XP. https://www.microsoft.com/en-us/windows/enterprise/end-of-support.aspx :-P | |
ID: 376 · Reply Quote | |
We found that the error described in this thread affected the results returned by *any* pre v7 boinc clients (like Bok's v6.12.34 computer). It was a configuration error by our side. We just fixed it but please wait some days, better one week, before asking for new jobs or attaching using pre v7 client. | |
ID: 382 · Reply Quote | |
is the fix only for *new* wu's sent out? I had some units still on that machine and kicked them off, but it looks like they are getting the same errors. | |
ID: 383 · Reply Quote | |
looks like this computer has now started getting tasks validated :) | |
ID: 385 · Reply Quote | |
looks like this computer has now started getting tasks validated :) Ok, I'm glad that this problem has been solved. Thank you again for your cooperation. | |
ID: 386 · Reply Quote | |
Message boards :
Number crunching :
Validation problem (host #81) Client version v6.