log in
1) Message boards : Number crunching : No work? (Message 1219)
Posted 19 Jan 2009 by Profile Viking69
Well the new work is great, but I cannot upload.

the current messages are for teh 2 WU's I have on one of my PC's is:

1/19/2009 1:54:29 AM|MindModeling@Beta|Started upload of SASTNM_9_7-3-5919_1232319446_0_0
1/19/2009 1:54:30 AM|MindModeling@Beta|[error] Error on file upload: Server is out of disk space
1/19/2009 1:54:30 AM|MindModeling@Beta|Temporarily failed upload of SASTNM_9_7-3-5919_1232319446_0_0: transient upload error
1/19/2009 1:54:30 AM|MindModeling@Beta|Backing off 3 hr 10 min 3 sec on upload of SASTNM_9_7-3-5919_1232319446_0_0
1/19/2009 1:55:32 AM|MindModeling@Beta|Started upload of SASTNM_9_7-3-5919_1232319446_0_0
1/19/2009 1:55:34 AM|MindModeling@Beta|[error] Error on file upload: Server is out of disk space
1/19/2009 1:55:34 AM|MindModeling@Beta|Temporarily failed upload of SASTNM_9_7-3-5919_1232319446_0_0: transient upload error
1/19/2009 1:55:34 AM|MindModeling@Beta|Backing off 2 hr 10 min 25 sec on upload of SASTNM_9_7-3-5919_1232319446_0_0
1/19/2009 1:55:37 AM|MindModeling@Beta|Started upload of SASTNM_9_7-3-7072_1232316416_0_0
1/19/2009 1:55:38 AM|MindModeling@Beta|[error] Error on file upload: Server is out of disk space
1/19/2009 1:55:38 AM|MindModeling@Beta|Temporarily failed upload of SASTNM_9_7-3-7072_1232316416_0_0: transient upload error
1/19/2009 1:55:38 AM|MindModeling@Beta|Backing off 2 hr 57 min 53 sec on upload of SASTNM_9_7-3-7072_1232316416_0_0
2) Message boards : Number crunching : No work? (Message 1184)
Posted 12 Jan 2009 by Profile Viking69
The current job we are running is finishing up. Most work units have been allocated, so even though the job is only 75% complete no more WU's will be sent out to be computed. New research is on the way!



OK, that was from November '08. You state that you have new servers. The server status pages show no new work available. The homepage shows that a project is not yet complete.

Whats UP?
3) Message boards : Number crunching : Suppressed Pending Completion? (Message 1169)
Posted 19 Dec 2008 by Profile Viking69
This boat is getting crowded.
Lots of pending, of about 100 credits. Not earth shaking but annoying.
From OCT 21st thru OCT 30th they are all saying the same thing.
But when looking at my tasks list, they are almost all given credit. AND, the order of the WU's in the task list seems a bit random.

Tom?????

Hello ADMIN ! ! ! ! Some help here please !
4) Message boards : Number crunching : RESOLVED--> HTTP error ,can't upload (Message 1152)
Posted 10 Dec 2008 by Profile Viking69
Same Here

12/10/2008 5:21:49 AM|MindModeling@Beta|Started upload of SSATNM_8-3-42613_1225397822_1_0
12/10/2008 5:21:51 AM|MindModeling@Beta|[error] Error on file upload: Server is out of disk space
12/10/2008 5:21:51 AM|MindModeling@Beta|Temporarily failed upload of SSATNM_8-3-42613_1225397822_1_0: transient upload error


But the servers say that they are UP and only the assimilateor is down. No server health info that I can see.
5) Message boards : Number crunching : Reported back too late to validate? (Message 989)
Posted 12 Sep 2008 by Profile Viking69
OK, the system has been updated, I allowed an new WU to go to this PC , and it never showed up on my stats. It did crunch and report.

9/10/2008 8:19:48 AM|MindModeling@Beta|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
9/10/2008 8:19:53 AM|MindModeling@Beta|Scheduler request succeeded: got 0 new tasks

and here is what I got the night before,

9/10/2008 3:22:30 AM|MindModeling@Beta|Starting PVTNM_noattend4_large-3-266_1220223241_0
9/10/2008 3:22:31 AM|MindModeling@Beta|Starting task PVTNM_noattend4_large-3-266_1220223241_0 using mm_ACTR version 300
9/10/2008 3:22:52 AM|MindModeling@Beta|Computation for task PVTNM_noattend4_large-3-266_1220223241_0 finished
9/10/2008 3:22:54 AM|MindModeling@Beta|Started upload of PVTNM_noattend4_large-3-266_1220223241_0_0
9/10/2008 3:22:56 AM|MindModeling@Beta|Finished upload of PVTNM_noattend4_large-3-266_1220223241_0_0

so where did it go?

It was on the third page of results. You can look at the workunit and see that 4 others returned a "Client error" before your success reported so you received "Validate error". Looking at the times of work being issued, 3 of those 4 were sent the task after you, but since it errored immediately and then reported back, you needed to be really quick with your success to receive credit. This situation is why I have suggested changing the initial replication to 1 so it matches quorum.


Well, thats the first time I have seen the most recient WU not be at the top of the first page in any project I have done. This is being sorted by TASK ID not date of download. A bit different.
6) Message boards : Number crunching : Reported back too late to validate? (Message 981)
Posted 10 Sep 2008 by Profile Viking69
OK, the system has been updated, I allowed an new WU to go to this PC , and it never showed up on my stats. It did crunch and report.

9/10/2008 8:19:48 AM|MindModeling@Beta|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
9/10/2008 8:19:53 AM|MindModeling@Beta|Scheduler request succeeded: got 0 new tasks

and here is what I got the night before,

9/10/2008 3:22:30 AM|MindModeling@Beta|Starting PVTNM_noattend4_large-3-266_1220223241_0
9/10/2008 3:22:31 AM|MindModeling@Beta|Starting task PVTNM_noattend4_large-3-266_1220223241_0 using mm_ACTR version 300
9/10/2008 3:22:52 AM|MindModeling@Beta|Computation for task PVTNM_noattend4_large-3-266_1220223241_0 finished
9/10/2008 3:22:54 AM|MindModeling@Beta|Started upload of PVTNM_noattend4_large-3-266_1220223241_0_0
9/10/2008 3:22:56 AM|MindModeling@Beta|Finished upload of PVTNM_noattend4_large-3-266_1220223241_0_0

so where did it go?
7) Message boards : Number crunching : Reported back too late to validate? (Message 975)
Posted 9 Sep 2008 by Profile Viking69
And another thing, if you are going to send out 3 WU's in the initial replication, then if all three WU's come back in time and have valid data they should all get credit.

I didn't get credit on this becaus I reported back 4 minutes after someone else, and I crunched it quicker.


You are correct - that would only be fair.
Until I can rewrite the validator - I re-ran a script to fix credit; this method is far from ideal because it does not affect RAC only total credit counts.

I will see what I can do,
--Jack


Thank you Jack. I know that this is BETA, and these are the growing pains of a mainline project.
8) Message boards : Number crunching : Reported back too late to validate? (Message 965)
Posted 5 Sep 2008 by Profile Viking69
And another thing, if you are going to send out 3 WU's in the initial replication, then if all three WU's come back in time and have valid data they should all get credit.

I didn't get credit on this becaus I reported back 4 minutes after someone else, and I crunched it quicker.
9) Message boards : Number crunching : Computing errors (Message 964)
Posted 5 Sep 2008 by Profile Viking69
I installed the x64 patch on my x32 Vista box to see if a WU would crunch and not error out. Well, it crunched. Then is failing on upload.

9/5/2008 12:54:34 AM|MindModeling@Beta|Started upload of PVTNM_noattend4_large-3-36_1219696876_2_0
9/5/2008 12:54:36 AM|MindModeling@Beta|[error] Error on file upload: can't open file
9/5/2008 12:54:36 AM|MindModeling@Beta|Temporarily failed upload of PVTNM_noattend4_large-3-36_1219696876_2_0: transient upload error
9/5/2008 12:54:36 AM|MindModeling@Beta|Backing off 1 min 0 sec on upload of PVTNM_noattend4_large-3-36_1219696876_2_0


And since it had used one of the two cores on my pc, that core was idle until I interacted with the PC and then the WU finished. This must have been several hours of idle time for my cpu because many other short WU's for other projects did not get done.

And this one failed to validate. why?

I'm getting frustrated with this project.
10) Questions and Answers : Windows : RESOLVED -- I'll never get workunits (Message 950)
Posted 1 Sep 2008 by Profile Viking69
OK, I got WU's, but they are all errors.

TASKS
11) Message boards : Number crunching : Still uploading problems? (Message 942)
Posted 27 Aug 2008 by Profile Viking69
Good Luck Tom and Crew. I can feel you frustration, as I am IT as well.
12) Message boards : Number crunching : Computing errors (Message 903)
Posted 9 Aug 2008 by Profile Viking69
3 of the 4 files I had in upload mode have now reported but I still have this one:

8/9/2008 10:46:20 AM|MindModeling@Beta|Started upload of PRP_HF_Rev_orig2-3-1118_1218116953_2_0
8/9/2008 10:46:22 AM|MindModeling@Beta|[error] Error on file upload: Server is out of disk space
8/9/2008 10:46:22 AM|MindModeling@Beta|Temporarily failed upload of PRP_HF_Rev_orig2-3-1118_1218116953_2_0: transient upload error
8/9/2008 10:46:22 AM|MindModeling@Beta|Backing off 1 hr 28 min 1 sec on upload of PRP_HF_Rev_orig2-3-1118_1218116953_2_0


Things are geting better, but are not yet 'good to go'.
13) Message boards : Number crunching : Computing errors (Message 898)
Posted 9 Aug 2008 by Profile Viking69
And also this:

8/9/2008 4:47:00 AM|MindModeling@Beta|Started upload of PRP_HF_Rev_orig2-3-1118_1218116953_2_0
8/9/2008 4:47:02 AM|MindModeling@Beta|[error] Error on file upload: can't open log file
8/9/2008 4:47:02 AM|MindModeling@Beta|Temporarily failed upload of PRP_HF_Rev_orig2-3-1118_1218116953_2_0: transient upload error
8/9/2008 4:47:02 AM|MindModeling@Beta|Backing off 1 hr 12 min 41 sec on upload of PRP_HF_Rev_orig2-3-1118_1218116953_2_0
14) Message boards : Number crunching : Computing errors (Message 896)
Posted 9 Aug 2008 by Profile Viking69
And now I am getting this:

8/8/2008 4:57:29 PM|MindModeling@Beta|[error] Error on file upload: Server is out of disk space
8/8/2008 4:57:29 PM|MindModeling@Beta|Temporarily failed upload of PRP_HF_Rev_orig2-3-990_1218116949_0_0: transient upload error
8/8/2008 4:57:29 PM|MindModeling@Beta|Backing off 1 min 0 sec on upload of PRP_HF_Rev_orig2-3-990_1218116949_0_0


But I see that there was a power outage. So this issue is probably related.
15) Message boards : Cafe : ............ (Message 895)
Posted 9 Aug 2008 by Profile Viking69
. . . - - - . . .
16) Message boards : Number crunching : Computing errors (Message 891)
Posted 8 Aug 2008 by Profile Viking69
I'm getting a bunch of these all of a sudden.

FYI




Main page · Your account · Message boards


Copyright © 2020 MindModeling.org