[Iplant-api-dev] job failed, but status says RUNNING

Barthelson, Roger A - (rogerab) rogerab at email.arizona.edu
Wed Oct 3 09:01:03 MST 2012


Hi Cornel-

Your jobs won't do anything until Lonestar is back up running jobs. I don't know what will happen with your existing jobs. Most likely they will run when they can. All you can do is wait and see. You can check the Lonestar queue with showq to see if it has your jobs. But I think that's about it.

Roger

Sent from my iPad

On Oct 3, 2012, at 7:06 AM, "Cornel Ghiban" <ghiban at cshl.edu> wrote:

> Hi,
> 
> One of the jobs (#4204) is still showing as running, but I think it actually 
> failed. The error log says:
> 
> ERROR: _rcConnect: connectToRhost error, server on 
> bitol.iplantcollaborative.org is probably down status = -347000 
> USER_SOCK_CONNECT_TIMEDOUT
> 
> 
> Do you have any best practice tips on how to resubmit jobs that are stuck? I 
> also have a few jobs stuck in SUBMITTING since yesterday, when lonestar was 
> put into maintenance mode.
> 
> Thanks,
> Cornel
> _______________________________________________
> Iplant-api-dev mailing list
> Iplant-api-dev at iplantcollaborative.org
> http://mail.iplantcollaborative.org/mailman/listinfo/iplant-api-dev



More information about the Iplant-api-dev mailing list