[Iplant-api-dev] more job failures, stalls

Rion Dooley dooley at tacc.utexas.edu
Wed Dec 11 09:13:09 MST 2013


Hi Mohammed,

When stampede came back online, there were 600 jobs queued up in Foundation. It backfilled them as quickly as possible and, as a result, managed to blow its disk quota on Stampede. At that point jobs started failing. I cleared up space on Stampede and bumped all the jobs that failed as a result of it. They are flowing back in again now. You won’t see all of yours in queue at the moment due to user-level throttling, but throughput on stampede is excellent right now and your jobs historically have very short run times, so they should all be completed shortly. Let me know if you run into any issues.

--
Rion




On Dec 11, 2013, at 9:40 AM, Khalfan, Mohammed <mkhalfan at cshl.edu<mailto:mkhalfan at cshl.edu>> wrote:

Hi,

More failures and stalls, these are from DNA Subway:

35717: STAGING_JOB
35718: FAILED
36111: PENDING

Please help!

Thank you,
Mohammed




Mohammed Khalfan
Bioinformatics Developer
DNA Learning Center
Cold Spring Harbor Laboratory
1 Bungtown Road
Cold Spring Harbor NY 11724
(516)367-5162
www.dnalc.org<http://www.dnalc.org/>

_______________________________________________
Iplant-api-dev Mailing List: Iplant-api-dev at iplantcollaborative.org<mailto:Iplant-api-dev at iplantcollaborative.org>
List Info and Archives: http://mail.iplantcollaborative.org/mailman/listinfo/iplant-api-dev
One-click Unsubscribe: http://mail.iplantcollaborative.org/mailman/options/iplant-api-dev/dooley%40tacc.utexas.edu?unsub=1&unsubconfirm=1

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.iplantcollaborative.org/pipermail/iplant-api-dev/attachments/20131211/2a17f931/attachment.html 


More information about the Iplant-api-dev mailing list