Joel Armstrong
Currently the AWS provisioner will terminate the entire workflow if it hits the spot request limit: ``` Exception in thread preemptable-scaler: Traceback (most recent call last): File "/usr/lib/python2.7/threading.py", line 810,...
Dumping my list of bad error/logging messages. Some are more egregious than others. * [ ] Jobs "end successfully" even when they fail: ``` Job ended successfully: 'CactusBarWrapperWithPrecomputedEndAlignments' 3c056a1e-db38-4707-9fd2-c2b6885ea12e Job...
This has been a problem for a while, but I'm just putting an issue up so I remember to fix this somehow. When parasol has more than a million or...
I just had several jobTrees fail (presumably due to a filesystem problem) with this error: ``` Batch system is reporting that the job (1, 298079848) /hive/users/jcarmstr/cactusStuff/phylogenyTests/glires/work-noRescue2/jobTree/jobs/t2/t2/t2/t3/t1/t0/t1/t0/job failed with exit value...
Recently I've noticed instances staying completely idle despite having plenty of jobs that could be scheduled on them. For example:  This tends to get worse...
Related to #1699. About once a day I get an error like this: ``` Traceback (most recent call last): File "/usr/local/lib/python2.7/dist-packages/toil/provisioners/clusterScaler.py", line 296, in check scalerThread.join(timeout=0) File "/usr/local/lib/python2.7/dist-packages/bd2k/util/threading.py", line 51,...