Joel Armstrong

6 issues by Joel Armstrong

Currently the AWS provisioner will terminate the entire workflow if it hits the spot request limit:
```
Exception in thread preemptable-scaler:
Traceback (most recent call last):
  File "/usr/lib/python2.7/threading.py", line 810,...
```

bug
ready
roadmap

Dumping my list of bad error/logging messages. Some are more egregious than others.

* [ ] Jobs "end successfully" even when they fail:
  ```
  Job ended successfully: 'CactusBarWrapperWithPrecomputedEndAlignments' 3c056a1e-db38-4707-9fd2-c2b6885ea12e Job...
  ```

in progress
roadmap
intern

This has been a problem for a while, but I'm just putting an issue up so I remember to fix this somehow. When parasol has more than a million or...

I just had several jobTrees fail (presumably due to a filesystem problem) with this error:
```
Batch system is reporting that the job (1, 298079848) /hive/users/jcarmstr/cactusStuff/phylogenyTests/glires/work-noRescue2/jobTree/jobs/t2/t2/t2/t3/t1/t0/t1/t0/job failed with exit value...
```

Recently I've noticed instances staying completely idle despite having plenty of jobs that could be scheduled on them. For example: ![Grafana screenshot showing idle instances](https://user-images.githubusercontent.com/4723163/50744542-f60a7780-11d8-11e9-8266-3432a66fad6f.png) This tends to get worse...

aws
mesos
roadmap

Related to #1699. About once a day I get an error like this:
```
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/toil/provisioners/clusterScaler.py", line 296, in check
    scalerThread.join(timeout=0)
  File "/usr/local/lib/python2.7/dist-packages/bd2k/util/threading.py", line 51,...
```

mesos
roadmap