Greg Caporaso
I'm not really sure what's supposed to go in this field. Should this be something like "Study contact" or "Data depositor"?
since not all of the input handlers split the input (e.g., alpha diversity)
this is currently implemented as _precommand_initiation in ParallelPickOtusBlast, ParallelAssignTaxonomyBlast, and probably ParallelBlast. Multiple inheritance might be in order here, including a new, very lightweight class (or maybe ParallelPickOtusBlast and ParallelAssignTaxonomyBlast...
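A minimal sketch of the mixin idea, under stated assumptions: `BaseParallelJob`, `BlastDbSetupMixin`, the two `*Job` classes, the single-argument `_precommand_initiation(params)` signature, the `refseqs_fp`/`blast_db` keys, and the command strings are all hypothetical stand-ins, not the existing code. It only shows how a lightweight class could hold the shared pre-command step once so that both BLAST-based classes inherit it.

```python
class BaseParallelJob(object):
    """Hypothetical stand-in for the shared parallel base class."""

    def _precommand_initiation(self, params):
        # Default hook: nothing to set up before commands are built.
        return params

    def run(self, params):
        # Copy params so the caller's dict is not modified by the hook.
        params = self._precommand_initiation(dict(params))
        return self._build_commands(params)

    def _build_commands(self, params):
        raise NotImplementedError


class BlastDbSetupMixin(object):
    """Hypothetical lightweight class holding the shared pre-command step."""

    def _precommand_initiation(self, params):
        # Placeholder for the shared setup (illustrative only): derive a
        # database name from the reference sequences if none was given.
        if not params.get("blast_db"):
            params["blast_db"] = params["refseqs_fp"] + ".db"
        # Cooperative call so other hooks in the MRO still run.
        return super(BlastDbSetupMixin, self)._precommand_initiation(params)


class PickOtusBlastJob(BlastDbSetupMixin, BaseParallelJob):
    def _build_commands(self, params):
        # Illustrative command string, not a real command line.
        return ["pick-otus-step db=%s" % params["blast_db"]]


class AssignTaxonomyBlastJob(BlastDbSetupMixin, BaseParallelJob):
    def _build_commands(self, params):
        # Illustrative command string, not a real command line.
        return ["assign-taxonomy-step db=%s" % params["blast_db"]]
```

The mixin is listed first in each class's bases so that its `_precommand_initiation` takes precedence over the base-class default.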
currently the user has to override this function (as in ParallelPickOtus), but we could come up with a clever way to get around that
this would ensure (1) that abundances are not affected by differing quality across reads of the same sequence, and (2) that prefixes cluster to the same OTU as the sequence...
Directories like rarefied OTU tables, jackknifed distance matrices, etc. (i.e., large directories that get created for record keeping but are not frequently used) should be gzipped as part of the workflow...
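One possible way to do this, sketched with the standard-library `tarfile` module. The helper name `archive_directory` and its `remove_original` flag are assumptions for illustration; nothing here is part of the existing workflow code.

```python
import os
import shutil
import tarfile


def archive_directory(dir_path, remove_original=False):
    """Pack dir_path into dir_path + '.tar.gz' and return the archive path.

    Hypothetical helper: a workflow step could call it on each large
    record-keeping directory once that directory has been written.
    """
    dir_path = os.path.normpath(dir_path)
    archive_path = dir_path + ".tar.gz"
    with tarfile.open(archive_path, "w:gz") as tar:
        # Store paths relative to the directory name, not the full path.
        tar.add(dir_path, arcname=os.path.basename(dir_path))
    if remove_original:
        # Only drop the uncompressed copy after the archive is closed.
        shutil.rmtree(dir_path)
    return archive_path
```

Leaving `remove_original` off by default keeps the uncompressed directory in place until the caller decides the archive is good.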
When we move to Python 2.7, we should take advantage of conditional testing to put this in place.
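If "conditional testing" here means the skip decorators that `unittest` gained in Python 2.7, a sketch could look like the following. The helpers `executable_on_path` and `requires_executable`, the test class, and the use of `blastall` as the external dependency are illustrative assumptions, not a description of the existing tests.

```python
import os
import unittest


def executable_on_path(name):
    """Return True if an executable called name is found on PATH."""
    for directory in os.environ.get("PATH", "").split(os.pathsep):
        if not directory:
            continue
        candidate = os.path.join(directory, name)
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return True
    return False


def requires_executable(name):
    """Decorator that skips a test unless name is installed (hypothetical helper)."""
    return unittest.skipUnless(executable_on_path(name),
                               "%s not found on PATH" % name)


class ExternalToolTests(unittest.TestCase):

    @requires_executable("blastall")  # illustrative dependency name
    def test_blast_is_callable(self):
        # Runs only when the executable is present; otherwise reported as skipped.
        self.assertTrue(executable_on_path("blastall"))
```

A skipped test is reported as skipped rather than as a failure, so a missing optional dependency does not break the whole run.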