
Two HiC experiments: combine the data or run SALSA sequentially?

Open ckeeling opened this issue 4 years ago • 1 comments

Hello,

I have both Chicago HiC data and regular HiC data, plus a gfa file for the draft assembly I want to scaffold. I am trying both approaches, but conceptually, would there be any problem with combining the reads from both HiC datasets into one set for mapping? That way SALSA would use the gfa file together with the longer-range HiC data, rather than with only the Chicago data, as happens in the first step of a sequential run.

Thanks, Chris

ckeeling, Apr 12 '21 17:04

I would expect sequential runs to work better. The issue with combining these libraries is that they will have very different length distributions, and I worry this could skew the selection of the best edges. If you have finished the runs, feel free to post which turned out better.

skoren, Jul 15 '21 14:07

In my case, I found little difference between running the two datasets sequentially and running them in combination. Combining them therefore has a practical advantage: downstream work (e.g. JBAT) is done on just one assembly rather than step-wise.

ckeeling, Jun 15 '23 18:06
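
For readers who want to try the combined approach discussed above, here is a minimal sketch of one way to pool the two libraries. It assumes each library was mapped separately and converted to a BED file with the read name in the fourth column, and that the scaffolder expects the pooled file sorted by read name. The function name `combine_bed` and the file names in the usage note are hypothetical, not part of SALSA.

```python
def combine_bed(bed_paths, out_path):
    """Pool several BED alignment files and sort the records by read name.

    Assumption: the read name is in column 4, as in typical BAM-to-BED output.
    Everything is held in memory, so this is only suitable for small files.
    """
    records = []
    for path in bed_paths:
        with open(path) as handle:
            for line in handle:
                line = line.rstrip("\n")
                if line:  # skip blank lines
                    records.append(line.split("\t"))

    # Sort by read name so that both mates of a pair end up adjacent.
    records.sort(key=lambda fields: fields[3])

    with open(out_path, "w") as out:
        for fields in records:
            out.write("\t".join(fields) + "\n")
    return len(records)
```

A call such as `combine_bed(["chicago.bed", "hic.bed"], "combined.bed")` would write the pooled alignments and return the number of records. For real datasets, concatenating the files and sorting them with the Unix `sort` utility is likely more practical than an in-memory sort. Read names must also be unique across the two libraries for mates to pair up correctly.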