facets icon indicating copy to clipboard operation
facets copied to clipboard

mafR.clust is NA for all rows

Open andyjslee opened this issue 7 years ago • 3 comments

I am running FACETS on Whole Exome Sequencing (WES) data. I get NA for all rows in the mafR.clust column. I see that cval of 100 in the procSample function is reasonable for WES. What is the reason why I'm getting all NAs?

andyjslee avatar May 09 '18 08:05 andyjslee

Could happen if the number of heterozygous snps (nhet) is low for each segment or your sample is hyper-segmented? If the proportion of hets in the sample is not around 10% then you may be using the wrong genome build.

veseshan avatar May 09 '18 14:05 veseshan

Thank you for your response. Yes, you were right. I had used the wrong genome build. Could you please describe the difference between human_9606_b150_GRCh37p13 and human_9606_b151_GRCh37p13 (on ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/)? Also, what genome build is human_9606?

andyjslee avatar May 10 '18 01:05 andyjslee

Human is the common name for the organism 9606 is the NCBI taxonomy id number b150 is the dbsnp build (i.e. build 150) GRCh37p13 is the reference genome

veseshan avatar May 10 '18 14:05 veseshan