the parental specific kmers were intersected in progeny genome
Hi, I took use of meryl to identify individual-specific kmers with difference subcommand as below:
meryl difference paternal.meryl/ maternal.meryl/ output paternal-specific.meryl
meryl difference maternal.meryl/ paternal.meryl/ output maternal-specific.meryl
following the differnece subcommand description, I put the paternal kmers first and maternal kmers second to identify paternal-specific kmers, then maternal kmers first and paternal kmers second to identify maternal-specific kmers.
Then meryl-lookup was used to identify paternal-specific or maternal-specific kmers in progeny as below:
meryl-lookup -sequence progeny.genome.fa -mers maternal-specific.meryl -bed-runs > progeny-maternal.bed
meryl-lookup -sequence progeny.genome.fa -mers paternal-specific.meryl -bed-runs > progeny-paternal.bed
I compared the two bed files with bedtools intersect -a progeny-maternal.bed -b progeny-paternal.bed -wa -wb | wc -l, there are over 50,000 overlapped intervals.
My confusion is that the paternal-specific or maternal-specific kmers are individual-specific, why they are overlapped in progeny genome?
Much thanks if you can give me suggestions.
Hi, I checked the length of kmer intervals in progeny genome. The paternal-specific kmers occupied 70Mb. The maternal-specific kmers occupied 218Mb. The overlapped intervals of paternal-specific kmers and maternal-specific kmers are 1.3Mb. The length of progeny genome is 356Mb.
Maybe I get the reason.