meryl icon indicating copy to clipboard operation
meryl copied to clipboard

the parental specific kmers were intersected in progeny genome

Open leon945945 opened this issue 2 years ago • 1 comments

Hi, I took use of meryl to identify individual-specific kmers with difference subcommand as below: meryl difference paternal.meryl/ maternal.meryl/ output paternal-specific.meryl meryl difference maternal.meryl/ paternal.meryl/ output maternal-specific.meryl following the differnece subcommand description, I put the paternal kmers first and maternal kmers second to identify paternal-specific kmers, then maternal kmers first and paternal kmers second to identify maternal-specific kmers.

Then meryl-lookup was used to identify paternal-specific or maternal-specific kmers in progeny as below: meryl-lookup -sequence progeny.genome.fa -mers maternal-specific.meryl -bed-runs > progeny-maternal.bed meryl-lookup -sequence progeny.genome.fa -mers paternal-specific.meryl -bed-runs > progeny-paternal.bed I compared the two bed files with bedtools intersect -a progeny-maternal.bed -b progeny-paternal.bed -wa -wb | wc -l, there are over 50,000 overlapped intervals.

My confusion is that the paternal-specific or maternal-specific kmers are individual-specific, why they are overlapped in progeny genome?

Much thanks if you can give me suggestions.

leon945945 avatar Mar 20 '24 03:03 leon945945

Hi, I checked the length of kmer intervals in progeny genome. The paternal-specific kmers occupied 70Mb. The maternal-specific kmers occupied 218Mb. The overlapped intervals of paternal-specific kmers and maternal-specific kmers are 1.3Mb. The length of progeny genome is 356Mb.

leon945945 avatar Mar 20 '24 04:03 leon945945

Maybe I get the reason.

leon945945 avatar Jul 02 '24 01:07 leon945945