GWA_tutorial icon indicating copy to clipboard operation
GWA_tutorial copied to clipboard

ftp vs https in downloading 1000 genome vcf file in Step 2_Population_stratification

Open SarojaSomu opened this issue 4 years ago • 2 comments

Hi,

I noticed using the command wget ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20100804/ALL.2of4intersection.20100804.genotypes.vcf.gz from the 2_Main_script_MDS.txt downloads a corrupted file. Hence creating plink files from the vcf resulted in file read failure error. But when ftp is replaced with https like this, wget https://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20100804/ALL.2of4intersection.20100804.genotypes.vcf.gz, there is no file corruption issue and the download is faster as well.

Thanks

SarojaSomu avatar Apr 13 '22 20:04 SarojaSomu

I was using that old ftp link as well and running into issues. Thank you for this insight!

mpage21 avatar Sep 09 '22 20:09 mpage21