RepeatMasker icon indicating copy to clipboard operation
RepeatMasker copied to clipboard

Cannot Read File

Open br302005 opened this issue 5 years ago • 6 comments

I am trying to softmask a fungal genome and get the following error: image

When I use ls to look in my directory, it says that the fasta file (Polycephalomyces_Cleaned.fasta) is there. image

I am very new to programming and have no idea what might be wrong. Thanks in advance.

br302005 avatar Feb 17 '20 16:02 br302005

Can you show the output of ls -l /home/brittany/Aaron/RepeatMasker/Polycephalomyces_Cleaned.fasta?


It looks like you are using only Dfam. Dfam does not include any fungal-specific repeats, so the masking will likely be incomplete. You will get better masking if you install RepBase RepeatMasker Edition (requires a subscription), or if you use and/or create a curated library of repeats from that organism or a closely related one.

jebrosen avatar Feb 17 '20 17:02 jebrosen

Thank you for such a quick response! Here is the output after I did ls -l :

image

Also, thank you for the suggestion about the fungal-specific repeats. I'm not sure how to make the library of repeats but will look into it!

br302005 avatar Feb 18 '20 15:02 br302005

I will add that after I ran this code there now is a Fungal library: /RepeatMasker/Libraries/CONS-Dfam_3.1/Fungi that contains 9 fungal-specific repeats.

br302005 avatar Feb 18 '20 16:02 br302005

That is very unusual: the Polycephalomyces_Cleaned.fasta file is not readable or writable by anyone. Running the command chmod 0644 Polycephalomyces_Cleaned.fasta will reset it to a sane default, and after that RepeatMasker should be happy to run on it. However, the library issue is worse than I realized:

I will add that after I ran this code there now is a Fungal library: /RepeatMasker/Libraries/CONS-Dfam_3.1/Fungi that contains 9 fungal-specific repeats.

Unfortunately those 9 repeats are used to recognize clonal artifacts and are not actually fungi-specific. In fact, Dfam only includes repeats from Metazoa and descendants.

jebrosen avatar Feb 18 '20 17:02 jebrosen

That worked and the file can now run! How would you suggest fixing the library problem? Making a custom library of fungal repeats and running -lib? I see that RepBase has a library of fungal repeats, but the subscription is almost $1500. Do you have any other suggestions of where I could find a good database? Thank you for all your help!

br302005 avatar Feb 18 '20 17:02 br302005

I am wondering the same thing as above. I have used tantan to mask repeats because I am not sure of a better database for fungi. Any suggestions?

lukaskon avatar Jun 08 '23 20:06 lukaskon