tutorials.quanteda.io
tutorials.quanteda.io copied to clipboard
encoding is not speficied properly
I noted that ISO-8859-1(Latin 1) is set as character encoding for all the European languages: https://tutorials.quanteda.io/import-data/multiple-files/
It is inappropriate because many of the European languages are not in Latin 1 (and text is actually corrupt in GR and LV). The most appropriate way is specifying ISO-8859-1, SO-8859-2, SO-8859-3 etc. but it requires file names to identify language...