tutorials.quanteda.io icon indicating copy to clipboard operation
tutorials.quanteda.io copied to clipboard

encoding is not speficied properly

Open koheiw opened this issue 6 years ago • 0 comments

I noted that ISO-8859-1(Latin 1) is set as character encoding for all the European languages: https://tutorials.quanteda.io/import-data/multiple-files/

It is inappropriate because many of the European languages are not in Latin 1 (and text is actually corrupt in GR and LV). The most appropriate way is specifying ISO-8859-1, SO-8859-2, SO-8859-3 etc. but it requires file names to identify language...

koheiw avatar Jun 07 '19 02:06 koheiw