Support for getting encoding from .cpg files

Open • karimbahgat opened this issue 3 years ago • 2 comments

Describe the feature request

The .cpg file is a non-standard but commonly accepted sidecar file that specifies which text encoding to use. Supporting it would make handling of files and special cases more robust. The main problem is figuring out which strings are used to represent encodings, since these appear to be non-standard.

May want to support both reading and writing these files.
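As a rough illustration of what read and write support could look like, here is a minimal sketch. The helper names (`cpg_label_to_codec`, `read_cpg_encoding`, `write_cpg`) are hypothetical and not part of pyshp, and the label handling is an assumption: labels such as `UTF-8`, `1252` or `ANSI 1252` are examples of what may turn up in the wild, not an authoritative list. The sketch also assumes the .cpg file itself is plain ASCII.

```python
import codecs
import os
import re


def cpg_label_to_codec(label, default="utf-8"):
    """Map the text of a .cpg file to a Python codec name (hypothetical helper)."""
    text = label.strip()
    if not text:
        return default
    candidates = [text]
    # Assumption: a bare number, or an "ANSI 1252" / "OEM 437" style label,
    # refers to a code page that Python exposes as "cp<number>".
    match = re.fullmatch(r"(?:ANSI|OEM)?\s*(\d+)", text, flags=re.IGNORECASE)
    if match:
        candidates.append("cp" + match.group(1))
    for name in candidates:
        try:
            # codecs.lookup raises LookupError for names Python does not know.
            return codecs.lookup(name).name
        except (LookupError, ValueError):
            continue
    return default


def read_cpg_encoding(shp_path, default="utf-8"):
    """Return the codec named in the .cpg next to shp_path, or the default."""
    cpg_path = os.path.splitext(shp_path)[0] + ".cpg"
    try:
        # Assumption: the label is ASCII; other bytes are replaced, not fatal.
        with open(cpg_path, "r", encoding="ascii", errors="replace") as f:
            label = f.readline()
    except FileNotFoundError:
        return default
    return cpg_label_to_codec(label, default)


def write_cpg(shp_path, label):
    """Write the given label verbatim as a one-line .cpg next to shp_path."""
    cpg_path = os.path.splitext(shp_path)[0] + ".cpg"
    with open(cpg_path, "w", encoding="ascii") as f:
        f.write(label)
```

The returned codec name could then be passed on as the `encoding` argument when opening the shapefile. Unknown labels fall back to the default rather than raising, which is one possible design choice; raising or warning would be equally defensible.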

Contributions

  • [ ] I am interested in implementing the described feature request and submitting it as a PR.

karimbahgat, Jul 26 '22 22:07

I'm not familiar with Arc, but from googling, .cpg files seem to be nothing more than one-line text files. However, their very nature invites the following questions:

Are the .cpg files themselves encoded in UTF-8, or something else?
Or do all the encoding strings to be supported use only ASCII characters, so that it doesn't matter as long as the file isn't UTF-16?

JamesParrott, Oct 18 '22 22:10

Not sure at the moment. I suspect that assuming ASCII or UTF-8 would be safe and work in most cases. I haven't looked very hard though. Let me know if you find out, or want to submit a PR :)
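For what it's worth, the ASCII/UTF-8 assumption can be made a bit more forgiving with very little code. The sketch below uses a hypothetical helper, `decode_cpg_bytes`, which is not part of pyshp. It decodes the raw bytes of a .cpg file as UTF-8 (of which ASCII is a subset), drops a UTF-8 byte order mark if present, and only switches to UTF-16 when the file starts with a UTF-16 byte order mark. Whether any software actually writes UTF-16 .cpg files is not something established here; that branch is purely defensive.

```python
import codecs


def decode_cpg_bytes(raw):
    """Return the first line of a .cpg file's raw bytes as stripped text."""
    if raw.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        # The "utf-16" codec uses the BOM to pick the byte order, then drops it.
        text = raw.decode("utf-16", errors="replace")
    else:
        # "utf-8-sig" drops a leading UTF-8 BOM if there is one; pure ASCII
        # decodes unchanged, and invalid bytes are replaced instead of raising.
        text = raw.decode("utf-8-sig", errors="replace")
    lines = text.splitlines()
    return lines[0].strip() if lines else ""
```

The result is just the label text; mapping that label to an actual encoding is the separate, harder problem mentioned in the issue description.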

karimbahgat, Jan 10 '23 23:01