bsdconv
bsdconv copied to clipboard
A simple but powerful DSL for charset/encoding conversion and transformation, pure C implemetation with no extra dependencies
is it possible to build bsdconv as a single binary? (everything is inside the .dll/.so file so no need to specify path.)
I think this project would become more widely used if there was a additional library that provides iconv compatible api that you could link against to. I may do this...
Please provide better documentation. Explain the rule syntax and each codecs
bsdconv's GB2312 table which comes from unicode.org and went missing after EASTASIA charts became obsolete is, to some extent, similar to Unicode's Big5 table in quality. (I will use unicode.org's...
GB2312, which is just a table without binary numbers, have various interchangable encodings like EUC-CN (the common "gb2312" encoding which looks like GBK) and HZ (which uses escapes). Bsdconv's current...
Current GB18030's DBCS subset (aka "GBK") data: * should be updated with a U+E7C7/U+1E3F swap in GB18030-2005, * may use 24 [Unicode 4.1](https://en.wikipedia.org/wiki/GB_18030#PUA) mappings instead of the PUA code points...