hyperglot icon indicating copy to clipboard operation
hyperglot copied to clipboard

Issues with s-cedilla in several Turkic and related languages

Open MrBrezina opened this issue 4 years ago • 2 comments

Instead of S-cedilla (U+15E) and s-cedilla (U+015F), S-comma(U+0218) and s-comma(U+0219) are listed as required letters. It has to be S-cedilla and s-cedilla.

Expanding on the issue mentioned in #71 and #72

MrBrezina avatar Feb 17 '22 15:02 MrBrezina

👍 Both other sources (Omniglot and Wikipedia) we reference list the cedilla versions, too.

kontur avatar Feb 17 '22 15:02 kontur

I originally thought this was correct, but am not so sure now and backed out my related changes in #72 for now.

The remaining Turkic languages that list both S-cedilla and S-commaaccent variants probably stem from the fact that they are Romanizations and there is more than one scheme. Most of these are Latin variants of a default Cyrilic based alphabet. Some schemes for Cyrilic → Latin do call for commaaccent glyphs even if the most prevalant use case for these will use the modern Turkish alphabed with its cedilla based glyph.

If the goal is to be canocical, research will be needed into each case. If the goal is to cover all possible Romanization schemes then having both might be correct.

Note this is also true for other letters in affected languages such as T-cedilla and C-cedilla.

alerque avatar Feb 17 '22 18:02 alerque