camel_tools icon indicating copy to clipboard operation
camel_tools copied to clipboard

[Documentation] Update Morphology Features page

Open Qfl3x opened this issue 4 years ago • 1 comments

Describe the bug https://camel-tools.readthedocs.io/en/latest/reference/camel_morphology_features.html appears to be outdated.

To Reproduce Analyze "وأما". Result for rat is a u. However that's not present in the docs, this isn't the only one I fell on. Especially for rot field: it also includes r and i as well as y and n which seemed awkward.

Expected behavior "u" present in docs.

Desktop (please complete the following information):

  • OS Linux
  • Python version 3.7.4 Anaconda
  • CAMeL Tools version: Installed through pip. 1.2.0
  • Documentation version: latest

Qfl3x avatar Nov 28 '21 14:11 Qfl3x

A related note on the output of the CALIMA Star Analyzer: It's feature-value pair format output for "وأما" contains analyses like:

diac: وَأُمّاً | caphi: w_a_2_u_m_m_a_n | lex: أُمّ | bw: وَ/CONJ+أُمّ/NOUN+اً/CASE_INDEF_ACC | gloss: and+mother;maternal+[acc.indef.] | pos: noun | prc3: 0 | prc2: wa_conj | prc1: 0 | prc0: 0 | enc0: 0 | asp: na | vox: na | mod: na | gen: f | num: s | stt: i | cas: a | per: na | rat: r | source: lex | stem: أُمّ | stemcat: N | d3tok: وَ+_ أُمّاً | d3seg: وَ+_ أُمّاً | d2tok: وَ+_ أُمّاً | d2seg: وَ+_ أُمّاً | d1tok: وَ+_ أُمّاً | d1seg: وَ+_ أُمّاً | atbtok: وَ+_ أُمّاً | atbseg: وَ+_ أُمّاً

In the "Simplified Analyses" mode this is given as:

‎وَأُمّاً w_a_2_u_m_m_a_n وَ+/conj [Feminine Singular Indefinite Accusative Rational]

So apparently the rat: r means "rational", but as you write it is not documented in camel_tools/docs/source/reference/camel_morphology_features.rst.

jjmhtp avatar Jun 11 '25 18:06 jjmhtp