MedKhem
First reflection: identify a piece of text as sub/superscript based on position, fonts, etc.
An internal validation scheme should probably be added
When the morphological information is processed, extracting the list of lemmas and POS tags should be possible
The existing "parse full dictionary" service doesn't allow the user to get the parsing results of specific models, such as form or sense
More labels could be used to encode a lexical entry other than: \, \, \, \ and \.
For each model, 2 commands should be available: one for raw text creation (to be annotated from scratch) and one with pre-annotated text (which is going to be refined in...
After adding and testing new models, their output should follow the same logic as previous models (e.g. the case where an entry is split across 2 pages)
Implement components for parsing and segmenting etymological information in the etymology section detected in a lexical entry
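
For the sub/superscript item above, one possible heuristic is sketched below. It is only an illustration, not the project's implementation: the `Token` type, the `classify_scripts` function, and both thresholds are hypothetical, and it assumes layout coordinates in which y grows downward. A token is labelled as a script only when it is both smaller than the typical font size of its line and shifted away from the line's typical baseline.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Token:
    """Hypothetical layout token; not a type from any existing library."""
    text: str
    baseline_y: float  # assumption: y grows downward (top-left origin)
    font_size: float


def classify_scripts(line, size_ratio=0.85, shift_ratio=0.15):
    """Label each token of one text line as normal, subscript or superscript.

    size_ratio and shift_ratio are illustrative thresholds, not tuned values.
    """
    if not line:
        return []
    # The median over the line is used as the reference for "normal" text.
    ref_size = median(t.font_size for t in line)
    ref_base = median(t.baseline_y for t in line)
    min_shift = shift_ratio * ref_size
    labels = []
    for t in line:
        raised_by = ref_base - t.baseline_y  # positive means above the baseline
        smaller = t.font_size <= size_ratio * ref_size
        if smaller and raised_by >= min_shift:
            labels.append("superscript")
        elif smaller and -raised_by >= min_shift:
            labels.append("subscript")
        else:
            labels.append("normal")
    return labels


if __name__ == "__main__":
    line = [
        Token("H", 100.0, 10.0),
        Token("2", 102.5, 6.0),
        Token("O", 100.0, 10.0),
        Token("note", 100.0, 10.0),
        Token("1", 96.0, 6.0),
    ]
    for token, label in zip(line, classify_scripts(line)):
        print(token.text, label)
```

Requiring both cues keeps small-caps text (smaller but on the baseline) and slightly misaligned tokens (shifted but full size) from being mislabelled. Using font names as an additional signal, as the item suggests, would be a natural extension.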