Janis Lesinskis

Results 32 issues of Janis Lesinskis

https://github.com/CoEDL/elpis/blob/913c567bfaafc2081bc96fbd80036b07d4762bf0/elpis/endpoints/__init__.py#L26-L33 From the point of view of making tooling that interacts with this project it would be easier to deal with a missing log file if the following code had...

backend
API

https://github.com/CoEDL/elpis/blob/913c567bfaafc2081bc96fbd80036b07d4762bf0/elpis/endpoints/__init__.py#L28-L30 There are some subtle bugs that can occur here if the file encoding is not specified. I'd suggest making this explicit.
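A minimal sketch of what an explicit encoding could look like. It assumes the log is UTF-8 text, which the issue does not state, and the helper name `read_log` is hypothetical rather than taken from the elpis code.

```python
from pathlib import Path


def read_log(path: Path, encoding: str = "utf-8") -> str:
    """Read a text log with an explicit encoding (hypothetical helper).

    Without the encoding argument, open() falls back to a
    platform-dependent default, so the same bytes can decode
    differently on different machines.
    """
    with open(path, encoding=encoding) as log_file:
        return log_file.read()
```

Passing the encoding as a parameter with a default keeps the call site short while still making the choice visible in one place.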

There is currently some code that returns an empty list and has a TODO. It is likely best to raise an exception here so that incomplete code cannot slip past...
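One way to sketch the suggestion, using a hypothetical function name since the issue does not identify the code in question:

```python
def unfinished_feature() -> list:
    """Hypothetical stand-in for the function that carries the TODO."""
    # Raising makes the gap visible at the call site, whereas returning []
    # would be indistinguishable from a valid "no results" answer.
    raise NotImplementedError("TODO: not implemented yet")
```

A caller that reaches this path then fails loudly instead of silently continuing with an empty result.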

Improve the default for the Kaldi path so that it does not reference a specific user's home directory. Update documentation. Closes #237 Closes #239

bug
documentation
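As a rough illustration of a default that avoids a hard-coded home directory, the sketch below resolves the path at run time. The function name `default_kaldi_root`, the `KALDI_ROOT` environment variable override, and the `kaldi` directory name are all assumptions made for this example and are not taken from the pull request.

```python
import os
from pathlib import Path


def default_kaldi_root() -> Path:
    """Resolve a Kaldi root without naming a specific user's home directory."""
    # Assumption: an environment variable named KALDI_ROOT may override the default.
    override = os.environ.get("KALDI_ROOT")
    if override:
        return Path(override).expanduser()
    # Assumption: fall back to a "kaldi" directory under the current user's home.
    return Path.home() / "kaldi"
```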

Currently there's a mismatch between how the Kaldi root path is loaded from the settings.ini file and how the documentation describes it. The documentation expects to read in the key...

bug
documentation

It might be good to provide some links to papers that have used Persephone on the https://persephone.readthedocs.io/en/stable/ site

documentation

Create a model for RNN CTC using Keras. Note: Use of Keras save functionality will greatly help in saving and restoring models.

low-priority

It would be good to have a small integration test for constructing a corpus from an Elan file, for coverage reasons; see #177.

testing

`Corpus.from_elan` has a parameter that lets you specify a label segmenter of type `LabelSegmenter`. Usages: `utterances = [label_segmenter.segment_labels(utter) for utter in utterances]` and ...

bug
help wanted
testing

Right now Unicode space characters such as 'NO-BREAK SPACE' (U+00A0) don't get split on. I propose we decide on what behavior we expect here and resolve this in PR #213....

bug
question
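To make the two candidate behaviors concrete, here is a small sketch using only Python's built-in `str.split`. It is not taken from the project's code, and the sample string is made up for illustration.

```python
# U+00A0 NO-BREAK SPACE sits between the first two words.
text = "hello\u00a0world again"

# Splitting on a literal ASCII space leaves the no-break space inside a token.
ascii_split = text.split(" ")

# str.split() with no argument splits on every character for which
# str.isspace() is true, which includes U+00A0.
unicode_split = text.split()
```

Here `ascii_split` contains two tokens and `unicode_split` contains three, so the choice between the two calls is exactly the behavior decision the issue asks for.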