Alex

Results 80 issues of Alex

It might be nice to link this list from https://github.com/keon/awesome-nlp#nlp-in-spanish

Thank you for a very useful app! Especially [now](https://apple.stackexchange.com/questions/380186/macbook-pro-16-runs-about-10-degrees-hotter-when-plugged-in-on-idle-is-this-no). It would be really nice to have binaries built and attached to GitHub releases on git tags. That would simplify [package...

enhancement

This is an umbrella issue for initial work on automating performance analysis/regression suite for bblfshd, to build a baseline benchmark. Motivation (things reported to be slow): - https://github.com/bblfsh/java-driver/issues/96 - https://github.com/src-d/empathy-sessions/issues/15...

Right now in bblfshd logs after driver installation we have ``` time="2019-03-20T08:34:57Z" level=info msg="driver cpp installed "bblfsh/cpp-driver:latest"" ``` which is not very useful for debugging as we do not know...

enhancement
help wanted

Right now, after #216 both bbfshd and driver gRCP servers have a hard-coded limit of 100MiB for the UAST. This works, but is not a generic solution - UAST size...

Same way as we have Java bindings for enry, wrapping a Go library built with `-buildmode=c-shared` it would be nice to have one for Python using ~[ctypes FFI](https://docs.python.org/3/library/ctypes.html)~ [cffi](https://cffi.readthedocs.io/en/latest/) or...

enhancement

Enry right now consist of the sequence matching of strategies that narrow down the possible language options based on different available information: - finelame + extension - first line of...

JDK version that we are using for `enry-java` release is oracle JDK 8 which EOLed 1 year ago (and e.g is not available on newer linux 16.04 CI images any...

enhancement

Steps to reproduce: ``` mkdir -p /tmp/linguist-django cd /tmp/linguist-django git clone --depth 1 https://github.com/django/django.git cd - ./enry /tmp/linguist-django/django/ 95.87% Python 1.85% JavaScript 1.65% HTML 0.63% CSS 0.01% Shell 0.00% Smarty...

bug

Part of the #155 Right now enry uses content tokenization approach [based on regexps](https://github.com/github/linguist/pull/3846/files#diff-d5179df0b71620e3fac4535cd1368d15L60) from linguist before [v5.3.2](https://github.com/github/linguist/releases/tag/v5.3.2). This issues is about enry supporting/producing same results as a new, flex-based...

enhancement