Xingjian Shi
Xingjian Shi
*Issue #, if available:* *Description of changes:* By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
Test-time augmentations are very useful for improving the vision model performance. We can offer test-time augmentation to AutoMM via another `tta` flag in the `.predict()` API: ```python predictor = AutoMMPredictor(...)...
## Description Currently, `nlp_data` is using wikiextractor with version 0.1 but the latest version is https://pypi.org/project/wikiextractor/3.0.4/. It will be good to try to upgrade to the latest version. @DOUDOU0314 If...
## Description Add the CC-100 corpus that can be used for pretraining to `nlp_data`. http://data.statmt.org/cc-100/
## Description OSCAR Corpus: https://oscar-corpus.com/ ``` @inproceedings{ortiz-suarez-etal-2020-monolingual, title = "A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages", author = "Ortiz Su{\'a}rez, Pedro Javier and Romary, Laurent and Sagot,...
## Description ## Implements https://github.com/dmlc/gluon-nlp/issues/1486 ## Checklist ## ### Essentials ### - [ ] PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc) - [ ]...
## Description https://github.com/dmlc/gluon-nlp/pull/1374 has been merged so we have fixed the warnings in our documents. However, the current structure of the website is not very satisfactory and we should try...
## Description @leezu @szha @xinyual I noticed that we may need to set `strip_accents` to None in https://github.com/dmlc/gluon-nlp/blob/223f1f6f8e267d258abd2f299ec6fc4a9b2f1cf8/src/gluonnlp/data/tokenizers/huggingface.py#L564 so that it will be turned on when `lowercase` is True. This...
## Description We should improve the README here: https://github.com/dmlc/gluon-nlp/tree/master/scripts/conversion_toolkits
## Description We fixed TVM integration in GluonNLP recently for fp32 dtype. However, we still do not support fp16 dtype. We should - [ ] Revise the test to add...