Xingjian Shi

Results 45 issues of Xingjian Shi

*Issue #, if available:* *Description of changes:* By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Test-time augmentations are very useful for improving the vision model performance. We can offer test-time augmentation to AutoMM via another `tta` flag in the `.predict()` API: ```python predictor = AutoMMPredictor(...)...

enhancement
module: multimodal

## Description Currently, `nlp_data` is using wikiextractor with version 0.1 but the latest version is https://pypi.org/project/wikiextractor/3.0.4/. It will be good to try to upgrade to the latest version. @DOUDOU0314 If...

enhancement

## Description Add the CC-100 corpus that can be used for pretraining to `nlp_data`. http://data.statmt.org/cc-100/

enhancement
good first issue

## Description OSCAR Corpus: https://oscar-corpus.com/ ``` @inproceedings{ortiz-suarez-etal-2020-monolingual, title = "A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages", author = "Ortiz Su{\'a}rez, Pedro Javier and Romary, Laurent and Sagot,...

enhancement
good first issue

## Description ## Implements https://github.com/dmlc/gluon-nlp/issues/1486 ## Checklist ## ### Essentials ### - [ ] PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc) - [ ]...

## Description https://github.com/dmlc/gluon-nlp/pull/1374 has been merged so we have fixed the warnings in our documents. However, the current structure of the website is not very satisfactory and we should try...

enhancement
help wanted

## Description @leezu @szha @xinyual I noticed that we may need to set `strip_accents` to None in https://github.com/dmlc/gluon-nlp/blob/223f1f6f8e267d258abd2f299ec6fc4a9b2f1cf8/src/gluonnlp/data/tokenizers/huggingface.py#L564 so that it will be turned on when `lowercase` is True. This...

bug

## Description We should improve the README here: https://github.com/dmlc/gluon-nlp/tree/master/scripts/conversion_toolkits

enhancement
documentation

## Description We fixed TVM integration in GluonNLP recently for fp32 dtype. However, we still do not support fp16 dtype. We should - [ ] Revise the test to add...

enhancement
help wanted