Ogundepo Odunayo issues

Results 15 issues of


                                            Ogundepo Odunayo

Tokenizer VIsualizer

I tried using the tokenizer visualizer but it doesn't seem to work when I load the tokenizer using `AutoTokenizer.from_pretrained()`. Here's the error I'm getting below: ``` --------------------------------------------------------------------------- AttributeError Traceback (most...

Stale

Adding Support for Yoru

Added support for Yoruba Language Language Code = 'yo ![11507 Danpascu jẹ plánẹ tì kékeré ní ibi ìgbàjá ástẹ rọ ìdì_0](https://user-images.githubusercontent.com/38908008/136361842-28c456ca-93c0-40bb-9716-8895fb3ba337.jpg) ![Àyọkà yìí tàbí apá rẹ únfẹ àtúnṣe sí_1](https://user-images.githubusercontent.com/38908008/136361844-f46f01ce-adfe-4678-9595-b04dcd24150e.jpg) '

AttributeError: 'GCTrainer' object has no attribute 'scaler'

Hi @luyug, any idea on how to fix this? 04/14/2022 15:48:04 - INFO - tevatron.trainer - Initializing Gradient Cache Trainer Traceback (most recent call last): File "/home/odunayo/anaconda3/envs/tevatron_env/lib/python3.9/runpy.py", line 197, in...

Fix Dependency Issues

- Updated requirements to use a more recent version of pygaggle and pyserini. - The existing version of pyserini in the code cannot load Lucene indexes from the current Anserini...

MrTydi Updated regressions (MIRACL)

Extend Trec to DPR Run Converter to use Custom Topics ?

The file [convert_trec_run_to_dpr_retrieval_run.py](https://github.com/castorini/pyserini/blob/master/pyserini/eval/convert_trec_run_to_dpr_retrieval_run.py) only allows the conversion of topics currently checked into anserini. I guess we can open this up to use custom query files also? https://github.com/castorini/pyserini/blob/2673031f6b202941fe0f9953c9b876e6d4f1e653/pyserini/eval/convert_trec_run_to_dpr_retrieval_run.py#L26-L37 I can see...

Ogundepo Odunayo

Tokenizer VIsualizer

Adding Support for Yoru

AttributeError: 'GCTrainer' object has no attribute 'scaler'

Fix Dependency Issues

MrTydi Updated regressions (MIRACL)

Extend Trec to DPR Run Converter to use Custom Topics ?

Refactor Dependencies

Add Colbert MLX

Segment Open Text like Wikipedia into Passages

Efficient Memory Management