Zehan Li
I'm trying to train a T5 model with the `transformers` library, which requires the `sentencepiece` library to tokenize sentences. But after installing it with `pip install sentencepiece`, I can't import...
Hi, I noticed that your `beir==0.2.3` depends on `faiss-cpu`. However, in your NeurIPS 2021 paper you benchmarked several dense retrieval models on GPU. Did you use `faiss-gpu` for that? Have you...
Hi, I was trying to run your `evaluate_bm25.py` baseline, but I got the following error. There may be some problem with `elasticsearch`. Could you please help me fix it? ```...
Hi, I tried your BBTv2 code but failed to get results comparable to those reported in your paper. In my case, using the command ```Python python deepbbt.py --model_name "roberta-large" --task_name "snli"...
## 🐛 Bug When I walk through your [docs](https://fairseq.readthedocs.io/en/latest/tasks.html), I find that the following code raises `TypeError: 'EpochBatchIterator' object is not iterable` ``` # setup the task (e.g., load dictionaries)...
Hi, I'm a little confused about the version. Is this an intermediate checkpoint? How was it trained? How does it differ from v1 and v2? Is training data...
Hi, I noticed that the squad dataset on [hf](https://huggingface.co/datasets/Tevatron/wikipedia-squad) has an empty answers field for all instances. Maybe there is a problem during data processing?
Hi, does this library support loading pre-trained tokenizers trained by other libraries, like `sentencepiece` and `tiktoken`? Many models on the hf hub store their tokenizers in these formats.
Hi, when I'm using a custom m_mmlu task, there is an error like this ``` Generating test split: 12909 examples [00:00, 512686.14 examples/s] Traceback (most recent call last): File "/output/lm-evaluation-harness/lm_eval/__main__.py",...
This is a 4-choice task: `option_e` is null for all but 3 samples, and it is never used as the gold answer.
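
A claim like this can be checked with a short script. The sketch below is only an illustration: the rows are made-up sample data, and the `answer` field and the other `option_*` names are assumptions; only `option_e` comes from the report above.

```python
from collections import Counter

# Hypothetical rows standing in for the real dataset; only `option_e`
# is a field name taken from the report, everything else is assumed.
rows = [
    {"option_a": "1", "option_b": "2", "option_c": "3", "option_d": "4",
     "option_e": None, "answer": "B"},
    {"option_a": "w", "option_b": "x", "option_c": "y", "option_d": "z",
     "option_e": "extra", "answer": "A"},
    {"option_a": "p", "option_b": "q", "option_c": "r", "option_d": "s",
     "option_e": None, "answer": "D"},
]


def summarize_option_e(rows):
    """Return (rows with a non-null option_e, rows whose gold answer is 'E')."""
    non_null = sum(1 for row in rows if row.get("option_e") is not None)
    gold_counts = Counter(row["answer"] for row in rows)
    return non_null, gold_counts.get("E", 0)


non_null, gold_e = summarize_option_e(rows)
print(f"non-null option_e: {non_null}, gold answer 'E': {gold_e}")
```

If the second number is 0 on the real data, dropping `option_e` from the task would not change any gold label.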