LongRAG

Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".

Results: 4 LongRAG issues

Hello, I downloaded the wiki raw dataset you previously mentioned and ran process_wiki_page.py with the following command: `python process_wiki_page.py --dir_path './bz_file' --output_path_dir './result' --corpus_title_path './psgs_w100.tsv'` The bz_file directory contains the...

Hi, I loaded `nq/full-00000-of-00001.parquet` and computed the answer recall with `answers, context = item["answer"], item["context"]` and `is_retrieval = has_correct_answer(context, answers)`. I could only get 0.8532 answer recall, which is below...
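
For readers following this issue, the recall computation it describes can be sketched roughly as below. This is only an illustration under stated assumptions: the repository's own `has_correct_answer` is not shown in the snippet, so the version here is a hypothetical stand-in that uses normalized substring matching, and `normalize` and `answer_recall` are names made up for this sketch. A different normalization or matching rule would give a different recall figure.

```python
import re
import string
from typing import Iterable, List, Mapping


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def has_correct_answer(context: str, answers: List[str]) -> bool:
    """Hypothetical stand-in: True if any normalized answer string
    occurs as a substring of the normalized context."""
    norm_context = normalize(context)
    for answer in answers:
        norm_answer = normalize(answer)
        # Skip answers that normalize to an empty string, since "" matches anything.
        if norm_answer and norm_answer in norm_context:
            return True
    return False


def answer_recall(items: Iterable[Mapping]) -> float:
    """Fraction of items whose retrieved context contains a gold answer.

    Each item is assumed to expose "answer" (a list of strings) and
    "context" (a single string), as in the snippet quoted in the issue.
    """
    items = list(items)
    if not items:
        return 0.0
    hits = sum(has_correct_answer(item["context"], item["answer"]) for item in items)
    return hits / len(items)


# Tiny made-up sample, not taken from the dataset.
sample = [
    {"answer": ["Paris"], "context": "The capital of France is Paris."},
    {"answer": ["Berlin"], "context": "Madrid is in Spain."},
]
recall = answer_recall(sample)
```

In practice the items would come from the parquet file named in the issue. The loading step is left out here so that the sketch stays independent of any particular reader library.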

I understand that LongRAG extracts articles from Wikipedia XML dump files and stores them in multiple files, each of which contains multiple documents in XML or JSON format. LongRAG splits...

Hi, I tried to build the index of the wiki corpus using the script you provide in `scripts/run_retrieve_tevatron.sh`. However, I find that the retrieval evaluation performance is very poor. The...