Ken Tsui issues

Results: 5 issues of Ken Tsui

Constructing Wikihow for QA with Metadata and Different Response Format

This issue explores whether building a QA dataset based on WikiHow is possible and necessary.

## Existing datasets built on WikiHow

- Summarisation: https://arxiv.org/pdf/1810.09305.pdf
- Commonsense: https://arxiv.org/abs/1905.07830
- Subset of QA: https://huggingface.co/datasets?search=wikihow...

data

Add DEBUG_USE_SEED_DATA_PATH in config to make seed data flexible

Closes #322. Factors out the fixed seed data by adding DEBUG_USE_SEED_DATA_PATH to the config, which controls which seed data is used.

backend
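A minimal sketch of how a configurable seed data path could work, assuming the setting is read from an environment variable of the same name. The default path, the JSON format, and both helper functions are hypothetical and are not taken from the project's actual config.

```python
import json
import os
from pathlib import Path

# Hypothetical default; the real project may keep its seed data elsewhere.
DEFAULT_SEED_DATA_PATH = Path("test_data/seed_data.json")


def resolve_seed_data_path(env=None):
    """Return the seed data path, preferring DEBUG_USE_SEED_DATA_PATH if set."""
    env = os.environ if env is None else env
    override = env.get("DEBUG_USE_SEED_DATA_PATH")
    return Path(override) if override else DEFAULT_SEED_DATA_PATH


def load_seed_data(env=None):
    """Load seed records from whichever JSON file the config points at."""
    path = resolve_seed_data_path(env)
    with path.open(encoding="utf-8") as f:
        return json.load(f)
```

With this shape, switching to a different seed file only requires setting the variable; the loading code itself stays unchanged.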

Update retrieval.md

Added more documents and papers in the retrieval direction.

Integrate with LLM evaluation frameworks

Integrate MDEL with various evaluation frameworks:

- [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness)
- [helm](https://github.com/stanford-crfm/helm)

Evaluation

Automatic Training Scripts for All Expert Models

If most training scripts are homogeneous except for the data_path args/config (I assume they are, as they all started from the same seed LM), then we could do a script that...

Trainer
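The issue text above is truncated, so the following is only one possible reading of the idea: if the training runs differ only in `data_path`, a small driver could generate one command per expert. The expert names, the `train.py` script name, and the `--output_dir` flag are hypothetical placeholders, not part of the original proposal.

```python
import shlex

# Hypothetical expert names and data paths, for illustration only.
EXPERT_DATA_PATHS = {
    "expert_a": "data/expert_a",
    "expert_b": "data/expert_b",
}


def build_train_command(data_path, output_dir, script="train.py"):
    """Build one training command; only data_path and output_dir vary."""
    return ["python", script, "--data_path", data_path, "--output_dir", output_dir]


def build_all_commands(expert_data_paths, output_root="checkpoints"):
    """Return one command per expert, sharing every other argument."""
    return {
        name: build_train_command(path, f"{output_root}/{name}")
        for name, path in expert_data_paths.items()
    }


if __name__ == "__main__":
    # Print the commands instead of running them, so the sketch has no side effects.
    for name, cmd in build_all_commands(EXPERT_DATA_PATHS).items():
        print(name, "->", shlex.join(cmd))
```

Keeping command construction separate from execution makes it easy to inspect the generated commands before handing them to a job scheduler or `subprocess.run`.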