DavidFarago

Results 21 issues of DavidFarago

Is there (or will there be) an option to validate a very large JSON file (up to 5GB) in chunks, e.g. via streaming, so that the whole JSON file never...

The output of pytest shows, e.g.:

> pytest-parallel: 4 workers (processes), 5 tests per worker (threads)
> test/test_event_driven.py::test_foo FAILED
> test/test_event_driven.py::test_foo FAILED
> test/test_event_driven.py::test_bar PASSED
> test/test_event_driven.py::test_bar PASSED
> ......

I use `pytest --slack_hook=https://hooks.slack.com/services/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 --slack_username=IGNORED --slack_channel='Dave Far'`. The [result](https://i.ibb.co/r5kz4Qg/Screenshot-from-2020-03-24-10-35-55.png) shows "Pytest", not "IGNORED". I tried many strings, also in quotes, but always get "Pytest".

I have problems resuming a checkpoint. What I did:

1) `python qlora.py --model_name_or_path huggyllama/llama-7b`
2) abort when a checkpoint has been written
3) `python qlora.py --model_name_or_path huggyllama/llama-7b`

I expected fine-tuning...

### 🐛 Bug My experiment fails pretty early, with the following stacktrace upon BLEU evaluation: ``` 2023-05-18 17:58:52,287 - INFO: Validation BLEU: 0.32177 2023-05-18 17:58:52,333 - ERROR: Exception occurred during...

**Describe the bug** Having trained two (mistral) models (with `model_type` either `llm` or `ecd`), I wanted to create a `compare_performance` visualization via `ludwig visualize --visualization compare_performance --test_statistics dir1/training_statistics.json dir2/training_statistics.json`, but...

Calling `python merge_peft_adapter.py --model_name ./reward_model_vicuna-7b` yields ``` ===================================BUG REPORT=================================== Welcome to bitsandbytes. For bug reports, please run python -m bitsandbytes and submit this information together with your error trace to:...

Since https://github.com/lm-sys/FastChat/ does not publish its data, but mentions it "enhanced the training scripts provided by Alpaca to better handle multi-round conversations and long sequences", I looked at ShareGPT Vicuna...

I would like to use data-measurements-tool for my NER dataset. However, the label field `ner_tags` consists of a list of tags. Consequently, `python3 run_data_measurements.py` crashes with a `TypeError` (see details...

Since the `multi-v0` models are no longer available at [hu-berlin], switch to the `de` model that is also used in production. With this change, the tests pass again. Fix https://github.com/pd3f/dehyphen/issues/4