Artidoro Pagnoni comments

Results 50 comments of


                                            Artidoro Pagnoni

Can undo changes made by others

Undo is a central part of the development process which involves trying and reverting if needed. Having a global undo makes it very difficult for two people to work on...

Dataset

Hello @shashiongithub I am also having trouble downloading the dataset. After rerunning the script > 75 times I still have 11 articles that cannot be downloaded. I would like to...

Punctuation not consistent for M19 (Text Summarization with Pretrained Encoders)

Verifying with the original output: https://github.com/nlpyang/PreSumm, it seems like the model uses `` tokens for separation between sentences in the decoded outputs. So this seems to be problem in the...

Punctuation not consistent for M19 (Text Summarization with Pretrained Encoders)

I found a temporary solution: the double space seems to be indicating the missing period.

Support custom mapping without type parameters

@klausmh the main limitation however, is that the mapping cannot be stateful. It can only map rows to new rows with no external state for the mapping. For example, it...

RuntimeError: CUDA error: an illegal memory access was encountered

I also got this when using `decapoda-research/llama-7b-hf`. With another hf conversion (more recent I think) I did not get the problem. I recommend using newer conversions if possible. It looks...

Bug Fix: 443 Bytes `adapter_model.bin` files

Thank you @KKcorps! I also just replicated your fix and it seems to properly store the adapter checkpoints.

GenerationConfig argument for Seq2SeqTrainer / Seq2SeqTrainingArgument

Hello! It's a great idea to be able to pass GenerationConfig to the Seq2SeqTrainer. However, it would be great to have a matching `GenerationArguments` class that allows parsing. Right now...

Introduces load_from_disk datasets

Ideally huggingface does the parsing for us. We should stay away from deciding what is local and what is on the hub. Also isn't this handled by `load_dataset`? https://huggingface.co/docs/datasets/package_reference/loading_methods#datasets.load_dataset Please...

Strage error while launching the code

I haven't seen that error before, I would suggest using one GPU for debugging as this might be related to DDP. One A100 should easily fit the 7B model. https://discuss.pytorch.org/t/ddp-and-gradient-checkpointing/132244