cort
cort copied to clipboard
A toolkit for coreference resolution and error analysis.
I noticed some discrepancies between the 2 different traversal orders. `reversed()` on line 108 returns an iterator and if it gets exhausted in the first iteration of the loop on...
Because `CoNLLDocument.document_table` is a (mutable) list of lists, [line 531](https://github.com/smartschat/cort/blob/c637594b57ab9f556a236bd94fb2b6c75ac4e550/cort/core/documents.py#L531) also changes the original table, because it is not copied in the following line: https://github.com/smartschat/cort/blob/c637594b57ab9f556a236bd94fb2b6c75ac4e550/cort/core/documents.py#L528
cort currently needs a lot of RAM, predicting with the latent ranking model on the CoNLL-2012 development data takes ~8GB, mainly due to multiprocessing during feature extraction.
Is it possible to retrain models (for example, the one's from https://github.com/smartschat/cort/blob/master/COREFERENCE.md#model-downloads) with new data? I tried training using- ``` cort-train -in new_retraining_data.conll \ -out pretrained_model.obj \ -extractor cort.coreference.approaches.mention_ranking.extract_substructures \...
Hello, I encountered a few problems while trying to train a model with the gold standard version of the conll-2012 training set (*_gold_conll). The first issue occurs during the conversion...
I find this behaviour counter-intuitive: if you read a corpus (using `Corpus.from_file`) and write it out right away (using `write_to_file`), all set IDs are lost, i.e. the last column contains...
I was trying to run cort-predict-raw with following command: > python3.5 /usr/local/bin/cort-predict-raw -in ~/data/pilot_44_docs/*.txt > -model models/model-pair-train.obj > -extractor cort.coreference.approaches.mention_ranking.extract_substructures > -perceptron cort.coreference.approaches.mention_ranking.RankingPerceptron > -clusterer cort.coreference.clusterer.all_ante > -corenlp ~/systems/stanford/stanford-corenlp-full-2016-10-31 and...
I visualised coreference errors (errors_by_type.visualize()), but it is not possible to scroll the left part of the visualisation (the right part with the text works well). I am still using...