whichlang
Access to Training Dataset
Thank you for your awesome work!
Could you link to the dataset used in the training notebook? I am referring to the train.csv file.
In the Python notebook train.ipynb, the logistic regression is trained on two CSV files: train.csv and dataset/archive/sentences.prepared.csv.
There is no description of the format of these CSV files.
Is train.csv created by the following script? https://github.com/quickwit-oss/whichlang/blob/main/src/bin/create_train_dataset.rs