whichlang

Access to Training Dataset

Open: omarmhaimdat opened this issue 2 years ago • 1 comment

Thank you for your awesome work!

I was wondering if you could link to the dataset used in the training notebook. I am referring to the train.csv file.

omarmhaimdat, May 24 '23 11:05

In the Python notebook train.ipynb, the logistic regression is trained on two CSV files: train.csv and dataset/archive/sentences.prepared.csv.

There is no description of the format of these CSV files.
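
Until the format is documented, one way to find out what the columns are is to look at the first few rows of each file. Below is a minimal sketch, assuming the files are UTF-8 encoded and comma-separated (neither is confirmed anywhere in the repository). `peek_csv` is a hypothetical helper written for this comment, not part of whichlang.

```python
import csv
from itertools import islice


def peek_csv(path, n_rows=3):
    """Return the first row of a CSV file plus up to n_rows rows after it.

    Assumption: UTF-8 encoding and comma delimiter. The first row may be
    a header or plain data; this function does not try to guess which.
    """
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        first = next(reader, None)  # None if the file is empty
        sample = list(islice(reader, n_rows))
    return first, sample
```

Calling `peek_csv("train.csv")` and `peek_csv("dataset/archive/sentences.prepared.csv")` from the notebook's directory should show how many columns each file has and whether the first row looks like a header.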

Is train.csv created by create_train_dataset.rs (https://github.com/quickwit-oss/whichlang/blob/main/src/bin/create_train_dataset.rs)?

LeMoussel, May 26 '23 08:05