Results 6 comments of Michal Lukasik

Thanks for the reply! I am rerunning with 100 max_leaf_size to see if it will pass, however I think it might hurt classification accuracy. I didn't see any out of...

Thanks for following up. I am trying to run the training with --max_leaf_size 100 and --threads 5, but it seems to be training forever...

When running with 5 threads and 5 trees, I got this error message: 9790000 docs encoded 9800000 docs encoded Traceback (most recent call last): File "/usr/local/bin/fxml.py", line 4, in __import__('pkg_resources').run_script('fastxml==2.0.0',...

My machine has actually quite a lot of memory: mlukasik@mlukasik:~/workspace/fastxml_py$ cat /proc/meminfo MemTotal: 65865896 kB Is it because we try to load all data at once?