Segmentation issue keeps crashing the kernel
issue.csv The AutoMLSearch keeps crashing with the simple dataset. Running the code in terminal gives segmentation error. If run in Jupyter Notebook, the notebook crashes and gets restarted.
import pandas as pd
import evalml
fm = pd.read_csv('issue.csv')
fm.ww.init()
fm.ww.describe()
y = fm.ww.pop('label')
automl = evalml.AutoMLSearch(
X_train=fm,
y_train=y,
problem_type='binary',
random_seed=3,
max_batches=5
)
automl.search()
The data doesn't have infinity or null values. In principle, it shouldn't crash the kernel even though it won't get an amazing model.
The search went through once the search scope is limited to random forest and linear_model:
allowed_model_families=["random_forest", "linear_model"].
Further investigation finds the problem is with lightgbm. As long as lightgbm isn't here, the search would be fine.
Thanks for reporting and investigating @enfeizhan. Could you share what evalml and lightgbm versions you're running with, as well as a bit more information about your data (types, size, etc)?