MachineLearningNotebooks icon indicating copy to clipboard operation
MachineLearningNotebooks copied to clipboard

`training_data` in AutoMLConfig can not be a pandas data frame

Open JCZuurmond opened this issue 4 years ago • 6 comments

I see the following exception when pandas data frame is passed as training_data in the AutoMLConfig.

ConfigException: ConfigException: Message: Input of type '<class 'pandas.core.frame.DataFrame'>' is not supported. Supported types: [azureml.data.tabular_dataset.TabularDataset]Please refer to documentation for converting to Supported types: https://docs.microsoft.com/en-us/python/api/azureml-core/azureml.core.dataset.dataset?view=azure-ml-py InnerException: None ErrorResponse { "error": { "code": "UserError", "message": "Input of type '<class 'pandas.core.frame.DataFrame'>' is not supported. Supported types: [azureml.data.tabular_dataset.TabularDataset]Please refer to documentation for converting to Supported types: https://docs.microsoft.com/en-us/python/api/azureml-core/azureml.core.dataset.dataset?view=azure-ml-py", "details_uri": "https://aka.ms/AutoMLConfig", "target": "training_data", "inner_error": { "code": "BadArgument", "inner_error": { "code": "ArgumentInvalid", "inner_error": { "code": "InvalidInputDatatype" } } } } }

The documentation says that training_data can be a pandas data frame.

Versions:

  • Python 3.6
  • SDK version: 1.36.0

Document Details

Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.

JCZuurmond avatar Jan 13 '22 13:01 JCZuurmond

Hello - I have been running into the exact same issue as @JCZuurmond. Is there any update on this?

slsu0424 avatar Apr 19 '22 23:04 slsu0424

Nope, no response on this PR yet. I don't remember what I did with this issue

JCZuurmond avatar Apr 20 '22 07:04 JCZuurmond

Any update ? running into same problem ..

majdoulinamakbal avatar Jul 10 '22 15:07 majdoulinamakbal

Any update ? running into same problem ..

kryukovaeks avatar Jul 20 '22 14:07 kryukovaeks

Hello, so here's how i solved the issue:

  1. Register your pandas dataframe as a tabular dataset using Dataset.Tabular.register_pandas_dataframe(your_dataframe,your_datastore,'a description')
  2. use the registered tabular dataset in your automl config

majdoulinamakbal avatar Jul 20 '22 15:07 majdoulinamakbal

Thank you!

Le 20 juil. 2022 à 17:09, Majdoulina Makbal @.***> a écrit :

 Hello, so here's how i solved the issue:

Register your pandas dataframe as a tabular dataset using Dataset.Tabular.register_pandas_dataframe(your_dataframe,your_datastore,'a description') use the registered tabular dataset in your automl config — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you commented.

kryukovaeks avatar Jul 22 '22 17:07 kryukovaeks