Simplifying workflow to get from ESS Xarray datasets with a time dimension to a Pandas DataFrame for machine learning

Open ThomasMGeo opened this issue 2 years ago • 0 comments

What should we add?

Many of our datasets in earth systems science (ESS) have x, y, z, and t dimensions. While this makes multi-dimensional analysis easy, it takes a bit of work to get the data into the scikit-learn ecosystem for time series machine learning projects.

The workflow below might be out of scope for the MetPy project, but I was curious whether some of the internal MetPy tools could simplify it and make it more generalizable to all ESS xarray datasets with a time dimension.
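For concreteness, here is a minimal sketch of the kind of conversion in question, using only plain xarray and pandas calls. It is not the notebook's code and uses no MetPy helpers. The small synthetic dataset and the names `air`, `long_df`, and `wide_df` are assumptions made up for illustration.

```python
import numpy as np
import pandas as pd
import xarray as xr

# Synthetic stand-in for an ESS dataset with (time, lat, lon) dimensions.
# The variable name "air" and the coordinate values are illustrative only.
times = pd.date_range("2023-01-01", periods=4, freq="D")
lats = [40.0, 42.5]
lons = [255.0, 257.5, 260.0]
rng = np.random.default_rng(0)
ds = xr.Dataset(
    {"air": (("time", "lat", "lon"), rng.normal(280.0, 5.0, size=(4, 2, 3)))},
    coords={"time": times, "lat": lats, "lon": lons},
)

# Long format: one row per (time, lat, lon) sample, coordinates as columns.
long_df = ds.to_dataframe().reset_index()

# Wide format: one row per time step, one column per (lat, lon) grid cell.
# This layout maps directly onto a scikit-learn style feature matrix.
wide_df = ds["air"].to_series().unstack(["lat", "lon"])

print(long_df.head())
print(wide_df.shape)
```

The long format is convenient when the coordinates themselves should be features; the wide format treats each grid cell as a separate feature indexed by time. Which one fits depends on the model, so a generalized helper would probably need to support both.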

Reference

Here is a short notebook showing how the workflow could look using the xarray tutorial dataset.

ThomasMGeo, Nov 10 '23 16:11