MetPy
Simplifying workflow to get from ESS Xarray datasets with a time dimension to a Pandas DataFrame for machine learning
What should we add?
Many of our datasets in Earth system science (ESS) have x, y, z, and t dimensions. While this makes multi-dimensional analysis easy, it takes a bit of work to get such data into the scikit-learn ecosystem for time series machine learning projects.
The workflow below might be out of scope for the MetPy project, but I was curious whether some of MetPy's internal tools could simplify it and make it more generalizable to all ESS xarray datasets with a time dimension.
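To make the request concrete, here is a minimal sketch of the kind of conversion in question. It is not the code from the notebook and it uses no MetPy functionality. The helper name `to_time_indexed_frame`, the variable name `air`, the dimension names, and the synthetic data are all assumptions made for illustration. It flattens every non-time dimension into columns, so each row is one time step, which is the layout scikit-learn estimators generally expect.

```python
import numpy as np
import pandas as pd
import xarray as xr

# Synthetic stand-in for an ESS dataset with (time, lat, lon) dimensions.
# Names, coordinates, and values are made up for this sketch.
times = pd.date_range("2020-01-01", periods=4, freq="D")
rng = np.random.default_rng(0)
ds = xr.Dataset(
    {"air": (("time", "lat", "lon"), rng.normal(280.0, 5.0, size=(4, 2, 3)))},
    coords={"time": times, "lat": [40.0, 42.5], "lon": [255.0, 257.5, 260.0]},
)


def to_time_indexed_frame(dataset, var, time_dim="time"):
    """Hypothetical helper: one row per time step, one column per grid point."""
    da = dataset[var]
    other_dims = [d for d in da.dims if d != time_dim]
    # to_series() yields a Series with a MultiIndex over all dimensions;
    # unstacking the non-time levels moves them into the columns.
    wide = da.to_series().unstack(other_dims)
    # Flatten the column labels into plain strings such as "air_lat40.0_lon255.0".
    labels = []
    for col in wide.columns:
        parts = col if isinstance(col, tuple) else (col,)
        labels.append(f"{var}_" + "_".join(f"{d}{v}" for d, v in zip(other_dims, parts)))
    wide.columns = labels
    return wide


frame = to_time_indexed_frame(ds, "air")
print(frame.shape)  # 4 time steps by 2 * 3 grid points
```

The resulting frame keeps the time coordinate as its index, so lagged features or train/test splits by date can be built with ordinary pandas operations before passing `frame.to_numpy()` to an estimator.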
Reference
Here is a short notebook showing how the workflow could look, using the xarray tutorial dataset.