MIDASpy icon indicating copy to clipboard operation
MIDASpy copied to clipboard

Python package for missing-data imputation with deep learning

Results 15 MIDASpy issues
Sort by recently updated
recently updated
newest added

Looking at the codebase I could not locate a function where the trained model could be used to impute new data after training the model. There seems to be a...

question

Running MIDAS using VAE leads to deprecation warning re. tf.compat.v1.distributions. E.g. ``` >>> tf.compat.v1.distributions.Normal() WARNING:tensorflow:From :1: Normal.__init__ (from tensorflow.python.ops.distributions.normal) is deprecated and will be removed after 2019-01-01. Instructions for updating:...

enhancement

MIDASpy is currently implemented using logic of TF1 and compatibility layers. As TF2 matures and more graph-based features become deprecated (see e.g. #21), we will need to plan for larger...

enhancement
help wanted

In very large datasets (~30,000 samples x 1,000,000 features) with complex relationships (e.g. cancer omics data), the runtime for MIDAS can take a very long time (days?), even on a...

I am trying to utilize two GPUs with MIDASpy. However, I get the following error during set-up: ``` from sklearn.preprocessing import MinMaxScaler import numpy as np import pandas as pd...

enhancement
help wanted

I'm essentially running the [demo code](https://github.com/MIDASverse/MIDASpy/blob/master/Examples/midas_demo.ipynb), but with my own input data (all numeric data), and the data frames generated by `imputer.generate_samples(m=10).output_list` still have the same missing values as in...

Are there any plans of supporting python 3.9? I would really appreciate it.

enhancement

I'm working with Dirichlet distributions and the compositional data simplex, and am really enjoying MIDASpy's flexibility when dealing with this data (related to K-L divergence in the decoder). However, there...

enhancement
priority

It's usual for imputation or data pre-processing packages in python to support the scikit-learn interface. The interface allows the library to be used in data pipelines and existing scikit-learn infrastructures....

enhancement
priority

Sometimes we know that a set of variables should add up to a given total. Measurements involving proportions, percentages, probabilities, concentrations are compositional data. These data occur often in household...

enhancement
help wanted