bio-datasets issues

Update the dataset workflow with new structure/format

The main idea (to be confirmed though) is to have for the user the following process: - The user adds raw data files such as (csv + npy for embeddings...

theomeb

enhancement

Use vaex to load data

In order to be able to load our data with `to_npy_array` in memory

theomeb

enhancement

Compute the test coverage + add corresponding badge in the README

- The test coverage can be computed thanks to pytest job. - It is always nice for a user to know what is the test coverage of the library used.

martinp7

documentation

Add issue templates

An issue template is a good way to define the structure of the issue based on the type: bug, feature request, documentation, ...

martinp7

documentation

How to store unusual labels / Y values.

2

I've successfully uploaded a dataset (subset of PDB) but it has unusual labels in that they are matrices. Storing matrices/ndarrays/sparse arrays as a column in a `.csv` is not ideal....

sgrimbly

enhancement

Parse description.md and have different fonctions to display dataset descriptions

We should clarify the structure of the `description.md` file for a dataset. Given the structure, we would have different functions (i.e. `display_description()`, `display_summary`, etc..) that would display different parts of...

theomeb

enhancement

Add configuration file for a dataset

Configuration file to define the `dataset` and `embeddings` files as well the inputs/targets variable names (add them as attributes). - Also add an attribute when there is only one input...

theomeb

Force the download when dataset files shave changed remotely

theomeb

bio-datasets
bio-datasets copied to clipboard

Metadata

Update the dataset workflow with new structure/format

Use vaex to load data

Compute the test coverage + add corresponding badge in the README

Add issue templates

How to store unusual labels / Y values.

Parse description.md and have different fonctions to display dataset descriptions

Add configuration file for a dataset

Force the download when dataset files shave changed remotely

← Metadata

Owner

Metadata

bio-datasets bio-datasets copied to clipboard

Metadata

← Metadata

Owner

Metadata

bio-datasets
bio-datasets copied to clipboard