bio-datasets icon indicating copy to clipboard operation
bio-datasets copied to clipboard

Free collection of Bio datasets and embeddings

Results 8 bio-datasets issues
Sort by recently updated
recently updated
newest added

The main idea (to be confirmed though) is to have for the user the following process: - The user adds raw data files such as (csv + npy for embeddings...

enhancement

In order to be able to load our data with `to_npy_array` in memory

enhancement

- The test coverage can be computed thanks to pytest job. - It is always nice for a user to know what is the test coverage of the library used.

documentation

An issue template is a good way to define the structure of the issue based on the type: bug, feature request, documentation, ...

documentation

I've successfully uploaded a dataset (subset of PDB) but it has unusual labels in that they are matrices. Storing matrices/ndarrays/sparse arrays as a column in a `.csv` is not ideal....

enhancement

We should clarify the structure of the `description.md` file for a dataset. Given the structure, we would have different functions (i.e. `display_description()`, `display_summary`, etc..) that would display different parts of...

enhancement

Configuration file to define the `dataset` and `embeddings` files as well the inputs/targets variable names (add them as attributes). - Also add an attribute when there is only one input...