Available Datasets

Open ericspod opened this issue 3 years ago • 6 comments

Let's list the available datasets that we have access to.

  • VerSe Dataset https://github.com/anjany/verse (labels are questionable)
  • ADNI
  • Decathlon

ericspod avatar Nov 29 '22 14:11 ericspod

Besides these, I tried to apply to RadImageNet, but no reply yet. Could anybody else also try to apply for it?

Warvito avatar Dec 06 '22 10:12 Warvito

I think we need to distinguish between restricted and open-source datasets, as I am not sure whether a generative model trained on restricted datasets (ADNI, UKB) can be freely shared for inference.

danieltudosiu avatar Dec 06 '22 13:12 danieltudosiu

Besides these, I tried to apply to RadImageNet, but no reply yet. Could anybody else also try to apply for it?

I applied months ago and heard nothing.

marksgraham avatar Dec 13 '22 16:12 marksgraham

I have access to the following CT datasets in a curated version from Nvidia:

Furthermore, there are lots of datasets available through TCIA (The Cancer Imaging Archive).

danieltudosiu avatar Dec 13 '22 16:12 danieltudosiu

I would like to organise the datasets we have access to into three categories: those that can be accessed without any permission or request, those that require a request but are given out to anyone who asks, and those with much more stringent requirements, such as a clear scientific justification or a grant. I'm interested in the first because we can use these, such as Decathlon, for public tutorials and information. The second category is useful for actual research as well. The third includes Biobank and others, which will be the best ones to use, but demonstrating anything with them isn't helpful to people without access.

ericspod avatar Dec 21 '22 01:12 ericspod

I have a few more that I know of; perhaps we should have a wiki page we can all add to as we find things. I know of many more sources for general machine learning datasets, such as tabular and time series data, but that's beyond imaging and not our focus quite yet.

Cardiac:

Cancer:

Imaging:

Alzheimer’s:

ericspod avatar Dec 21 '22 01:12 ericspod