using collection names with spaces in them breaks tcia_dataset.ipynb
Describe the bug In the "Create TciaDataset" step of the notebook if you enter a collection with spaces (e.g. "Lung Phantom") into the collection variable it produces an error. Here is the relevant part of the code:
Let's take the "QIN-PROSTATE-Repeatability" collection for example
collection, seg_type = "QIN-PROSTATE-Repeatability", "SEG"
Here is the error:
KeyError Traceback (most recent call last)
KeyError: 'Lung Phantom'
Environment (please complete the following information): I was working on Google Colab
could you please have a look @yiheng-wang-nv ?
Hi @kirbyju , it is not the "names with spaces" issue.
First of all, Lung Phantom seems not a collection name, I searched in: https://www.cancerimagingarchive.net/collections/ and found that the corresponding collection name is: Phantom FDA.
In addition, within the tutorial and the doc strings of the source code (monai.apps.tcia.TciaDataset), it is mentioned that so far, only SEG or RESTRUCT image types are supported, but Phantom FDA has the type "CT". Therefore, this collection may not be supported. Thanks!
Hi @yiheng-wang-nv , Phantom FDA is a collection but Lung Phantom is also one: Lung Phantom. It contains many segmentations (SEG) of a single CT scan. This API call will show you a full inventory of the scans in this collection: https://services.cancerimagingarchive.net/nbia-api/services/v1/getSeries?Collection=Lung%20Phantom.