QCFractal icon indicating copy to clipboard operation
QCFractal copied to clipboard

Client-side aggregation of datasets

Open trevorgokey opened this issue 6 years ago • 0 comments

Is your feature request related to a problem? Please describe. As the number of datasets increase, with many containing the same records, it would be convenient for a way to transparently deduplicate records across multiple datasets.

Describe the solution you'd like Essentially, something that will aggregate datasets for the client. Something along the lines of aggregate_ds = ds1 + ds2 + ds3 or aggregate_ds = client.get_collection(["ds1","ds2","ds3"], "torsiondrivedataset")

Describe alternatives you've considered Right now, a pairwise comparison of record IDs for equality can solve this, but it would be great to make this more transparent.

trevorgokey avatar Apr 30 '20 23:04 trevorgokey