datasketch
Synchronous MongoDB Storage
While async MongoDB storage is nice for scalability, some environments (e.g. Apache Beam) don't support async I/O. Synchronous MongoDB storage would cover that use case and would also make testing on smaller data sets easier.
Good point. Would love to get some help on this one.
In the near term, would it be possible to wrap the async functionality in a synchronous wrapper? How would one do this in Python?
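One possible way to do this, as a rough sketch only: run a private event loop in a background thread and block on each coroutine with `asyncio.run_coroutine_threadsafe`. The names `SyncRunner` and `fake_async_insert` are hypothetical and are not part of datasketch; the fake coroutine just stands in for an async storage call.

```python
import asyncio
import threading


class SyncRunner:
    """Hypothetical helper: runs coroutines on a private event loop in a background thread."""

    def __init__(self):
        self._loop = asyncio.new_event_loop()
        self._thread = threading.Thread(target=self._loop.run_forever, daemon=True)
        self._thread.start()

    def run(self, coro, timeout=None):
        # Schedule the coroutine on the background loop and block until it finishes.
        future = asyncio.run_coroutine_threadsafe(coro, self._loop)
        return future.result(timeout)

    def close(self):
        # Ask the loop to stop from its own thread, then wait for the thread to exit.
        self._loop.call_soon_threadsafe(self._loop.stop)
        self._thread.join()
        self._loop.close()


async def fake_async_insert(store, key, value):
    # Stand-in for an async storage call (assumption, not the datasketch API).
    await asyncio.sleep(0)
    store.setdefault(key, []).append(value)
    return len(store[key])


if __name__ == "__main__":
    runner = SyncRunner()
    store = {}
    runner.run(fake_async_insert(store, "a", 1))
    runner.run(fake_async_insert(store, "a", 2))
    print(store)
    runner.close()
```

Unlike a bare `asyncio.run(...)`, this should also work when the caller's thread already has a running event loop. Async database clients are often bound to the loop they were created on, so the client would probably need to be created inside a coroutine that is run through the same runner.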
Perhaps the easiest thing to do is to implement a separate synchronous MongoDB storage layer. The code should be similar to, but simpler than, the async MongoDB storage.