mdf
mdf copied to clipboard
Data-flow programming toolkit for Python
Just raising an issue to ask if this is still in use and still makes sense to use, sorry to bother
In python 3.5, I get an error running any graph, from line 304 in mdf.runner: multiprocessing.util.active_children() I'm guessing that either this has been moved in a newer version of python...
``` ctx = MDFContext() ctx[None] = 55 ``` results in runtime terminating; it would be better to throw: easier to diagnose the underlying issue (node used in indexer being None)
A quick and simple intro on what MDF is, what problems it solves and some simple examples. Also, a short explanation for the name would be useful.
pandas.DateRange is deprecated. If run() is passed a date_range parameter constructed from pandas.DateRange it will actually be of time DatetimeIndex. Consequently run() fails to extract tzinfo from date_range since it...
Add more support for vectorized operation, perhaps auto-vectorization and integration with Parakeet/Numba for dramatic speed improvements.
Tests involving serializing nodes fail on the newer pyro versions: ``` Traceback (most recent call last): File "/home/travis/build/SleepingPills/mdf/mdf/tests/test_regression.py", line 37, in test_df_differ diffs = mdf.regression.run(date_range, [df_differ], lhs=None, rhs=None) File "/home/travis/build/SleepingPills/mdf/mdf/regression/__init__.py",...