Stefan Krawczyk
the link about the Spark 3.2 integration goes nowhere -- http://apache-spark-developers-list.1001551.n3.nabble.com/VOTE-SPIP-Support-pandas-API-layer-on-PySpark-td30996.html
**Is your feature request related to a problem? Please describe.** It is conceivable to want to write Hamilton functions in an async-based way. For example, the data loading functions...
This should fix #170. If an input function was annotated with a union, we would barf. Now it won't. This still means downstream nodes need to have the same type...
Logging an issue so people can depend on and reference it more easily. A prototype PR, #119, has been up for a while and needs to be redone.
# Current behavior Creating a DAG with 100K+ nodes is slow. It took 1117.1986498832703 seconds to complete for the first example below. Execution time was then 450 seconds...
**Is your feature request related to a problem? Please describe.** Can we help users bootstrap the `check_output()` decorator? **Describe the solution you'd like** Setting up data quality is possible to...
**Is your feature request related to a problem? Please describe.** Data profiling is a way to help bootstrap creating data quality checks. Data profiling is also a way to facilitate...
**Is your feature request related to a problem? Please describe.** We should be able to configure whether we want data quality to run or not at DAG build/run time. **Describe...
These instructions could be more thorough, but I think they suffice. You need to read some anaconda docs if you really want to know what should happen. ## Changes -...
**Is your feature request related to a problem? Please describe.** We should be able to bootstrap a unit test suite given a hamilton function module. Most DS probably write functions...