Stefan Krawczyk

Results 157 issues of Stefan Krawczyk

the link about the spark 3.2 integration goes no where -- http://apache-spark-developers-list.1001551.n3.nabble.com/VOTE-SPIP-Support-pandas-API-layer-on-PySpark-td30996.html

**Is your feature request related to a problem? Please describe.** It is conceivable to want to write Hamilton functions in an async based way. For example, the data loading functions...

enhancement

This should fix #170. If an input function was annotated with a union, we would barf. Now it wont. This still means downstream nodes need to have the same type...

Logging an issue so people can depend/reference it more easily. Prototype PR is up from a while ago -- #119 that needs to be redone.

repo hygiene

# Current behavior Creating a DAG with 100K+ nodes is slow. It took 1117.1986498832703 seconds for it to complete for the first example below. Execution time was then 450 seconds...

bug
enhancement

**Is your feature request related to a problem? Please describe.** Can we help users bootstrap the `check_output()` decorator? **Describe the solution you'd like** Setting up data quality is possible to...

enhancement
data quality

**Is your feature request related to a problem? Please describe.** Data profiling is a way to help bootstrap creating data quality checks. Data profiling is also a way to facilitate...

enhancement
product idea
data quality

**Is your feature request related to a problem? Please describe.** We should be able to configure whether we want data quality to run or not at DAG build/run time. **Describe...

enhancement
data quality

These instructions could be more thorough, but I think they suffice. You need to read some anaconda docs if you really want to know what should happen. ## Changes -...

**Is your feature request related to a problem? Please describe.** We should be able to bootstrap a unit test suite given a hamilton function module. Most DS probably write functions...

enhancement
product idea