chet

Results: 11 comments by chet

:+1: would love to have this

@davidmarin thanks for the note! SSH seems reasonable, but would something like this not work as a final step on a job? ``` def clear_hdfs(boto_client, cluster_id, name, hdfs_path): response =...

@quidproquo this isn't so much an issue for me anymore since I don't use pooling now that Amazon has per-minute billing. I ended up not needing to put this in...

👍 I need access to some of the recently added objects, like `ET_Send`. I can copy the source down directly in the meantime, but it would be awesome to have it up on PyPI...

@chrisbetz I've been looking into this and have a local branch wrapping the `DataFrame` API using `flambo`. One issue is that the Spark SQL API in 1.2 and 1.3 is...

@chrisbetz Great. Both those sound like viable options; once you pick a route I'll see where we could take support for this. Have a great Easter.

@erasmas I'm currently working on getting DataFrame support into Flambo, since that's what I'm using in prod (looking at switching to sparkling once I get some time...

I found myself trying to scan by passing in the same config file I used to add. Seems like that should be sufficient?

Interesting. I'm using the latest version. I'm brand new and am trying to set this up for the first time to scan multiple repos. I'm still getting up to speed but will...

Sure. I was thinking I could pass the `repos.yaml` file as the parameter to scanning, so you could have your configuration passed directly into the scan process. ``` # add...