HBase Scalding Store based on maple/storehaus
Thanks for working on this!
I've incorporated the feedback. I removed the error check for the empty pipe in readLast; it looks like ScaldingStore's merge should handle this case correctly.
In addition to using a converted HBaseByteArrayStore, I also renamed main() in ScaldingRunning so that the "getting started" wiki instructions aren't affected by another main() in summingbird-example.
Any update on this PR? I would love to get this merged, as I have to use HBase with Summingbird too.
Sorry for the slow response. @MansurAshraf I guess you have reviewed the HBase parts. We don't use HBase much at Twitter, so I'm only giving a Summingbird review.
I don't see how this code tracks which batches this store has completed. That information needs to be available at planning time, and the code currently always claims to have data, which will not be correct.
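To make the planning-time requirement concrete, here is a minimal sketch of the kind of decision a planner needs to make from the store's recorded state. It is not Summingbird's API: `BatchPlanner`, `nextBatchToRun`, and the use of a plain `long` for the batch ID are all hypothetical, chosen only to illustrate that the answer depends on the last completed batch rather than being fixed.

```java
import java.util.OptionalLong;

// Hypothetical helper, not part of Summingbird: decides at planning time
// which batch a job should compute next, based on what the store reports.
final class BatchPlanner {
    private BatchPlanner() {}

    /**
     * @param lastCompleted last batch the store has fully written, or empty if none
     * @param initialBatch  batch to start from when the store holds no data yet
     * @param requestedUpTo last batch (inclusive) the caller wants computed
     * @return the next batch to run, or empty if the store is already up to date
     */
    static OptionalLong nextBatchToRun(OptionalLong lastCompleted,
                                       long initialBatch,
                                       long requestedUpTo) {
        // A store with no recorded state must not claim to have data:
        // the planner falls back to the initial batch instead.
        long next = lastCompleted.isPresent()
                ? lastCompleted.getAsLong() + 1
                : initialBatch;
        return next <= requestedUpTo ? OptionalLong.of(next) : OptionalLong.empty();
    }
}
```

With this shape, a store that always reports data would make the planner skip batches that were never written, which is the failure described above.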
That said, we do need to build some kind of framework to make it easier to test Stores.
I added code to write and read the last processed BatchID to and from ZooKeeper (ZK). I have to confess that I don't have much experience interacting with ZK directly (I mostly use systems that depend on it), so I'm quite open to feedback on how to do this better. The ideal approach seems to be registering a Watcher with the yet-to-be-created ZooKeeper WaitingState, so that the BatchID is written only once the Scalding job has completed successfully.
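The "write the BatchID only after the job succeeds" idea can be sketched independently of ZooKeeper. The example below is an assumption-laden illustration, not the code in this PR: `BatchCheckpoint`, `InMemoryCheckpoint`, and `CheckpointedRunner` are hypothetical names, and the in-memory byte array merely stands in for the data of a znode. A real implementation would replace it with ZooKeeper client calls and handle their checked exceptions.

```java
import java.nio.ByteBuffer;
import java.util.OptionalLong;
import java.util.function.BooleanSupplier;

// Hypothetical abstraction over wherever the last completed batch ID lives.
interface BatchCheckpoint {
    OptionalLong readLast();
    void write(long batchId);
}

// In-memory stand-in for a znode: the batch ID is stored as 8 big-endian bytes.
final class InMemoryCheckpoint implements BatchCheckpoint {
    private byte[] data; // null models a node that has not been created yet

    @Override
    public synchronized OptionalLong readLast() {
        return data == null
                ? OptionalLong.empty()
                : OptionalLong.of(ByteBuffer.wrap(data).getLong());
    }

    @Override
    public synchronized void write(long batchId) {
        data = ByteBuffer.allocate(Long.BYTES).putLong(batchId).array();
    }
}

final class CheckpointedRunner {
    private CheckpointedRunner() {}

    // Runs the job and records the batch ID only if the job reports success,
    // so a failed run leaves the previously recorded batch untouched.
    static boolean runBatch(BatchCheckpoint checkpoint, long batchId, BooleanSupplier job) {
        boolean succeeded = job.getAsBoolean();
        if (succeeded) {
            checkpoint.write(batchId);
        }
        return succeeded;
    }
}
```

Keeping the checkpoint behind an interface like this would also make the store easier to test without a running ZooKeeper ensemble.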