Arjun Singh Bora comments

Results 12 comments of Arjun Singh Bora

[GOBBLIN-1703] avoid double quota increase for adhoc flows

The functionality of checking quota for every job is left unchanged; that can be done in the other PR. Handling of parallel flows is also left unchanged for the other PR.

GOBBLIN-1692 Make GobblinHelixJobScheduler stop Helix workflow asynchronously

Also, be aware that this will break any job that needs more than 15 minutes.

[GOBBLIN-830] launcher.type to job.launcher.type

Yes, let's use both the configs and mark one as deprecated.
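
To illustrate what "use both the configs and mark one as deprecated" could look like, here is a minimal sketch using plain java.util.Properties. The two key names come from the PR title above; the class name, method name, and warning text are hypothetical and are not taken from the Gobblin code base.

```java
import java.util.Properties;

// Hypothetical helper, not part of Gobblin: resolves the launcher type from
// the new key and falls back to the deprecated key when the new one is absent.
final class LauncherTypeConfig {
    static final String NEW_KEY = "job.launcher.type";
    static final String DEPRECATED_KEY = "launcher.type";

    static String resolveLauncherType(Properties props, String defaultValue) {
        // Prefer the new key whenever it is set.
        String value = props.getProperty(NEW_KEY);
        if (value != null) {
            return value;
        }
        // Fall back to the deprecated key and warn the user about it.
        String legacy = props.getProperty(DEPRECATED_KEY);
        if (legacy != null) {
            System.err.println("Config '" + DEPRECATED_KEY
                + "' is deprecated; use '" + NEW_KEY + "' instead.");
            return legacy;
        }
        // Neither key is set.
        return defaultValue;
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        props.setProperty(DEPRECATED_KEY, "LOCAL");
        System.out.println(resolveLauncherType(props, "DEFAULT"));
    }
}
```

When both keys are present, the new key wins, so existing job files keep working while new ones can migrate at their own pace.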

[GOBBLIN-1618] log number of ingested records in AsynchronusFork

Thanks for the review, but I actually no longer need this PR. I may need to add some new logs.

[GOBBLIN-1642] add debug logs in AsynchronousFork

I found a way to work around this if timing out is the issue: one can increase fork.record.queue.timeout. If that does not help, I will ask for review on this PR. Thanks!
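
As a rough sketch of the workaround mentioned in this comment, the snippet below overrides fork.record.queue.timeout on a copy of a job's properties. Only the key name comes from the comment; the helper class, the method name, and the value 5000 are assumptions for illustration, and the unit of the value is not stated in the comment.

```java
import java.util.Properties;

// Hypothetical helper, not part of Gobblin: returns a copy of the job
// properties with a larger fork record queue timeout.
final class ForkTimeoutOverride {
    static final String FORK_RECORD_QUEUE_TIMEOUT_KEY = "fork.record.queue.timeout";

    static Properties withForkQueueTimeout(Properties base, long timeout) {
        if (timeout <= 0) {
            throw new IllegalArgumentException("timeout must be positive: " + timeout);
        }
        // Copy first so the caller's properties are left untouched.
        Properties copy = new Properties();
        copy.putAll(base);
        copy.setProperty(FORK_RECORD_QUEUE_TIMEOUT_KEY, Long.toString(timeout));
        return copy;
    }

    public static void main(String[] args) {
        // 5000 is an arbitrary example value, not a recommendation.
        Properties tuned = withForkQueueTimeout(new Properties(), 5000L);
        System.out.println(tuned.getProperty(FORK_RECORD_QUEUE_TIMEOUT_KEY));
    }
}
```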

GOBBLIN-759: Added feature to support DistCP to copy files that were …

+1 LGTM

[GOBBLIN-1083] Unit test improving & return failed when helix task cancelled

Can you add a 'Description' to the PR? I did not understand why you are trying to make a HelixTask return Failed when it is cancelled.

[GOBBLIN-1083] Unit test improving & return failed when helix task cancelled

I see. Just keep in mind that sometimes we do not want to reschedule it, e.g. when the user cancelled the job in GaaS. Is there a way to do...

[GOBBLIN-2011] Fix bug where another host could report the job as skipped, then the …

I think we should not set the status to "Failed" when the last execution is running. We should instead emit a new event, "SKIPPED". With this, any further execution should be...

[GOBBLIN-2137] merged dagNodeStateStore and failedDagNodeStateStore tables

Should a) isFailedDag be ONLY within DagNode, always and everywhere, with an up-to-date value? b) isFailedDag be within DagNode, but also be in the mysql table as a cache; still be always in...