byteandbark
I am running a HyperDrive run with a PyTorch model, and some of my child runs hang with requests in the log as below:
2022-04-20 12:27:52,149|azureml.BatchTaskQueueAdd_1_Batches.WaitFlushSource:BatchTaskQueueAdd_1_Batches|DEBUG|[STOP]
2022-04-20 12:27:52,273|azureml._SubmittedRun#HD_e32fc294-eadf-405e-96ac-970bdd35bc49_3.RunHistoryFacade.MetricsClient._post_run_metrics_log_failed_validations-async:False|DEBUG|[STOP]
2022-04-20 12:28:11,735|azureml.core.authentication|DEBUG|Time...
I know this is not a graph issue, but I thought I'd open an issue as you might be able to pass it along internally. I have adapted the .md document...