Rohit Gupta
@mariosasko, @lhoestq, @albertvillanova hey guys! can anyone help, or suggest who could help with this?
> if dataset.n_shards % world_size != 0 then all the nodes will read/stream the full dataset in order (possibly reading/streaming the same data multiple times), BUT will only yield one...
what if the number of samples in that shard % num_nodes != 0? will it break/get stuck? or is the data repeated in that case for gradient sync?
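To make the question concrete, here's a minimal sketch (not the actual `datasets` implementation) of the round-robin fallback described in the quote: when shards don't divide evenly across nodes, every node streams the full dataset and keeps every `world_size`-th example offset by its rank, so nodes can end up with different sample counts.

```python
# Hedged sketch of round-robin example distribution across nodes.
# `node_examples` is a hypothetical helper, not a `datasets` API.
def node_examples(stream, rank, world_size):
    for idx, example in enumerate(stream):
        if idx % world_size == rank:
            yield example

samples = list(range(10))  # 10 samples across 3 nodes -> uneven split
per_node = [list(node_examples(samples, r, 3)) for r in range(3)]

# node 0 gets 4 samples, nodes 1 and 2 get 3 each; the length mismatch
# is exactly what can hang gradient sync unless samples are trimmed
# or repeated on the shorter nodes.
print([len(p) for p in per_node])  # -> [4, 3, 3]
```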
hey @FlorentMeyer, mind checking the file you uploaded? it looks like it's too big and there might be some redundant stuff in it. could you clean it up?
is it possible to configure the same for discussions as well? we have labels there.
if it's on images, I would say no... it's open source.
@tchaton do you think adding an additional `'validation'` interval would be a good idea? can't think of any configuration to support it with `'step'|'epoch'`. Although there are 2 cases in...
```py
def test_multiple_dataloaders_logging(tmpdir):
    class TestModel(BoringModel):
        def validation_step(self, batch, batch_idx, dataloader_idx):
            self.log("value_1", dataloader_idx, add_dataloader_idx=False)
```
isn't this incorrect behavior since we have a single `ResultCollection` instance handling all the keys but...
@tchaton I believe `add_dataloader_idx` is meant to distinguish the metrics stored internally by appending the dataloader index to the key. The flow with multiple dataloaders is like this: ``` complete...
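A minimal sketch of the key-suffixing behavior I mean (not Lightning's actual `ResultCollection` code, and the exact suffix format is an assumption here): with `add_dataloader_idx=True` each dataloader gets its own metric key, while `add_dataloader_idx=False` makes every dataloader write to the same key, which is how the collision in the test above can arise.

```python
# Hypothetical helper illustrating add_dataloader_idx; the suffix
# format "/dataloader_idx_{i}" is an assumption for illustration.
def logged_key(name, dataloader_idx, add_dataloader_idx):
    if add_dataloader_idx and dataloader_idx is not None:
        # per-dataloader key: no collision between dataloaders
        return f"{name}/dataloader_idx_{dataloader_idx}"
    # shared key: all dataloaders log into the same entry
    return name

print(logged_key("value_1", 0, True))   # -> value_1/dataloader_idx_0
print(logged_key("value_1", 1, True))   # -> value_1/dataloader_idx_1
print(logged_key("value_1", 1, False))  # -> value_1 (keys collide)
```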
IMO, it's a good thing to support cross-reduction across dataloaders, but I'd argue from the user's point of view that what we have on master is good enough right now...