YLGH
Summary: Context: We want to be able to use DistributedDataParallel in a composable way, e.g. after applying this parallelization scheme, the model's properties (including methods on the model) should...
My df consists of columns int_0 through int_12. I'm trying to turn these into an array of features; however, `df["dense_features"] = functional.array_constructor(*[df[int_name] for int_name in DEFAULT_INT_NAMES])` fails with...
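The intended transform can be sketched in plain Python without the torcharrow `functional.array_constructor` call (a minimal stand-in: the row-dict table and column values below are hypothetical, only `DEFAULT_INT_NAMES` comes from the question):

```python
# Column names int_0 .. int_12, mirroring DEFAULT_INT_NAMES from the question.
DEFAULT_INT_NAMES = [f"int_{i}" for i in range(13)]

# Hypothetical stand-in for the dataframe: a list of row dicts.
rows = [{name: i for i, name in enumerate(DEFAULT_INT_NAMES)}]

# Collapse the thirteen scalar columns into one list-valued
# "dense_features" column, the same shape array_constructor targets.
for row in rows:
    row["dense_features"] = [row[name] for name in DEFAULT_INT_NAMES]

print(rows[0]["dense_features"])  # [0, 1, 2, ..., 12]
```

The same per-row gather is what the `array_constructor(*columns)` call expresses column-wise over the whole frame.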
Summary: The current EmbeddingBagCollection/FusedEmbeddingBagCollection are only usable through the DistributedModelParallel wrapper, which overrides common torch.nn.Module APIs (named_parameters/state_dict, etc.). However, this makes these modules inflexible, and sometimes unusable without DMP....
Summary: Since we no longer rely on DistributedModelParallel (for the composability piece), we need an alternative way of getting the fused optimizer. get_fused_optimizer implements this; logically it's the same as the...
## Desired behavior

We want to make TorchRec sharding composable with other sharding/parallelism techniques. In practice, this means that after applying TorchRec sharding, the model's characteristics remain the same (e.g. state_dict() doesn't...
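The invariance being asked for can be illustrated with plain PyTorch. Here `nn.DataParallel` serves only as a stand-in wrapper that demonstrates the state_dict key-prefixing composable sharding is meant to avoid; this is a sketch, not TorchRec's actual API:

```python
import torch.nn as nn

model = nn.Linear(4, 2)
plain_keys = set(model.state_dict().keys())
print(plain_keys)  # {'weight', 'bias'}

# Wrapper-style parallelism rewrites the checkpoint namespace:
# every key gains a "module." prefix, breaking checkpoint compatibility.
wrapped = nn.DataParallel(model)
wrapped_keys = set(wrapped.state_dict().keys())
print(wrapped_keys)  # {'module.weight', 'module.bias'}

# A composable sharding API should leave plain_keys unchanged instead.
```

The desired behavior above is that sharding the model in place keeps the first key set, not the prefixed one.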
Summary: Motivation: we want to OSS the quantized comms library and refactor TorchRec's quantized comms support. This diff is rather large (it used to be a bunch of small diffs)....
Base training loop example. Run cmd: `torchx run -s local_cwd dist.ddp -j 1x8 --script train_dlrm.py`. Some TODO items: 1. Add NE/QPS metrics checkpointing 2. Show saving this model and...
Summary: Rename the quantized comms config. Differential Revision: D37221312