Yanbo Liang

Results 24 issues of Yanbo Liang

# Bug report A clear and concise description of what the bug is. Include a minimal, reproducible example (https://stackoverflow.com/help/minimal-reproducible-example), if possible. Repo: ``` class MyConfig(dict): def __getattr__(self, x): return self[x]...

type-bug

Fixes 14k github models: https://github.com/jansel/pytorch-jit-paritybench/blob/master/generated/test_ELEKTRONN_elektronn3.py cc @mlazos @soumith @voznesenskym @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @chunyuan-w @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @desertfire

ciflow/trunk
module: dynamo
ciflow/inductor

Fixes Meta internal user case, see the pattern in unit test. cc @mlazos @soumith @voznesenskym @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @chunyuan-w @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @desertfire

ciflow/trunk
module: dynamo
ciflow/inductor

This is based on #57. Please checkout https://github.com/yanboliang/gpt-fast/tree/mixtral-moe to try this. Performance numbers (tokens/second): ``` | | 1 GPU | 2 GPU | 8 GPU | |------------------|---------|-----------|-------------| |baseline(bfloat16)| OOM |...

CLA Signed

Fixes #120387 cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @aakhundov @kadeng

topic: not user facing
module: dynamo
ciflow/inductor

Run different head_dims [64, 128], which are the most popular ones across major GPT models. Enumerate different ```BLOCK_M``` and ```BLOCK_N``` candidates [16, 32, 64, 128], and get the best config...

ciflow/trunk
topic: not user facing
oncall: pt2
module: inductor
ciflow/inductor

### 🐛 Describe the bug Repro: ``` import torch import torch.nn.functional as F class BasicModule(torch.nn.Module): def __init__(self): super().__init__() self.linear1 = torch.nn.Linear(10, 10) self.scale = torch.randn(1, 10) def forward(self, x): x...

oncall: pt2
module: dynamo

## H100 ### torch.bfloat16 No major change, as expected. ``` | Type | Speedup | batch_size | num_heads | q_seq_len | k_seq_len | head_dim | score_mod | dtype | |---------|-----------|--------------|-------------|-------------|-------------|------------|-------------|----------------|...

topic: not user facing
module: inductor
ciflow/inductor

I'm adding this prototype to demonstrate a possible way that how intermediate logging works with torch.compile. The following example is a minimized repro and how it ladders up with proposed...

Stale
release notes: fx
module: inductor
module: dynamo
ciflow/inductor

Fixes #ISSUE_NUMBER

release notes: releng
ciflow/inductor-micro-benchmark