Jerry Zhang

Results 39 issues of Jerry Zhang

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #83132 Summary: att Test Plan: visual inspection of generated docs page https://pytorch.org/docs/stable/quantization-support.html

cla signed

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #85070 Summary: This is a developer-oriented design doc/README for FX Graph Mode Quantization. The goal of the doc is for new developers...

cla signed
topic: not user facing
fx

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #85306 * #85070 Summary: att Test Plan: visual inspection of generated docs

cla signed
release notes: AO frontend

The Caffe2 C++ API is undergoing changes, since we are currently working on merging the Caffe2 and PyTorch backends. There might be breaking changes, but most of them should be easy...

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #102 Summary: att Adding this for accuracy evaluation. We also added this in the executorch repo and we'll dedup later. Test Plan: quantization:...

CLA Signed

Summary: Added an example and util for an AWQ-like flow that applies an extra equalization scale tensor to the input activation. Test Plan: python tutorials/calibration_flow/awq_like.py

CLA Signed
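
For readers unfamiliar with the idea behind the AWQ-like flow mentioned above, here is a minimal plain-Python sketch of an equalization scale. It is not the code from `awq_like.py` and does not use torchao's API; the helper names (`matmul_xt_w`, `apply_equalization`) are hypothetical, and the direction of the scaling (activation divided by the scale, weight multiplied by it) is an assumption made for illustration. The point it shows is that scaling the activation per input channel and folding the inverse into the weight leaves the linear layer's output unchanged.

```python
# Illustrative sketch only: hypothetical helpers, not torchao's API.

def matmul_xt_w(x, w):
    """Compute y = x @ w.T for x: [batch][in_features], w: [out_features][in_features]."""
    return [[sum(xi * wi for xi, wi in zip(row, w_row)) for w_row in w] for row in x]

def apply_equalization(x, w, scale):
    """Divide activations by a per-input-channel scale and multiply the
    matching weight columns by the same scale (assumed direction)."""
    x_eq = [[xi / s for xi, s in zip(row, scale)] for row in x]
    w_eq = [[wi * s for wi, s in zip(w_row, scale)] for w_row in w]
    return x_eq, w_eq

x = [[1.0, 8.0], [2.0, -4.0]]    # second input channel has larger magnitudes
w = [[0.5, 0.25], [-1.0, 2.0]]
scale = [1.0, 4.0]               # powers of two, so the float arithmetic is exact

x_eq, w_eq = apply_equalization(x, w, scale)
y_ref = matmul_xt_w(x, w)        # original linear output
y_eq = matmul_xt_w(x_eq, w_eq)   # output after equalization
```

With these values `y_eq` equals `y_ref`, while the scaled activation `x_eq` has a narrower range than `x`. In a real flow the scale would presumably be chosen from calibration data rather than fixed by hand.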

Recently we landed https://github.com/pytorch/ao/pull/939 to support tensor parallelism for int8 weight-only quantization (another example: https://github.com/pytorch/ao/pull/785). Now we can support tensor parallelism for other types of quantization as well. *...

good first issue
triaged

Currently `pip3 install torchao==0.5.0` only supports Mac and Linux, so it fails on `aarch64`: https://github.com/pytorch/torchchat/issues/1173. We want to add `aarch64` support; for an example, see https://github.com/pytorch/vision/blob/main/.github/workflows/build-wheels-aarch64-linux.yml

multibackend

Status: Draft
Updated: 09/18/2024
# Objective
In this doc we’ll talk about how different optimization techniques are structured in torchao and how to contribute to torchao.
# torchao Stack Overview...

rfc

[run_vit_b_quant.py](https://github.com/pytorch/ao/blob/main/tutorials/quantize_vit/run_vit_b_quant.py) elapsed_time: 11.0519150390625 milliseconds
[run_vit_b.py](https://github.com/pytorch/ao/blob/main/tutorials/quantize_vit/run_vit_b.py) elapsed_time: 1.2272755432128906 milliseconds
This is with `int8_dynamic_activation_int8_weight`.