Max Podkorytov

Results 13 issues of Max Podkorytov

Summary: Prepare to supply lightweight wrappers instead of `std::vector` Differential Revision: D35620952

fb-exported
cla signed

See issue https://github.com/prestodb/presto/issues/17359 Test plan - Added testMaterializedViewResultForThreeTableJoin() in TestHiveLogicalPlanner.java and made sure it passes ``` == RELEASE NOTES == Hive Changes * Support materialized views of joins of multiple...

stale

Summary: we can iterate on the structure of the container if/when we want to it adds some flexibility to the order of the kernel calls during runtime Differential Revision: D43612905

CLA Signed
fb-exported

Differential Revision: D43131944

CLA Signed
fb-exported

Summary: created a module mimicking torch.nn interface separated iteration logic from inputs specification Differential Revision: D43547253

CLA Signed
fb-exported

This PR adds an alternative backend for Inductor, adding Composable Kernel Universal GEMM instances to the autotune instance selection. The implementation is heavily influenced by the series of PRs which...

module: rocm
triaged
open source
module: inductor
ciflow/inductor
ciflow/rocm

Hi, thank you for sharing the code! I believe in order to run inference I would need the checkpoint files, could you add them as well? Thank you

Plug into lowering and end to end test in a later PR Instance parsing companion PR https://github.com/ROCm/composable_kernel/pull/1585 cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225...

module: rocm
open source
topic: not user facing
module: inductor
ciflow/inductor
ciflow/rocm

## Proposed changes The current implementation of static_ford uses recursion over the sequence of dimensions and instantiates lambdas at each internal step. This can be avoided and should improve compilation...

compilation time