FindDefinition

Results 4 issues of FindDefinition

## Problem currently the API of context parallel have five problems. 1. only support apply CP to whole model. if we have some cross attn in prep part of model...

enhancement
module: context parallel

## New Feature We can use memory instant events in pytorch profiler result to generate a nice gpu memory trace in perfetto: ![Image](https://github.com/user-attachments/assets/63168de3-f515-418f-8be1-fe1229c1bd4e) This memory trace is aligned to python...

enhancement

gluon, as a tile-based low-level GPU programming language, has a core advantage over other similar languages (such as tilelang and tilus): users can perform thread-level operations through **Linear Layout**, for...

enhancement
documentation

### Describe the bug compiler hint functions such as `tl.multiple_of` don't work when appied to return value of a jit function. this bug also exists in gluon. ```Python import triton...

bug