Wonkyo Choe
@gruler Basically, `$rets = new \PHTRES\Session($config);` is the session itself, so you don't need to call `$rests->session` again. Here is a connection example in case anyone has trouble connecting....
@DeFek1 You can use linear interpolation between those timestep indices. The original repo includes the code, or you can use NumPy or another linear-algebra library. https://research.nvidia.com/labs/toronto-ai/AlignYourSteps/howto.html
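As a concrete sketch of the interpolation idea (the schedule values and fractional indices below are made up for illustration, not taken from the AYS repo), `np.interp` is enough:

```python
import numpy as np

# Hypothetical sigma schedule: one value per discrete timestep.
sigmas = np.linspace(14.6, 0.03, 1000)

# Fractional timestep indices at which we want schedule values.
frac_indices = np.array([0.0, 249.5, 612.25, 999.0])

# Linearly interpolate the schedule at those fractional indices.
interp = np.interp(frac_indices, np.arange(len(sigmas)), sigmas)
print(interp)
```

Integer indices return the schedule values exactly; fractional ones blend the two neighboring timesteps.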
@SA-j00u The bottleneck mainly comes from the MUL_MAT operator. You can profile your run with `GGML_PERF`.

@ring-c If you are using CUDA, that is the normal behavior. If you want...
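A hedged sketch of enabling the `GGML_PERF` timings mentioned above: in older ggml versions it is a compile-time define, so one generic way to turn it on is through the CMake flags (exact plumbing depends on your build setup):

```shell
# Rebuild with the GGML_PERF define so ggml prints per-operator timings.
cmake -B build -DCMAKE_C_FLAGS="-DGGML_PERF" -DCMAKE_CXX_FLAGS="-DGGML_PERF"
cmake --build build
```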
@FSSRepo I understand the repo is still young compared to PyTorch, so the slow inference comes down to under-optimization. Still, the one thing that I do not understand...
I found that this issue was actually caused on my end, although diffusers is still faster. For some reason I had used `CMAKE_BUILD_TYPE=Debug` for the build, and this took out `-O3`...
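For reference, reconfiguring with a Release build restores CMake's default optimization flags (which include `-O3` for GCC/Clang); assuming a standard CMake setup:

```shell
# Reconfigure and rebuild in Release mode so -O3 is applied again.
cmake -B build -DCMAKE_BUILD_TYPE=Release
cmake --build build
```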
@JohnAlcatraz I just updated the first post and am reopening the issue.
Which tokenizer did you use to generate the dataset?
Okay. I found that `device_map` actually only offloads the model weights, not the execution. If a GPU is present, it takes priority in executing...
I think this is intentional. The other blocks have those variables; only the first one does not. The inconsistency can be seen here: https://github.com/BlinkDL/RWKV-LM/blob/d6a1efc06c46681b61694a67b8591120865446ba/RWKV-v7/train_temp/src/model.py#L173-L176