Yu-Hsiang M. Tsai

Results 52 issues of Yu-Hsiang M. Tsai

This PR ports Jacobi to dpcpp, including the scalar and adaptive-block variants. Currently, dpcpp cannot use different subgroup sizes inside the kernel, so we do not have the concept, which...

reg:build
reg:testing
type:solver
type:preconditioner
type:matrix-format
1:ST:ready-for-review
type:factorization
reg:helper-scripts
mod:all

~~IR uses the mutable workspace to reuse memory when the number of rows of the input rhs does not change.~~ Also, this splits the criterion in IR to eliminate one reduction computation in...

mod:core
type:solver
1:ST:ready-for-review
1:ST:high-importance
1:ST:need-feedback

This PR adds a transposition LinOp that builds the transposition either lazily or immediately. When the behavior is lazy, the transposition is built on the first apply, get_transposition, or prepare_transposition....

is:new-feature
reg:build
reg:testing
mod:core
1:ST:low-importance

This PR makes the IC preconditioner wrapper work without SolverType. It moves the solver type into the factory operator. When we provide the solver, the SolverType is useless, but we still...

reg:build
reg:testing
mod:core
mod:reference
type:preconditioner
1:ST:low-importance

This PR adds the extension resource manager. The purpose is to add a way to search/build/insert data from JSON (possibly with YAML or command-line support in the future). Use JSON as the...

reg:build
reg:testing
mod:core
mod:reference
type:solver
type:preconditioner
type:matrix-format
type:stopping-criteria

This PR mainly tracks the status of, and collects, recent multigrid improvements and fixes. The updates are split into #981 and #980. Some changes related to multigrid itself will be...

reg:build
reg:testing
reg:example
type:solver
type:matrix-format
1:ST:WIP
type:multigrid
mod:all

Track the possible interface-breaking changes we have found so far. 1. csr strategy (#320) 2. csr `clac_size` -> `calc_size` 3. Rename / Rebrand / Add executor information (e.g. change `warp_size` so...

is:interface-breaking

Due to the limits on block/grid size in CUDA, the maximum of gridDim.x is `2^31 - 1` and the maximum of blockDim.x is `1024`. We usually use these two parameters to...

is:help-wanted
mod:cuda
mod:hip
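
To make the limits quoted in this issue concrete, here is a minimal host-side sketch that checks whether a one-thread-per-element 1D launch fits within them. The helper names `ceildiv` and `fits_in_1d_grid` are hypothetical and chosen only for illustration; the issue text is truncated, so this may not match what it actually discusses.

```cpp
#include <cstdint>

// Limits quoted in the issue text for the x-dimension in CUDA.
constexpr std::int64_t max_grid_dim_x = (std::int64_t{1} << 31) - 1;
constexpr std::int64_t max_block_dim_x = 1024;

// Hypothetical helper: rounded-up integer division for positive values.
constexpr std::int64_t ceildiv(std::int64_t num, std::int64_t den)
{
    return (num + den - 1) / den;
}

// Hypothetical helper: true if launching one thread per element with the
// given block size stays within both limits above.
constexpr bool fits_in_1d_grid(std::int64_t num_elements,
                               std::int64_t block_size)
{
    return block_size > 0 && block_size <= max_block_dim_x &&
           ceildiv(num_elements, block_size) <= max_grid_dim_x;
}
```

With a block size of 1024, the largest element count that still fits is `(2^31 - 1) * 1024`. Anything larger needs a different mapping, for example several elements per thread or more grid dimensions.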

When we add more functions to the dpcpp module, the debug build runs into compilation issues. It throws `relocation truncated to fit: R_X86_64_GOTPCREL....` and `PC-relative offset overflows in PLT...

is:bug
is:confirmed
reg:build
plat:intel
mod:dpcpp
is:bugfix

# Description https://github.com/intel/llvm/pull/6407 moves almost all headers from CL/sycl to sycl. I followed the approach of https://github.com/oneapi-src/oneMKL/pull/199 to make the headers use sycl/* if they exist and allow the old...
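
One common way to express this kind of header fallback is the C++17 `__has_include` preprocessor check, sketched below. This is an assumption for illustration only: the snippet above is truncated, and the PR may use a different mechanism (for example a build-system check). The name `example_sycl_header_kind` is hypothetical.

```cpp
// Prefer the new header location, fall back to the old one, and record
// which one was selected. Nothing is included if neither header exists.
#if defined(__has_include)
#if __has_include(<sycl/sycl.hpp>)
#include <sycl/sycl.hpp>
#define EXAMPLE_SYCL_HEADER_KIND 2  // new location: sycl/sycl.hpp
#elif __has_include(<CL/sycl.hpp>)
#include <CL/sycl.hpp>
#define EXAMPLE_SYCL_HEADER_KIND 1  // old location: CL/sycl.hpp
#endif
#endif

#ifndef EXAMPLE_SYCL_HEADER_KIND
#define EXAMPLE_SYCL_HEADER_KIND 0  // no SYCL header found
#endif

// Hypothetical constant exposing the selection to regular C++ code.
constexpr int example_sycl_header_kind = EXAMPLE_SYCL_HEADER_KIND;
```

The advantage of checking in the preprocessor is that the same source builds against both older and newer compiler releases without extra configuration flags.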