Laurent Montigny
Laurent Montigny
So how to add the list of publication from a bib file using the same font? Could it be integrated in the master branch? Thanks
Created branch: 116-add-shared-memory-support-for-l0-path
Moved to pakurapo/smem-2 branch, need to change the **is_l0** flag to true in several place to avoid CPU fallback (and use GPU via L0). Otherwise reported address space casting issue...
Large address space issue related to the traits object? solved using ``` if (slot_bytes == sizeof(int32_t)) { // return traits.smemPointerType(llvm::Type::getInt32Ty(context)); return llvm::Type::getInt32PtrTy(context, /*address_space=*/3) ``` generate: `%dest_slot_adr_0 = bitcast i8 addrspace(3)*...
L0Mgr issue, it seems to be an invalid opcode as 65535 is not reserved by any vendor: https://github.com/KhronosGroup/SPIRV-Headers/blob/main/include/spirv/spir-v.xml#L152 `` Wrong data is read as an opcode earlier in the IGC...
Searching the invalid opcode, see [spir-v disassembly](https://gist.github.com/lmontigny/f4b201f08b412c1dd1915b27b604a03c)
Invalid casting in the `write_projection_int64` from RuntimeFunctions.cpp ``` /usr/bin/llvm-as-14: after.linking.before.insert_declaration.spirv.ll:1712:16: error: invalid cast opcode for cast from 'i8 addrspace(4)*' to 'i64*' %6 = bitcast i8 addrspace(4)* %0 to i64*, !ResultSetReductionCodegen.cpp...
Added `write_projection_int64` in genx.ll with the casting to addrspacecast(4) `%6 = bitcast i8 addrspace(4)* %0 to i64 addrspace(4)*` `write_projection_int32` is required in genx.ll as well. Now same casting issue is...
Fixed several function and added implementation in genx.ll Need to fix this one: ``` /usr/bin/llvm-as-14: after.linking.spirv.ll:1978:13: error: '@reduce_from_smem_to_gmem' defined with type 'void (i64 addrspace(4)*, i64 addrspace(4)*, i32)*' but expected 'void...
Now on the shared_memory related issue: ``` $ /usr/bin/llvm-as-14 after.linking.spirv.ll /usr/bin/llvm-as-14: after.linking.spirv.ll:1258:45: error: use of undefined value '@slm.buf.i64' %res = bitcast [1024 x i64] addrspace(3)* @slm.buf.i64 to i64 addrspace(3)* ```...