You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on May 13, 2024. It is now read-only.
add another URE W(P)=matrixC(total_j, total_i), add another ImageParam matrixC.
Z(P) = select(kkk == 0 && kk == 0 && k == 0, W(P), ... W.merge_ures(X, Y, Z, Out); W.set_bounds(...); W.space_time_transform(...);
Follow t2s/tests/performance/gemm/README to compile it on an FPGA emulator. We will see Internal Error at /home/u128292/t2sp/Halide/src/CodeGen_LLVM.cpp:1465 triggered by user code at : Symbol not found: W.s0.kkk
Note the operand 'read_shift_reg("W.shreg", W.s0.jjj, W.s0.iii, W.s0.kkk)' is before the kkk loop, but it refers to kkk. This looks like an issue with devectorization.