You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Source register re-use for the llk_unpack_AB_matmul.h and llk_math_matmul.h kernels was a necessity for wormhole_b0, due to the unpacker bandwidth being too low to saturate the compute engine. For blackhole, that is no longer the case, we should review the matmul kernel, and measure performance without re-using Source register as found here:
Source register re-use for the
llk_unpack_AB_matmul.h
andllk_math_matmul.h
kernels was a necessity for wormhole_b0, due to the unpacker bandwidth being too low to saturate the compute engine. For blackhole, that is no longer the case, we should review the matmul kernel, and measure performance without re-using Source register as found here:If the matmul kernels performance is similar or greater without re-use, then the re-use flags and functionality should be removed.
@ttmtrajkovic @rdjogoTT fyi.
The text was updated successfully, but these errors were encountered: