Segmentation fault occurs at large batch sizes
Command Line:
./run_benchmark.sh -m llama-7b -d bf16 -s 1 -bs 100 -in 512 -out 256 -i 1
Functions with errors:
onednn_amx_sgemm_f32bf16f32_compute_biasadd
Matmul matrix shape:
M = 51200, N = 12288, K= 4096, transA = 0,alpha=1.000000, lda=4096, beta=0.000000,ldc=12288
oneDNN_verbose:
onednn_verbose,info,oneDNN v3.2.0 (commit 04b180b9a58a78cf1a1cd2329671a5060c2be8de)
onednn_verbose,info,cpu,runtime:OpenMP,nthr:48
onednn_verbose,info,cpu,isa:Intel AVX-512 with float16, Intel DL Boost and bfloat16 support and Intel AMX with bfloat16 and 8-bit integer support
onednn_verbose,info,gpu,runtime:none
onednn_verbose,info,prim_template:operation,engine,primitive,implementation,prop_kind,memory_descriptors,attributes,auxiliary,problem_desc,exec_time
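The logged shape can be cross-checked against the command line with a little arithmetic. The sketch below is only an illustration: the helper names are hypothetical and belong to neither oneDNN nor the benchmark code. It assumes that M is the batch size times the input length, and that the output matrix is dense float32 with ldc = 12288. Under those assumptions the output buffer is about 2.34 GiB, so its byte size no longer fits in a signed 32-bit integer, although its element count still does. The report does not establish whether this is related to the crash.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical helpers for illustration only; not part of oneDNN or the benchmark.

// Rows of the GEMM input when all prompt tokens of all sequences are processed at once
// (assumption: M = batch size * input length).
constexpr std::int64_t gemm_rows(std::int64_t batch_size, std::int64_t input_len) {
    return batch_size * input_len;
}

// Size in bytes of a dense row-major float32 matrix with the given leading dimension.
constexpr std::int64_t f32_bytes(std::int64_t rows, std::int64_t ld) {
    return rows * ld * static_cast<std::int64_t>(sizeof(float));
}

// True when a value does not fit in a signed 32-bit integer.
constexpr bool exceeds_int32(std::int64_t v) {
    return v > std::numeric_limits<std::int32_t>::max();
}

// Values taken from the report: -bs 100 -in 512, N = ldc = 12288.
static_assert(gemm_rows(100, 512) == 51200, "M matches the logged shape");
static_assert(f32_bytes(51200, 12288) == 2516582400LL, "output buffer is about 2.34 GiB");
static_assert(exceeds_int32(f32_bytes(51200, 12288)), "byte size exceeds INT32_MAX");
static_assert(!exceeds_int32(51200LL * 12288), "element count still fits in int32");
```

The checks are `static_assert`s, so compiling the snippet is enough to verify the numbers.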
Command Line:
./run_benchmark.sh -m llama-7b -d bf16 -s 1 -bs 100 -in 32 -out 32 -i 1
Functions with errors:
hpj::Matrix &input, hpj::Matrix &output, hpj::Matrix &residential, bool isMaster) {
TimeLine t("DownProj")
assert(input.Rows() == output.Rows()); (ASSERT FAILED input.Cols()=22016, downWeight.Rows()=11008;)
Matmul matrix shape:
M = 3200, N = 12288, K= 4096, transA = 0,alpha=1.000000, lda=4096, beta=0.000000,ldc=12288
Verbose:
xft_verbose,exec,cpu,api,onednn_amx_sgemm_f32bf16f32_compute_biasadd,m3200n12288k4096,29.308059
xft_verbose,exec,cpu,api,onednn_amx_sgemm_f32bf16f32_compute_residential,m3200n4096k4096,12.953664
xft_verbose,exec,cpu,api,onednn_amx_sgemm_f32bf16f32_compute,m3200n22016k4096,42.813326
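The numbers in this second report can be related to each other the same way. The sketch below uses a hypothetical helper, not the project's code, and only restates values that appear above: M = 3200 equals 100 × 32, and the input width reported by the failed assertion (22016) is exactly twice `downWeight.Rows()` (11008) and matches N in the last verbose line. One possible reading is that the input passed to the down projection still has the full width of the preceding m3200n22016k4096 matmul output. That is an assumption, not something the report confirms.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical check for illustration only; not the project's code.
// An [M x K] * [K x N] product needs the input's column count to equal the weight's row count.
constexpr bool gemm_compatible(std::int64_t input_cols, std::int64_t weight_rows) {
    return input_cols == weight_rows;
}

// Values taken from the failed assertion and the verbose log.
constexpr std::int64_t kInputCols = 22016;       // input.Cols()
constexpr std::int64_t kDownWeightRows = 11008;  // downWeight.Rows()
constexpr std::int64_t kLoggedM = 3200;          // M in the logged shapes

static_assert(kLoggedM == 100 * 32, "M = batch size * input length");
static_assert(!gemm_compatible(kInputCols, kDownWeightRows), "shapes reported by the assertion do not match");
static_assert(kInputCols == 2 * kDownWeightRows, "input is exactly twice as wide as expected");
```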