Skip to content

5090 FP4 Gemm Flops cublaslt_gemm benchmark error #721

@ffflyZzz

Description

@ffflyZzz

this is logs
root@ubuntu:/data/superbenchmark/superbench/benchmarks/micro_benchmarks/cublaslt_gemm# cmake . -- The CXX compiler identification is GNU 11.4.0 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/c++ - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Looking for C++ include pthread.h -- Looking for C++ include pthread.h - found -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- The CUDA compiler identification is NVIDIA 12.8.61 -- Detecting CUDA compiler ABI info -- Detecting CUDA compiler ABI info - done -- Check for working CUDA compiler: /usr/local/cuda-12.8/bin/nvcc - skipped -- Detecting CUDA compile features -- Detecting CUDA compile features - done -- Configuring done -- Generating done -- Build files have been written to: /data/superbenchmark/superbench/benchmarks/micro_benchmarks/cublaslt_gemm root@ubuntu:/data/superbenchmark/superbench/benchmarks/micro_benchmarks/cublaslt_gemm# make [ 25%] Building CXX object CMakeFiles/cublaslt_utils.dir/cublaslt_utils.cc.o [ 50%] Linking CXX shared library libcublaslt_utils.so [ 50%] Built target cublaslt_utils [ 75%] Building CUDA object CMakeFiles/cublaslt_gemm.dir/cublaslt_gemm.cu.o [100%] Linking CUDA executable cublaslt_gemm [100%] Built target cublaslt_gemm root@ubuntu:/data/superbenchmark/superbench/benchmarks/micro_benchmarks/cublaslt_gemm# ls CMakeCache.txt cmake_install.cmake cublaslt_gemm cublaslt_utils.cc libcublaslt_utils.so CMakeFiles CMakeLists.txt cublaslt_gemm.cu cublaslt_utils.h Makefile root@ubuntu:/data/superbenchmark/superbench/benchmarks/micro_benchmarks/cublaslt_gemm# ./cublaslt_gemm -w 50 -i 200 -t fp4e2m1 -m 8192 -n 8192 -k 8192 cuBLAS call cublasLtMatmulAlgoGetHeuristic(handle_.get(), op_desc_.get(), a_desc_.get(), b_desc_.get(), c_desc_.get(), d_desc_.get(), preference_.get(), max_algorithm_count, results.data(), &found_algorithm_count) failed at /data/superbenchmark/superbench/benchmarks/micro_benchmarks/cublaslt_gemm/cublaslt_utils.cc:216 'the requested functionality is not supported' root@ubuntu:/data/superbenchmark/superbench/benchmarks/micro_benchmarks/cublaslt_gemm# ./cublaslt_gemm -w 50 -i 200 -t fp8e4m3 -m 8192 -n 8192 -k 8192 8192 8192 8192 0 2280.463379 482.114525

Image

nvidia-smi

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions