[Bug] Eval error when evaluating mmlu_pro on qwen2.5 #2031

Open
2 tasks done
Nexround opened this issue Apr 22, 2025 · 0 comments

Prerequisites

Problem type

I am evaluating with officially supported tasks/models/datasets.

Environment

(opencompass) root@08f741bfce41:/workspace/oc/opencompass# python -c "import opencompass.utils;import pprint;pprint.pprint(dict(opencompass.utils.collect_env()))"
{'CUDA available': True,
'CUDA_HOME': '/usr/local/cuda',
'GCC': 'gcc (conda-forge gcc 12.1.0-17) 12.1.0',
'GPU 0': 'NVIDIA GeForce RTX 4090',
'MMEngine': '0.10.6',
'MUSA available': False,
'NVCC': 'Cuda compilation tools, release 12.8, V12.8.61',
'PyTorch': '2.6.0+cu124',
'PyTorch compiling details': 'PyTorch built with:\n'
' - GCC 9.3\n'
' - C++ Version: 201703\n'
' - Intel(R) oneAPI Math Kernel Library Version '
'2024.2-Product Build 20240605 for Intel(R) 64 '
'architecture applications\n'
' - Intel(R) MKL-DNN v3.5.3 (Git Hash '
'66f0cb9eb66affd2da3bf5f8d897376f04aae6af)\n'
' - OpenMP 201511 (a.k.a. OpenMP 4.5)\n'
' - LAPACK is enabled (usually provided by '
'MKL)\n'
' - NNPACK is enabled\n'
' - CPU capability usage: AVX512\n'
' - CUDA Runtime 12.4\n'
' - NVCC architecture flags: '
'-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_86,code=sm_86;-gencode;arch=compute_90,code=sm_90\n'
' - CuDNN 90.1\n'
' - Magma 2.6.1\n'
' - Build settings: BLAS_INFO=mkl, '
'BUILD_TYPE=Release, '
'COMMIT_SHA=2236df1770800ffea5697b11b0bb0d910b2e59e1, '
'CUDA_VERSION=12.4, CUDNN_VERSION=9.1.0, '
'CXX_COMPILER=/opt/rh/devtoolset-9/root/usr/bin/c++, '
'CXX_FLAGS= -D_GLIBCXX_USE_CXX11_ABI=0 '
'-fabi-version=11 -fvisibility-inlines-hidden '
'-DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO '
'-DLIBKINETO_NOROCTRACER -DLIBKINETO_NOXPUPTI=ON '
'-DUSE_FBGEMM -DUSE_PYTORCH_QNNPACK '
'-DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE '
'-O2 -fPIC -Wall -Wextra -Werror=return-type '
'-Werror=non-virtual-dtor -Werror=bool-operation '
'-Wnarrowing -Wno-missing-field-initializers '
'-Wno-type-limits -Wno-array-bounds '
'-Wno-unknown-pragmas -Wno-unused-parameter '
'-Wno-strict-overflow -Wno-strict-aliasing '
'-Wno-stringop-overflow -Wsuggest-override '
'-Wno-psabi -Wno-error=old-style-cast '
'-Wno-missing-braces -fdiagnostics-color=always '
'-faligned-new -Wno-unused-but-set-variable '
'-Wno-maybe-uninitialized -fno-math-errno '
'-fno-trapping-math -Werror=format '
'-Wno-stringop-overflow, LAPACK_INFO=mkl, '
'PERF_WITH_AVX=1, PERF_WITH_AVX2=1, '
'TORCH_VERSION=2.6.0, USE_CUDA=ON, USE_CUDNN=ON, '
'USE_CUSPARSELT=1, USE_EXCEPTION_PTR=1, '
'USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, '
'USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, '
'USE_NCCL=1, USE_NNPACK=ON, USE_OPENMP=ON, '
'USE_ROCM=OFF, USE_ROCM_KERNEL_ASSERT=OFF, \n',
'Python': '3.10.16 (main, Dec 11 2024, 16:24:50) [GCC 11.2.0]',
'TorchVision': '0.21.0+cu124',
'lmdeploy': "not installed:No module named 'lmdeploy'",
'numpy_random_seed': 2147483648,
'opencompass': '0.4.2+455bb05',
'sys.platform': 'linux',
'transformers': '4.48.3'}

Reproducing the problem - code/configuration sample

python run.py --hf-path /cache/models/suppressed_5_L --datasets mmlu_pro_gen --hf-type chat --batch-size 64

Reproducing the problem - command or script

python run.py --hf-path /cache/models/suppressed_5_L --datasets mmlu_pro_gen --hf-type chat --batch-size 64

Reproducing the problem - error message

04/22 14:03:00 - OpenCompass - INFO - Try to load the data from /root/.cache/opencompass/./data/mmlu_pro
Traceback (most recent call last):
  File "/workspace/oc/opencompass/opencompass/tasks/openicl_eval.py", line 458, in <module>
    inferencer.run()
  File "/workspace/oc/opencompass/opencompass/tasks/openicl_eval.py", line 85, in run
    self._score()
  File "/workspace/oc/opencompass/opencompass/tasks/openicl_eval.py", line 97, in _score
    result = self._evaluate_predictions(
  File "/workspace/oc/opencompass/opencompass/tasks/openicl_eval.py", line 281, in _evaluate_predictions
    result = evaluator.evaluate(k, n, copy.deepcopy(test_set), **preds)
AttributeError: 'AccEvaluator' object has no attribute 'evaluate'

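Other information

With the Qwen2.5 0.5B model, inference on mmlu_pro completes normally, but the evaluation (scoring) step raises the error above, so all results come out empty.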
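
To make the failure mode concrete, here is a minimal sketch of how this kind of `AttributeError` can arise. It is not OpenCompass code: `StandInAccEvaluator`, `run_evaluation`, and `available_entry_points` are hypothetical names written for illustration, and the assumption that the evaluator exposes only a `score()` method is a guess based on the traceback, not something confirmed here. The sketch only shows that a caller invoking `evaluate(k, n, test_set, **preds)` on an object without that method fails in the same way, and how one might check which entry points an evaluator object actually provides.

```python
import copy


class StandInAccEvaluator:
    """Hypothetical stand-in, NOT the real OpenCompass AccEvaluator.

    Assumption: the evaluator only implements score() and has no
    evaluate() method, which is what the traceback suggests.
    """

    def score(self, predictions, references):
        # Plain exact-match accuracy, reported as a percentage.
        correct = sum(p == r for p, r in zip(predictions, references))
        return {"accuracy": 100 * correct / len(references)}


def run_evaluation(evaluator, k, n, test_set, **preds):
    """Mirrors the call shape shown in the traceback (illustrative only)."""
    return evaluator.evaluate(k, n, copy.deepcopy(test_set), **preds)


def available_entry_points(evaluator, names=("evaluate", "score")):
    """Report which of the expected methods the object actually provides."""
    return {name: callable(getattr(evaluator, name, None)) for name in names}


evaluator = StandInAccEvaluator()
preds = {"predictions": ["A", "B"], "references": ["A", "C"]}

try:
    run_evaluation(evaluator, 1, 1, [], **preds)
except AttributeError as exc:
    # Same shape of error as in the report, with the stand-in class name.
    error_message = str(exc)

# Diagnostic: which entry points exist on this evaluator object?
entry_points = available_entry_points(evaluator)

# The older-style entry point still works on the stand-in.
fallback_result = evaluator.score(**preds)
```

If a check like `available_entry_points` on the real evaluator instance showed `evaluate` missing, one thing worth verifying would be whether the evaluator class and `openicl_eval.py` come from the same revision of the source tree, since the environment reports a development build (`0.4.2+455bb05`). That is a hypothesis to test, not a confirmed diagnosis.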
