A heads-up: the training results are weird with torch=2.6 and unsloth=2025.2.15,
using the same script and the same model.
What fixed it for me was forcing torch=2.5:
conda create --name unsloth_env2 python=3.11 pytorch-cuda=12.1 pytorch=2.5 cudatoolkit xformers -c pytorch -c nvidia -c xformers -y
pip3 install unsloth torch==2.5.1
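After installing, it may be worth confirming that the pin actually took effect, since a later pip install can pull torch back up to 2.6. Below is a minimal sketch of such a check, not part of my original setup. The helper names `installed_version` and `is_torch_2_5` are made up for illustration; it only reads package metadata, so it runs even if torch is missing.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def is_torch_2_5(torch_version: Optional[str]) -> bool:
    """True if the version string is in the 2.5.x series, e.g. '2.5.1+cu121'."""
    if torch_version is None:
        return False
    # Drop a local build tag such as '+cu121', then compare major and minor.
    parts = torch_version.split("+")[0].split(".")
    return parts[:2] == ["2", "5"]


if __name__ == "__main__":
    for name in ("torch", "unsloth"):
        print(name, installed_version(name))
    if not is_torch_2_5(installed_version("torch")):
        print("torch is not 2.5.x, so the pin did not take effect")
```

Run it inside the activated unsloth_env2 environment; if it reports anything other than a 2.5.x torch, reinstall with the pip command above.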
This problem troubled me for two days.