Conversation
|
@ggerganov how to trigger ggml-7-sycl? |
|
Add the string |
WA for permute(0,1,3,2) mul_mat ggml-ci
|
@arthw it is weird that |
|
@ggerganov only ggml-7-sycl was not triggered; is it offline? |
|
Looks like the same connectivity issues as before:
# when trying to SSH to the instance
channel 0: open failed: connect failed: No route to host
stdio forwarding failed
Connection closed by UNKNOWN port 65535
The instance is shown as "Ready" in the Intel Cloud. |
Please let us know if this happens from time to time. Sorry for the inconvenience; ITDC is still in development. |
|
It seemed to happen quite often - usually about 10 minutes after I managed to SSH. I've reported at least 3 occurrences to Intel support on 2 different machines, but no root cause was found. Yesterday it looked like the issue was gone, because I was able to SSH and set up the CI successfully without disconnects. But now it seems the issue is back. Note that I wasn't connected or doing anything on the machine. The last CI run was ~4 hours ago, from the commits in this PR: https://github.com/ggml-org/ci/commits/results/. Will let you know if the connection comes back again. |
|
It seems the machine hung and is now back. @ggerganov could you share your command, for example a script to run the llama.cpp CI with one click, so that the cloud engineers can debug it, either here or by email? We think this should be a simple issue, but one we hadn't considered before. |
|
Yes, will send you an email in about half an hour |
|
@ggerganov the connection issue is solved; you might need to request another instance. Sorry for the inconvenience. You can use this PR to test the CI again. |
|
Thank you. Will try to set it up today. Will let you know. |
|
@airMeng I just configured the new instance today and as I was just finishing the deployment of |
|
Is this PR still needed or has everything been fixed with #10257? |
Has this been fixed yet? |
|
Closing for inactivity |
fix some issues in norm and permute GEMM
There is instability within the cloud instance, so keep only the basic functionality in SYCL CI to save machine time