Name and Version
after b7793
Operating systems
Windows
GGML backends
CUDA
Hardware
irrelevant
Models
irrelevant
Problem description & steps to reproduce
When llama-server is running, if the HTTP connection is interrupted on the client side during generation, the original Chat Completions API (/v1/chat/completions) cancels the stream and stops generation, but the new Responses API (/v1/responses) does not.
First Bad Commit
#18486
Relevant log output