Version: 0.6.6
Describe the Bug
When running inference at very high token-per-second rates (around 1,500 tokens per second) through an API like Cerebras, Jan is not able to keep up. Jan cannot render more than 200 to 300 tokens per second, even on a powerful machine.
Steps to Reproduce
- Get a Cerebras API key at https://cloud.cerebras.ai/ (a free usage tier is available).
- Add Cerebras as a model provider in Jan with the base URL https://api.cerebras.ai/v1.
- Select any Cerebras model, e.g. llama-3.3-70b.
- Observe that generation is limited to 200 to 300 tokens per second. This is not a limitation of the Cerebras API; it is the Jan UI failing to render tokens that fast (see the sketch after this list for an independent check).
- Try Cerebras inference at https://inference.cerebras.ai/ to see how fast it normally generates tokens.
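
To confirm the ceiling is in Jan's renderer rather than the API, you can measure the raw streaming throughput outside Jan. Below is a minimal sketch, assuming the `openai` Python package (the Cerebras endpoint is OpenAI-compatible) and a `CEREBRAS_API_KEY` environment variable; it treats each streamed chunk as roughly one token, which is only an approximation.

```python
# Rough measurement of raw Cerebras streaming throughput, bypassing Jan.
# Assumptions: the `openai` package is installed and CEREBRAS_API_KEY is set.
import os
import time

from openai import OpenAI

client = OpenAI(
    base_url="https://api.cerebras.ai/v1",
    api_key=os.environ["CEREBRAS_API_KEY"],
)

stream = client.chat.completions.create(
    model="llama-3.3-70b",
    messages=[{"role": "user", "content": "Write a 500-word story."}],
    stream=True,
)

chunks = 0
start = None
for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        if start is None:
            # Start the clock at the first content chunk so connection
            # latency does not skew the throughput figure.
            start = time.perf_counter()
        chunks += 1

if start is not None:
    elapsed = time.perf_counter() - start
    print(f"~{chunks / max(elapsed, 1e-9):.0f} tokens/s over {chunks} chunks")
```

If this script reports well over 1,000 tokens per second while Jan stays capped around 200 to 300 with the same model, the bottleneck is in Jan's rendering path, not the network or the API.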
Screenshots / Logs

With the same model, the Cerebras inference UI renders generated tokens in real time at around 1,600 tokens per second, while Jan is stuck at 238.
Operating System