Find the latency in async mode

## 🐛 Bug

There appears to be a fixed latency when processing requests in async mode: a single request takes roughly as long as 1000 concurrent requests (6.7 s vs. 7.122 s in the runs below).

### 1 request: 6.7 seconds
![Image](https://github.com/user-attachments/assets/614561f0-4566-46fa-adba-d1bc0caf2743)

### 1000 concurrent requests: 7.122 seconds
![Image](https://github.com/user-attachments/assets/2d02de03-b80c-48d1-8d06-6041c3dbd46f)
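
The timings above can be compared against a minimal, self-contained sketch of the same measurement. This is not the code behind the screenshots and does not call any server. `fake_request` and `FIXED_DELAY_S` are hypothetical stand-ins that model each request as a fixed wait. The sketch only illustrates that when every request pays the same fixed delay and the requests really do run concurrently, the wall time for 1 request and for 1000 requests comes out nearly identical, which is the pattern reported here.

```python
import asyncio
import time

# Hypothetical fixed per-request latency, in seconds (an assumption for
# illustration; the runs above show roughly 6.7 s).
FIXED_DELAY_S = 0.05


async def fake_request() -> None:
    """Stand-in for one request: waits for the fixed delay, then returns."""
    await asyncio.sleep(FIXED_DELAY_S)


async def time_concurrent(n: int) -> float:
    """Launch n requests concurrently and return the total wall time."""
    start = time.perf_counter()
    await asyncio.gather(*(fake_request() for _ in range(n)))
    return time.perf_counter() - start


def measure(n: int) -> float:
    """Synchronous wrapper so the measurement can be called from a script."""
    return asyncio.run(time_concurrent(n))


if __name__ == "__main__":
    for n in (1, 1000):
        print(f"{n:>4} concurrent request(s): {measure(n):.3f} s")
```

If the real numbers follow this shape, the thing to track down is where a single request spends its time, rather than how the server scales with concurrency.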


cc: @bhimrazy 

### To Reproduce

Attach a [Lightning Studio](https://lightning.ai/studios) which is fully reproducible (code, dependencies, environment, etc...) to reproduce this:   

1. Create a [Studio](https://lightning.ai/studios).    
2. Reproduce the issue in the Studio.    
3. [Publish the Studio](https://lightning.ai/docs/overview/studios/publishing#how-to-publish).
4. Paste the Studio link here.    



#### Code sample



### Expected behavior



### Environment
If you published a Studio with your bug report, we can automatically get this information. Otherwise, please describe:   

- PyTorch/JAX/TensorFlow version (e.g., 1.0):
- OS (e.g., Linux):
- How you installed PyTorch (`conda`, `pip`, source):
- Build command you used (if compiling from source):
- Python version:
- CUDA/cuDNN version:
- GPU models and configuration:
- Any other relevant information:

### Additional context
