-
Notifications
You must be signed in to change notification settings - Fork 259
Open
Labels
bugSomething isn't workingSomething isn't workinghelp wantedExtra attention is neededExtra attention is needed
Description
🐛 Bug
It seems like there is a fixed latency while processing concurrent requests. Time for 1 request and 1000 requests is similar.
1 request: 6.7 seconds
1000 concurrent requests: 7.122 seconds
cc: @bhimrazy
To Reproduce
Attach a Lightning Studio which is fully reproducible (code, dependencies, environment, etc...) to reproduce this:
- Create a Studio.
- Reproduce the issue in the Studio.
- Publish the Studio.
- Paste the Studio link here.
Code sample
Expected behavior
Environment
If you published a Studio with your bug report, we can automatically get this information. Otherwise, please describe:
- PyTorch/Jax/Tensorflow Version (e.g., 1.0):
- OS (e.g., Linux):
- How you installed PyTorch (
conda,pip, source): - Build command you used (if compiling from source):
- Python version:
- CUDA/cuDNN version:
- GPU models and configuration:
- Any other relevant information:
Additional context
bhimrazy
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workinghelp wantedExtra attention is neededExtra attention is needed

