Fix async streaming with OpenAISpec #552

aniketmaurya · 2025-06-16T12:55:13Z

What does this PR do?

Old: users are forced to implement all the methods as async and not rely on default decode_request and encode_response method.

import litserve as ls

class OpenAIAPI(ls.LitAPI):
    def setup(self, device):
        pass

    async def decode_request(self, request):
        return super().decode_request(request)

    async def predict(self, request):
        for i in range(10):
            yield f"Hello, world! {i}"

    async def encode_response(self, output):
        return super().encode_response(output)

if __name__ == "__main__":
    api = OpenAIAPI(enable_async=True, spec=ls.OpenAISpec())
    server = ls.LitServer(api, accelerator="auto")
    server.run(port=8000)

New (this PR): User only implements predict and rest of the methods gets automatically converted into async.

import litserve as ls

class OpenAIAPI(ls.LitAPI):
    def setup(self, device):
        pass

    async def predict(self, request):
        for i in range(10):
            yield f"Hello, world! {i}"

if __name__ == "__main__":
    api = OpenAIAPI(enable_async=True, spec=ls.OpenAISpec())
    server = ls.LitServer(api, accelerator="auto")
    server.run(port=8000)

Before submitting

Was this discussed/agreed via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure to update the docs?
Did you write any new necessary tests?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

…utils.py - Added `asyncify` decorator to convert sync functions, sync generators, async functions, and async generators to a consistent async interface. - Introduced `_stream_gen_from_thread` to handle streaming from sync generators in a separate thread, allowing for non-blocking async operations. - Enhanced error handling within the streaming generator to manage exceptions effectively.

…tionality - Enhanced the `asyncify` decorator to streamline the conversion of sync functions and generators to async counterparts. - Improved documentation within the decorator for better understanding of its behavior. - Updated imports in base.py and openai.py to utilize the new asyncify decorator, ensuring consistent async handling across the codebase.

- Removed the `_stream_gen_from_thread` and `asyncify` functions from `utils.py` to streamline async function handling. - Updated `_async_inject_context` in `base.py` to improve context injection for async functions and generators. - Enhanced `StreamingLoop` to correctly handle async generators in response encoding. - Introduced `as_async` method in `LitSpec` and `OpenAISpec` to support async specifications. - Added `_AsyncOpenAISpecWrapper` to manage async behavior for OpenAI specifications. These changes improve the overall async functionality and maintainability of the codebase.

- Introduced `_AsyncSpecWrapper` to facilitate async handling in `LitSpec`. - Updated `as_async` method in `LitSpec` to return an instance of `_AsyncSpecWrapper`. - Refactored `_AsyncOpenAISpecWrapper` to inherit from `_AsyncSpecWrapper`, streamlining async behavior for `OpenAISpec`. - Improved async request and response handling in both specifications. These changes enhance the overall async functionality and maintainability of the codebase.

…nerator requirement

- Improved the `_validate_async_methods` function to use a structured validation approach for async methods. - Introduced a dictionary to define validation rules for `decode_request`, `encode_response`, and `predict` methods. - Enhanced error handling by collecting warnings and errors separately, providing clearer feedback when async requirements are not met. - Ensured that appropriate warnings are issued and errors raised based on the validation results. These changes enhance the clarity and maintainability of async method validation in the LitAPI class.

…ing in AsyncTestStreamLitAPI

codecov · 2025-06-16T14:47:20Z

Codecov Report

Attention: Patch coverage is 75.86207% with 14 lines in your changes missing coverage. Please review.

Project coverage is 85%. Comparing base (e03ba13) to head (bc0ee32).
Report is 1 commits behind head on main.

Additional details and impacted files

@@         Coverage Diff         @@
##           main   #552   +/-   ##
===================================
- Coverage    85%    85%   -0%     
===================================
  Files        38     38           
  Lines      2902   2940   +38     
===================================
+ Hits       2480   2504   +24     
- Misses      422    436   +14

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

aniketmaurya added 9 commits May 19, 2025 00:56

update

5e3e26b

Merge branch 'main' into asyncify

601b8b7

Merge branch 'main' into asyncify

8ff9ec4

Merge branch 'main' into asyncify

ffe96f1

Wrap the spec in an async spec wrapper if it is async

5d2685d

aniketmaurya marked this pull request as ready for review June 16, 2025 13:13

aniketmaurya requested review from Borda, KaelanDt, andyland, ethanwharris, justusschock, k223kim, lantiga and tchaton as code owners June 16, 2025 13:13

Borda approved these changes Jun 16, 2025

View reviewed changes

aniketmaurya added 7 commits June 16, 2025 19:17

fixes

fcd4582

fix tests

8656df1

fix

ebec7ed

Update error message in test_enable_async_not_set to clarify async ge…

0f63b03

…nerator requirement

update

9897a31

Update encode_response method to use async iteration for output handl…

bc0ee32

…ing in AsyncTestStreamLitAPI

aniketmaurya merged commit 550a23e into main Jun 16, 2025
21 checks passed

aniketmaurya deleted the asyncify branch June 16, 2025 14:52

aniketmaurya mentioned this pull request Jun 17, 2025

asyncify default decode_request and encode_response if not overridden & asyncified by user #502

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix async streaming with OpenAISpec #552

Fix async streaming with OpenAISpec #552

Uh oh!

aniketmaurya commented Jun 16, 2025 •

edited by Borda

Loading

Uh oh!

codecov bot commented Jun 16, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix async streaming with OpenAISpec #552

Fix async streaming with OpenAISpec #552

Uh oh!

Conversation

aniketmaurya commented Jun 16, 2025 • edited by Borda Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

PR review

Did you have fun?

Uh oh!

codecov bot commented Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aniketmaurya commented Jun 16, 2025 •

edited by Borda

Loading

codecov bot commented Jun 16, 2025 •

edited

Loading