Skip to content

[RFC] OPEA Inference Microservices (OIM)#327

Merged
ashahba merged 6 commits intoopea-project:mainfrom
poussa:oim-rfc
Mar 21, 2025
Merged

[RFC] OPEA Inference Microservices (OIM)#327
ashahba merged 6 commits intoopea-project:mainfrom
poussa:oim-rfc

Conversation

@poussa
Copy link
Copy Markdown
Member

@poussa poussa commented Mar 10, 2025

RFC for a new OPEA feature: OPEA Inference Microservices (OIM). OIM will address LLM inference deployment challenges via Kubernetes operator, custom resources, profiles and standardised container images.

Copy link
Copy Markdown
Collaborator

@mkbhanda mkbhanda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor typos and a question. Thank you @poussa !

Copy link
Copy Markdown
Contributor

@eero-t eero-t left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks OK, but I have few minor suggestions.

@Yu-amd
Copy link
Copy Markdown
Contributor

Yu-amd commented Mar 11, 2025

Hi team,

I raised a feature request for the OIM operator here: opea-project/GenAIExamples#1525 Please review and consider adding this as MVP.

Thank you,
Yu

Copy link
Copy Markdown
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @poussa
The PR looks good overall but please address @eero-t 's comment where you see fit.
Also we need a few lines like a high level summary for the description.

@AlexHe99
Copy link
Copy Markdown

AlexHe99 commented Mar 12, 2025

Really the feature required of OPEA!

@poussa poussa force-pushed the oim-rfc branch 2 times, most recently from 9865d85 to 5b8dd15 Compare March 12, 2025 07:37
@poussa
Copy link
Copy Markdown
Member Author

poussa commented Mar 12, 2025

Thanks @poussa The PR looks good overall but please address @eero-t 's comment where you see fit. Also we need a few lines like a high level summary for the description.

@eero-t comments addressed and description updated.

@eero-t
Copy link
Copy Markdown
Contributor

eero-t commented Mar 12, 2025

Thanks @poussa The PR looks good overall but please address @eero-t 's comment where you see fit.

That's done now.

Also we need a few lines like a high level summary for the description.

Could you suggest something for that, or is it good now?

@poussa
Copy link
Copy Markdown
Member Author

poussa commented Mar 14, 2025

@mkbhanda @ftian1 @ashahba I have now addressed all the feedback. Time for merge? PTAL.

poussa added 5 commits March 20, 2025 10:29
Signed-off-by: Sakari Poussa <[email protected]>
Signed-off-by: Sakari Poussa <[email protected]>
Signed-off-by: Sakari Poussa <[email protected]>
Signed-off-by: Sakari Poussa <[email protected]>
Signed-off-by: Sakari Poussa <[email protected]>
Copy link
Copy Markdown
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@ashahba ashahba merged commit 5d4eda9 into opea-project:main Mar 21, 2025
4 checks passed
haim-barad pushed a commit to SAPD-Intel/docs that referenced this pull request Apr 1, 2025
* [RFC] OPEA Inference Microservices (OIM)

Signed-off-by: Sakari Poussa <[email protected]>

* review fixes

Signed-off-by: Sakari Poussa <[email protected]>

* review fixes

Signed-off-by: Sakari Poussa <[email protected]>

* review fixes

Signed-off-by: Sakari Poussa <[email protected]>

* RFC: add OIM operator diagram

Signed-off-by: Sakari Poussa <[email protected]>

* review fixes, add picture

Signed-off-by: Sakari Poussa <[email protected]>

---------

Signed-off-by: Sakari Poussa <[email protected]>
Signed-off-by: Haim Barad <[email protected]>
mkbhanda added a commit that referenced this pull request Apr 4, 2025
* Add rfc for Routing Agent

Signed-off-by: Haim Barad <[email protected]>

* Add blogs page to OPEA (#318)

* Create index.rst

* Update index.rst

Signed-off-by: Haim Barad <[email protected]>

* Update index.rst (#329)

* Update index.rst

Added "Moving from OpenAI to Opensource using OPEA" blog post

* Updated index.rst with commit message

* Updated index.rst with the right dates. Signed off by: chrisahsiong23 <[email protected]>

* Update index.rst to pass DCO

Signed-off-by: chrisahsiong23 <[email protected]>

---------

Signed-off-by: chrisahsiong23 <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Add GitHub Action to check and close stale issues and PRs (#332)

Signed-off-by: Sun, Xuehao <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* [Bug: 181] Getting started quide mentions Intel Tiber Cloud (#321)

Fixes #181
Co-authored-by: Ghosh, Soumyadip <[email protected]>
Signed-off-by: Piroozan, Nariman <[email protected]>
Signed-off-by:  Jaini, Pallavi <[email protected]>
Signed-off-by: Kavulya, Soila <[email protected]>
Signed-off-by: Rajabose, Shifani <[email protected]>
Signed-off-by: Shifani Rajabose <[email protected]>

Co-authored-by: Malini Bhandaru <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Add doc for contributing vectorDB.

Signed-off-by: Katherine Druckman <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Apply suggestions from code review

Co-authored-by: Malini Bhandaru <[email protected]>
Signed-off-by: Katherine Druckman <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Apply suggestions from code review

Co-authored-by: Malini Bhandaru <[email protected]>
Signed-off-by: Katherine Druckman <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Fix the URL for add_vectorDB.md (#334)

* Fix the URL for add_vectorDB.md

Signed-off-by: Abolfazl Shahbazi <[email protected]>

* minor formattingupdate to CONTRIBUTING.md

Signed-off-by: Abolfazl Shahbazi <[email protected]>

---------

Signed-off-by: Abolfazl Shahbazi <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Add AMD OPEA blog (#339)

Signed-off-by: Yu Wang <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* [RFC] OPEA Inference Microservices (OIM) (#327)

* [RFC] OPEA Inference Microservices (OIM)

Signed-off-by: Sakari Poussa <[email protected]>

* review fixes

Signed-off-by: Sakari Poussa <[email protected]>

* review fixes

Signed-off-by: Sakari Poussa <[email protected]>

* review fixes

Signed-off-by: Sakari Poussa <[email protected]>

* RFC: add OIM operator diagram

Signed-off-by: Sakari Poussa <[email protected]>

* review fixes, add picture

Signed-off-by: Sakari Poussa <[email protected]>

---------

Signed-off-by: Sakari Poussa <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Add 'Document Summarization' to the list of OPEA blogs (#337)

Signed-off-by: Abolfazl Shahbazi <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Add workflow executor agent strategy rfc (#77)

* Add workflow executor agent strategy rfc

Signed-off-by: JoshuaL3000 <[email protected]>

* Remove space in name. Add title

Signed-off-by: JoshuaL3000 <[email protected]>

---------

Signed-off-by: JoshuaL3000 <[email protected]>
Co-authored-by: Ying Hu <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* add chapter Paublications

Signed-off-by: Haim Barad <[email protected]>

* add date item in blogs

Signed-off-by: Haim Barad <[email protected]>

* update the content by table

Signed-off-by: Haim Barad <[email protected]>

* Hybridrag Framework (#341)

* Hybrid RAG PR request

Signed-off-by: Raghava, Sharath <[email protected]>

* Signing off PR

Signed-off-by: Raghava, Sharath <[email protected]>

* filing RFC HybridRAG

Signed-off-by: Raghava, Sharath <[email protected]>

---------

Signed-off-by: Raghava, Sharath <[email protected]>
Co-authored-by: Ying Hu <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* [Bug: 234] Replaced Low res image with High res image on top-left of the docs website  (#325)

* [Bug: 234] Replaced Low res image with High res image on top-left of the docs website

Fixes #234
Co-authored-by: Piroozan, Nariman <[email protected]>
Signed-off-by:  Ghosh, Soumyadip <[email protected]>
Signed-off-by:  Jaini, Pallavi <[email protected]>
Signed-off-by: Kavulya, Soila <[email protected]>
Signed-off-by: Rajabose, Shifani <[email protected]>
Signed-off-by: Shifani Rajabose <[email protected]>

* Add blogs page to OPEA (#318)

* Create index.rst

* Update index.rst

Signed-off-by: Shifani Rajabose <[email protected]>

---------

Signed-off-by: Shifani Rajabose <[email protected]>
Co-authored-by: Pranav Gupta <[email protected]>
Co-authored-by: Malini Bhandaru <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Update conf.py

Changed datatype to fix the error/warning

Signed-off-by: Haim Barad <[email protected]>

* Add build_chatbot_blog

Signed-off-by: Haim Barad <[email protected]>

* Fefine the format

Signed-off-by: Haim Barad <[email protected]>

* A brief introduction of OPEA in first part

Signed-off-by: Haim Barad <[email protected]>

* reformat AudioQnA sample guide (#323)

Signed-off-by: intel-whye <[email protected]>
Co-authored-by: intel-whye <[email protected]>
Co-authored-by: Rachel R <[email protected]>
Co-authored-by: Ying Hu <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Revert "Add build_chatbot_blog" (#342)

* Revert "A brief introduction of OPEA in first part"

This reverts commit 9342278.

* Revert "Fefine the format"

This reverts commit fd1e2f2.

* Revert "Add build_chatbot_blog"

This reverts commit 0126778.

Signed-off-by: Haim Barad <[email protected]>

* doc:Add emeritus code owners page (#312)

* doc:Add emeritus code owners page

Signed-off-by: Wang,Le3 <[email protected]>

* remove lines

Signed-off-by: Wang,Le3 <[email protected]>

---------

Signed-off-by: Wang,Le3 <[email protected]>
Signed-off-by: Haim Barad <[email protected]>

* Added link about routellm.

Signed-off-by: Haim Barad <[email protected]>

* Added note for text-based inputs

Signed-off-by: Haim Barad <[email protected]>

* DCO checkin this commit...

Signed-off-by: Haim Barad <[email protected]>

---------

Signed-off-by: Haim Barad <[email protected]>
Signed-off-by: chrisahsiong23 <[email protected]>
Signed-off-by: Sun, Xuehao <[email protected]>
Signed-off-by: Katherine Druckman <[email protected]>
Signed-off-by: Abolfazl Shahbazi <[email protected]>
Signed-off-by: Yu Wang <[email protected]>
Signed-off-by: Sakari Poussa <[email protected]>
Signed-off-by: JoshuaL3000 <[email protected]>
Signed-off-by: Raghava, Sharath <[email protected]>
Signed-off-by: Shifani Rajabose <[email protected]>
Signed-off-by: intel-whye <[email protected]>
Signed-off-by: Wang,Le3 <[email protected]>
Co-authored-by: Pranav Gupta <[email protected]>
Co-authored-by: chrisahsiong23 <[email protected]>
Co-authored-by: Sun, Xuehao <[email protected]>
Co-authored-by: Shifani Rajabose <[email protected]>
Co-authored-by: Malini Bhandaru <[email protected]>
Co-authored-by: Katherine Druckman <[email protected]>
Co-authored-by: kdruckman <[email protected]>
Co-authored-by: Abolfazl Shahbazi <[email protected]>
Co-authored-by: Yu-amd <[email protected]>
Co-authored-by: Sakari Poussa <[email protected]>
Co-authored-by: JoshuaL3000 <[email protected]>
Co-authored-by: Ying Hu <[email protected]>
Co-authored-by: ZhangJianyu <[email protected]>
Co-authored-by: intelsharath <[email protected]>
Co-authored-by: Wang, Xigui <[email protected]>
Co-authored-by: Hao Ruan <[email protected]>
Co-authored-by: intel-whye <[email protected]>
Co-authored-by: Rachel R <[email protected]>
Co-authored-by: xiguiw <[email protected]>
Co-authored-by: wangleflex <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

10 participants