OPEA - AmazonBedrock integration by Vihanth · Pull Request #1031 · opea-project/GenAIComps

Vihanth · 2024-12-13T00:07:53Z

Description

This PR adds Amazon Bedrock LLMs to OPEA text-generation.

Issues

n/a

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Tests

Tested it by building docker image and deploying it using docker compose.

comps/llms/text-generation/bedrock/Dockerfile

joshuayao · 2025-01-07T05:22:46Z

Hi @Vihanth, The comps/llms/text-generation module has been refactored for v1.2. Could you please update your code to align with these changes? Thanks.
p.s.: OPEA v1.2 plan to code freeze at WW3.

joshuayao · 2025-01-14T02:22:06Z

Hi @Vihanth , OPEA is approaching its code freeze. Could you please check the CI failures?

smguggen · 2025-01-14T23:29:57Z

@joshuayao Sorry about the delay we had a bit of a staffing shake up, we have someone on this again. How much time do we have before the code freeze?

srinarayan-srikanthan · 2025-01-15T16:37:02Z

hi @smguggen , the code freeze is on Friday. Also it seems to be a CI issue of missing credential. So please feel free if you need any help in resolving the issue/merging this PR.

smguggen · 2025-01-15T17:05:01Z

@srinarayan-srikanthan That's exactly what was causing the delay, our security reviewer was not a fan of storing long-lasting credentials to perform the test, so we set up OIDC. I think we can get that together today, so everything should be handled well before Friday. Thanks!

chensuyue · 2025-01-16T05:44:26Z

I can help to add the new secrets for CI. But my concern is this PR is not adapt to the new code structure. The comps/llms/text-generation will be fully removed, and instead there is https://github.com/opea-project/GenAIComps/tree/main/comps/llms/src/text-generation/integrations.

smguggen · 2025-01-16T23:23:16Z

@chensuyue Those changes are underway, can you share the name of the key for the secret? The secret itself was shared with @srinarayan-srikanthan

jonminkin97 · 2025-01-17T00:49:46Z

@chensuyue We will need to use the secret key to get AWS credentials. Once we have the secret key, we will need to update 1165 to use that secret key, and ensure that PR is merged prior to this one.

chensuyue · 2025-01-17T02:30:06Z

@chensuyue We will need to use the secret key to get AWS credentials. Once we have the secret key, we will need to update 1165 to use that secret key, and ensure that PR is merged prior to this one.

srikanthan send me the AWS_IAM_ROLE_ARN, and I have added it into the secrets. But how about those 3 parameter, seems the test also need them.
-e AWS_ACCESS_KEY_ID=${AWS_ACCESS_KEY_ID}
-e AWS_SECRET_ACCESS_KEY=${AWS_SECRET_ACCESS_KEY}
-e AWS_SESSION_TOKEN=${AWS_SESSION_TOKEN} \

jonminkin97 · 2025-01-17T17:19:00Z

@chensuyue We will need to use the secret key to get AWS credentials. Once we have the secret key, we will need to update 1165 to use that secret key, and ensure that PR is merged prior to this one.

srikanthan send me the AWS_IAM_ROLE_ARN, and I have added it into the secrets. But how about those 3 parameter, seems the test also need them. -e AWS_ACCESS_KEY_ID=${AWS_ACCESS_KEY_ID} -e AWS_SECRET_ACCESS_KEY=${AWS_SECRET_ACCESS_KEY} -e AWS_SESSION_TOKEN=${AWS_SESSION_TOKEN} \

The AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and AWS_SESSION token are automatically set by the "configure-aws-credentials" action.

That action needs an IAM Role ARN as input, and sets those three environment variables for future steps. See the documentation for configure-aws-credentials.

jonminkin97 · 2025-01-17T17:34:40Z

@chensuyue Hopefully the above comment clarifies your question and the question on 1165. The 1165 pull request must be merged so that the Workflow checks in this PR can pass. My comment above should address the comment on that PR as well.

chensuyue · 2025-01-20T01:39:41Z

@chensuyue Hopefully the above comment clarifies your question and the question on 1165. The 1165 pull request must be merged so that the Workflow checks in this PR can pass. My comment above should address the comment on that PR as well.

This PR has been merged. But look at the CI, there is another issue need to resolve.

chensuyue · 2025-01-20T01:41:56Z

Regarding to the pre-commit.ci, the fix is not commit to your repo directly, so you need to run it locally to fix the issue.

pip install pre-commit
pre-commit install
pre-commit run --all-files

letonghan · 2025-01-20T06:33:25Z

Hi @jonminkin97 , please delete the comps/llms/text-generation folder, and retain the comps/llms/src folder only.

This is for the new file structure refactoring of this release. All of the llm-related components will be integrated into opea_llm_microservice.py, and other files outside src folder will be all deleted. Please refer to the new file structure here.
Thanks for your work!

chensuyue · 2025-01-20T13:38:31Z

Switch the target branch for test. After the test pass, we can switch back to main branch.

chensuyue · 2025-01-20T13:44:08Z

I have setting the permission in pre-ci branch, but test still failed, 690363d

* initial code for sql agent llama Signed-off-by: minmin-intel <minmin.hou@intel.com> * add test for sql agent Signed-off-by: minmin-intel <minmin.hou@intel.com> * update sql agent test Signed-off-by: minmin-intel <minmin.hou@intel.com> * fix bugs and use vllm to test sql agent Signed-off-by: minmin-intel <minmin.hou@intel.com> * add tag-bench test and google search tool Signed-off-by: minmin-intel <minmin.hou@intel.com> * test sql agent with hints Signed-off-by: minmin-intel <minmin.hou@intel.com> * fix bugs for sql agent with hints and update test Signed-off-by: minmin-intel <minmin.hou@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add readme for sql agent and fix ci bugs Signed-off-by: minmin-intel <minmin.hou@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add sql agent using openai models Signed-off-by: minmin-intel <minmin.hou@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix bugs in sql agent openai Signed-off-by: minmin-intel <minmin.hou@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * make wait time longer for sql agent microservice to be ready Signed-off-by: minmin-intel <minmin.hou@intel.com> * update readme Signed-off-by: minmin-intel <minmin.hou@intel.com> * fix test bug Signed-off-by: minmin-intel <minmin.hou@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * skip planexec with vllm due to vllm-gaudi bug Signed-off-by: minmin-intel <minmin.hou@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * debug ut issue Signed-off-by: minmin-intel <minmin.hou@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * use vllm for all uts Signed-off-by: minmin-intel <minmin.hou@intel.com> * debug ci issue Signed-off-by: minmin-intel <minmin.hou@intel.com> * change vllm port Signed-off-by: minmin-intel <minmin.hou@intel.com> * update ut Signed-off-by: minmin-intel <minmin.hou@intel.com> * remove tgi server Signed-off-by: minmin-intel <minmin.hou@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * align vllm port Signed-off-by: minmin-intel <minmin.hou@intel.com> --------- Signed-off-by: minmin-intel <minmin.hou@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> opea bedrock integration Signed-off-by: vihanth sura <vihanth@amazon.com>

* remove examples gateway. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * remove gateway. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * refine service code. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update http_service.py * remove gateway ut. * remove gateway ut. * fix conflict service name. * Update http_service.py * add handle message ut. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * remove `multiprocessing.Process` start server code. * fix ut. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * remove multiprocessing and enhance ut for coverage. --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>

* vllm support openai API Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * fix bug Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * fix bug Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * test_llms_text-generation_vllm_langchain_on_intel_hpu.sh Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * fix time Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * fix bug Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix bug Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> --------- Signed-off-by: Xinyao Wang <xinyao.wang@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

This reverts commit c36c503. Co-authored-by: lkk <33276950+lkk12014402@users.noreply.github.com>

* Disable telemetry by default Signed-off-by: lvliang-intel <liang1.lv@intel.com>

Refine retrievers Dockerfile and requirements.txt and move --extra-index-url into Dockerfile for CPU Docker image. Signed-off-by: letonghan <letong.han@intel.com>

Signed-off-by: Jonathan Minkin <minkinj@amazon.com> Add id-token permissions for workflow Signed-off-by: Jonathan Minkin <minkinj@amazon.com>

* Fix test_telemetry.py import Signed-off-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Enable AWS access when testing microservices Signed-off-by: Jonathan Minkin <minkinj@amazon.com> * Add id-token permissions to workflow to allow OIDC auth Signed-off-by: Jonathan Minkin <minkinj@amazon.com> --------- Signed-off-by: Jonathan Minkin <minkinj@amazon.com>

)" (opea-project#1173) This reverts commit b0abbde to recover the CI test. Signed-off-by: chensuyue <suyue.chen@intel.com>

Issue: When property graph store gets filled (~12K nodes, 15K relationships) insertion time in dataprep gets slow. Extraction + insertion starts at ~30 sec and once it gets filled grows to (~12K nodes, 15K relationships) ~800 sec Perf bottleneck this cypher call in llama-index to do node upsert: https://github.com/run-llama/llama_index/blob/795bebc2bad31db51b854a5c062bedca42397630/llama-index-integrations/graph_stores/llama-index-graph-stores-neo4j/llama_index/graph_stores/neo4j/neo4j_property_graph.py#L334 Performance optimizations in this PR: 1. Move neo4j GraphStore initialization out of detaprep and retrieve function so it's only performed once at the begining 2. Disable schema_refresh of neo4j graph when not necessary because for large graph this is very slow. 3. Switch to OpenAILike class from llama-index to work with vllm or tgi endpoints without code changes (only docker compose.yaml changes) 4. Added concurrency and batching for generating community summaries and generating answers from summaries --------- Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

According to the RFC's Phase 2 plan, this PR adds image query support, PDF ingestion support, and dynamic ports to the microservices used by MultimodalQnA. This PR goes with this one in GenAIExamples. Signed-off-by: dmsuehir <dina.s.jones@intel.com> Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>

Signed-off-by: chensuyue <suyue.chen@intel.com>

The original output was unclear, this optimization ensures that the user can find the source of the path check CI failure. Signed-off-by: ZePan110 <ze.pan@intel.com>

Enable redis for saving agent_config, messages: 1. agent recovery from redis (agent_config) 2. assemble history for multi-turn related opea-project#977

* Add helm-chart CI test workflow. --------- Signed-off-by: ZePan110 <ze.pan@intel.com>

Refactor dataprep microservice, including opensearch, elasticsearch, pinecone, pgvector, neo4j, qdrant, vdms. Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>

Fix the issue where CD test files cannot be obtained and add GOOGLE secrets. Signed-off-by: ZePan110 <ze.pan@intel.com>

Enhance the bug & feature template according to the issue opea-project/GenAIExamples#1002. Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>

1. Fix CD workflow type error. 2. Change dockerhub default value and add the judgment that matrix is not empty to fix workflow errors. Signed-off-by: ZePan110 <ze.pan@intel.com>

* delete intent detection Remove comps that use the old code structure Signed-off-by: Jonathan Minkin <minkinj@amazon.com> Trigger new run of test Re-trigger new run of test

Vihanth requested a review from lvliang-intel as a code owner December 13, 2024 00:07

lvliang-intel reviewed Dec 13, 2024

View reviewed changes

comps/llms/text-generation/bedrock/Dockerfile Show resolved Hide resolved

joshuayao linked an issue Jan 7, 2025 that may be closed by this pull request

[Feature] AWS Bedrock endpiont #382

Closed

joshuayao added this to the v1.2 milestone Jan 7, 2025

smguggen requested review from chensuyue, ftian1 and letonghan as code owners January 14, 2025 15:24

jonminkin97 requested a review from ZePan110 as a code owner January 17, 2025 21:25

chensuyue changed the base branch from main to pre-ci January 20, 2025 13:37

jonminkin97 requested a review from XinyaoWa as a code owner January 22, 2025 18:39

minmin-intel and others added 4 commits January 23, 2025 01:59

Revert "Add SQL agent strategy (opea-project#975)" (opea-project#1030)

a04b2a2

This reverts commit c36c503. Co-authored-by: lkk <33276950+lkk12014402@users.noreply.github.com>

lvliang-intel and others added 18 commits January 23, 2025 02:04

Disable telemetry by default (opea-project#1168)

0523edc

* Disable telemetry by default Signed-off-by: lvliang-intel <liang1.lv@intel.com>

Refactor retrievers vdms into E-RAG style. (opea-project#1167)

1d36c9a

Refine retrievers Dockerfile and requirements.txt and move --extra-index-url into Dockerfile for CPU Docker image. Signed-off-by: letonghan <letong.han@intel.com>

Enable AWS access when testing microservices (opea-project#1165)

b188f8f

Signed-off-by: Jonathan Minkin <minkinj@amazon.com> Add id-token permissions for workflow Signed-off-by: Jonathan Minkin <minkinj@amazon.com>

Revert "Fix Workflow Access to id-token for OIDC Auth (opea-project#1170

5f17459

)" (opea-project#1173) This reverts commit b0abbde to recover the CI test. Signed-off-by: chensuyue <suyue.chen@intel.com>

Enhance docker container clean up in CI (opea-project#1174)

130506d

Signed-off-by: chensuyue <suyue.chen@intel.com>

Optimize output prompt words (opea-project#1136)

e8e6a86

The original output was unclear, this optimization ensures that the user can find the source of the path check CI failure. Signed-off-by: ZePan110 <ze.pan@intel.com>

add redis persistence for long term memory (opea-project#1144)

e30e942

Enable redis for saving agent_config, messages: 1. agent recovery from redis (agent_config) 2. assemble history for multi-turn related opea-project#977

Add helm-chart CI test workflow. (opea-project#1140)

4a97039

* Add helm-chart CI test workflow. --------- Signed-off-by: ZePan110 <ze.pan@intel.com>

Refactor dataprep microservice (opea-project#1153)

8f4d887

Refactor dataprep microservice, including opensearch, elasticsearch, pinecone, pgvector, neo4j, qdrant, vdms. Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com>

Fix the issue where CD test files cannot be obtained (opea-project#1177)

40bb156

Fix the issue where CD test files cannot be obtained and add GOOGLE secrets. Signed-off-by: ZePan110 <ze.pan@intel.com>

Enhance the issue template (opea-project#1157)

2779ef4

Enhance the bug & feature template according to the issue opea-project/GenAIExamples#1002. Co-authored-by: ZhangJianyu <zhang.jianyu@outlook.com>

Fix CD workflow type error (opea-project#1181)

bbbab81

1. Fix CD workflow type error. 2. Change dockerhub default value and add the judgment that matrix is not empty to fix workflow errors. Signed-off-by: ZePan110 <ze.pan@intel.com>

Delete intent detection (opea-project#1192)

4166563

* delete intent detection Remove comps that use the old code structure Signed-off-by: Jonathan Minkin <minkinj@amazon.com> Trigger new run of test Re-trigger new run of test

test signing

bec28d9

smguggen force-pushed the opea-aws-bedrock branch from 46d96dc to bec28d9 Compare January 23, 2025 02:05

smguggen requested review from Spycsh, XinyuYe-Intel, hteeyeoh, lkk12014402, minmin-intel, yao531441 and yogeshmpandey as code owners January 23, 2025 02:05

chensuyue force-pushed the pre-ci branch from 690363d to 48ccd5f Compare January 23, 2025 04:19

chensuyue mentioned this pull request Jan 23, 2025

Add Bedrock support #1214

Merged

4 tasks

chensuyue deleted the branch opea-project:pre-ci January 24, 2025 02:37

chensuyue closed this Jan 24, 2025

Conversation

Vihanth commented Dec 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Issues

Type of change

Tests

Uh oh!

Uh oh!

joshuayao commented Jan 7, 2025

Uh oh!

joshuayao commented Jan 14, 2025

Uh oh!

smguggen commented Jan 14, 2025

Uh oh!

srinarayan-srikanthan commented Jan 15, 2025

Uh oh!

smguggen commented Jan 15, 2025

Uh oh!

chensuyue commented Jan 16, 2025

Uh oh!

smguggen commented Jan 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jonminkin97 commented Jan 17, 2025

Uh oh!

chensuyue commented Jan 17, 2025

Uh oh!

jonminkin97 commented Jan 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jonminkin97 commented Jan 17, 2025

Uh oh!

chensuyue commented Jan 20, 2025

Uh oh!

chensuyue commented Jan 20, 2025

Uh oh!

letonghan commented Jan 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chensuyue commented Jan 20, 2025

Uh oh!

chensuyue commented Jan 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Vihanth commented Dec 13, 2024 •

edited

Loading

smguggen commented Jan 16, 2025 •

edited

Loading

jonminkin97 commented Jan 17, 2025 •

edited

Loading

letonghan commented Jan 20, 2025 •

edited

Loading