GraphRAG with llama-index #793

Merged
ashahba merged 50 commits into opea-project:main from rbrugaro:graphRAG_LI
Oct 30, 2024

@rbrugaro
Collaborator

Description

GraphRAG components using llama-index:

comps/dataprep/neo4j/llama-index/extract_graph_neo4j.py
comps/retrievers/neo4j/llama-index/retriever_community_answers_neo4j.py

Later I will add a GenAIExample that combines these two components with DocSum to generate the final answers from the community summaries.
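
The flow described above (extract a graph, group it into communities, answer from community summaries, then merge the answers) can be illustrated with a toy sketch. This is not the code from the PR: it uses no llama-index or Neo4j calls, every name in it is hypothetical, the triplets are hard-coded instead of LLM-extracted, connected components stand in for whatever community detection the component uses, and string joins stand in for the LLM and DocSum steps.

```python
from collections import defaultdict

# Hypothetical triplets; in the PR an LLM extracts these from documents.
TRIPLETS = [
    ("OPEA", "hosts", "GenAIComps"),
    ("GenAIComps", "includes", "dataprep"),
    ("Neo4j", "stores", "graph"),
]


def find_communities(triplets):
    """Group entities into connected components (stand-in for community detection)."""
    neighbors = defaultdict(set)
    for subj, _, obj in triplets:
        neighbors[subj].add(obj)
        neighbors[obj].add(subj)
    seen, communities = set(), []
    for start in sorted(neighbors):
        if start in seen:
            continue
        stack, members = [start], set()
        while stack:  # depth-first walk over the undirected graph
            node = stack.pop()
            if node in members:
                continue
            members.add(node)
            stack.extend(neighbors[node] - members)
        seen |= members
        communities.append(members)
    return communities


def summarize(community, triplets):
    """Join the community's relations into one string (stand-in for an LLM summary)."""
    return "; ".join(f"{s} {r} {o}" for s, r, o in triplets if s in community)


def community_answers(query, summaries):
    """Keep summaries that mention a query term (stand-in for per-community LLM answers)."""
    terms = [t.strip("?.,").lower() for t in query.split()]
    return [s for s in summaries if any(t and t in s.lower() for t in terms)]


communities = find_communities(TRIPLETS)
summaries = [summarize(c, TRIPLETS) for c in communities]
answers = community_answers("What does GenAIComps include?", summaries)
final_answer = " ".join(answers)  # stand-in for the DocSum step
print(final_answer)
```

With the sample data, only the community containing GenAIComps contributes an answer; the Neo4j community is filtered out because its summary shares no term with the query.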

rbrugaro and others added 14 commits October 15, 2024 13:53
Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com>
Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

@ichbinblau ichbinblau left a comment

Hi @rbrugaro, thanks for enabling GraphRAG with llama-index. I found some inconsistencies in the commits while testing. PTAL.

@rbrugaro
Collaborator Author

@ichbinblau Thanks for catching this! The PR is still in draft since I need to do some cleanup, and I will take care of the READMEs.

rbrugaro and others added 5 commits October 23, 2024 06:23
…triever

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>
@codecov

codecov bot commented Oct 23, 2024

Codecov Report

Attention: Patch coverage is 9.52381% with 38 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| comps/cores/mega/gateway.py | 7.31% | 38 Missing ⚠️ |

| Files with missing lines | Coverage Δ | |
| --- | --- | --- |
| comps/cores/mega/constants.py | 98.41% <100.00%> (+0.02%) | ⬆️ |
| comps/cores/mega/gateway.py | 30.24% <7.31%> (-2.07%) | ⬇️ |

... and 1 file with indirect coverage changes
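
The percentages in the report are consistent with plain covered/total ratios. The sketch below checks that arithmetic. The per-file line totals (41 changed lines in gateway.py, 1 in constants.py) are inferred from the reported figures and are assumptions; the report itself only states the percentages and the 38 missing lines.

```python
# Line counts inferred from the report (assumptions, not stated by Codecov).
patch_lines = {
    "comps/cores/mega/gateway.py": {"total": 41, "missing": 38},
    "comps/cores/mega/constants.py": {"total": 1, "missing": 0},
}


def coverage_pct(total, missing):
    """Return the percentage of lines that are covered."""
    return 100.0 * (total - missing) / total


gateway_pct = coverage_pct(**patch_lines["comps/cores/mega/gateway.py"])
total = sum(f["total"] for f in patch_lines.values())
missing = sum(f["missing"] for f in patch_lines.values())
patch_pct = coverage_pct(total, missing)

print(f"gateway.py patch coverage: {gateway_pct:.4f}%")
print(f"overall patch coverage: {patch_pct:.5f}%")
```

Under these assumed totals, 3 of 41 lines covered in gateway.py gives about 7.317% (shown as 7.31% in the report), and 4 of 42 lines overall gives 9.52381%, matching the headline patch coverage.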

@rbrugaro rbrugaro marked this pull request as ready for review October 23, 2024 21:19
rbrugaro and others added 3 commits October 24, 2024 03:40
Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>
Collaborator

@ashahba ashahba left a comment

Thanks @rbrugaro for this PR!
This looks like a great PR; my comments and change requests are mostly BKMs (best known methods) for code consistency.

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>
@rbrugaro rbrugaro requested a review from ashahba October 30, 2024 03:34
Collaborator

@ashahba ashahba left a comment

LGTM!

@ashahba ashahba requested review from lvliang-intel and removed request for lvliang-intel October 30, 2024 05:43
@ashahba ashahba merged commit 19330ea into opea-project:main Oct 30, 2024
madison-evans pushed a commit to SAPD-Intel/GenAIComps that referenced this pull request May 12, 2025
* graphRAG dataprep llama-index validated w openai endpoints

Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com>

* llama-index graphRAG retrieval validated with openai models

Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* validated code using TGI/TEI instead of openai

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* compose.yaml for dataprep validated with neo4j, TGI/TEI, openai

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* graphRAG retriever validated and full compose.yaml

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* minor fix

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add graphRAGGateway WIP

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* graphragGateway working in E2E Example

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix schedule in orchestrator to support ChatCompletionRequest input

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* change default to TGI instead of openAI and add test code for neo4jretriever

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* test code for dataprep-neo4j microservice

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* improved READMEs

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update docker build path for tests

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* minor fix

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix typo in container name

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* resolve image name conflict for hub publishing

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* add tgi validation to microservice tests

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* rename test script to trigger cicd correctly w hpu

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* rename test containers for cicd

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* update HF_TOKEN in TGI/TEI test scripts

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* swapped in a larger model so the graph isn't empty in ci test

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* set 4 hpu for 70B model in ci test

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* add extra time for large model loading cicd

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix tgi gaudi shard args

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* switch to chat because a chat template is needed

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* enable logs in test

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* use locally downloaded model in CI machine

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* use local model path and reduce wait time

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* clear ports before ci run

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix cache model access

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* fix cache model access

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* increased wait time for tgi shards ready

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* wait until tgi connected

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* switch back to small model for testing

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* minor readability fixes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

* README fixes

Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>

---------

Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com>
Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
6 participants