@CuriousPanCake

wip: added the setup part and half of the first matcher

@github-actions (bot) added labels on Apr 22, 2024: category: Core, category: Python API, category: transformations, category: CPP API

We typically name files in snake_case.


Should we create a paged_attention subfolder to hold the files related to this new meta-transformation?

@ilya-lavrenov ilya-lavrenov added this to the 2024.2 milestone Apr 23, 2024
@CuriousPanCake CuriousPanCake force-pushed the paged_attention_transformation branch from 64d760d to 5efd778 Compare April 24, 2024 09:07
@CuriousPanCake CuriousPanCake force-pushed the paged_attention_transformation branch from 8dd42df to 9a665e7 Compare April 29, 2024 17:49
@CuriousPanCake CuriousPanCake force-pushed the paged_attention_transformation branch from 9a665e7 to d6730d9 Compare April 29, 2024 18:41

@ilya-lavrenov ilya-lavrenov left a comment

opt125m and llama2 work

@itikhono
itikhono commented May 1, 2024

> opt125m and llama2 work

The following models work:

         "hf-internal-testing/tiny-random-BloomForCausalLM",
         "hf-internal-testing/tiny-random-FalconForCausalLM",
         "hf-internal-testing/tiny-random-Starcoder2ForCausalLM",
         "hf-internal-testing/tiny-random-GPTJForCausalLM",
         "hf-internal-testing/tiny-random-StableLmForCausalLM",
         "hf-internal-testing/tiny-random-LlamaForCausalLM",
         "hf-internal-testing/tiny-random-MistralForCausalLM",
         "hf-internal-testing/tiny-random-OPTForCausalLM",
         "hf-internal-testing/tiny-random-PhiForCausalLM",
         "hf-internal-testing/tiny-random-StableLmForCausalLM",
         "facebook/opt-125m"

I will re-open this PR and resolve some comments

@itikhono
itikhono commented May 1, 2024

a new PR has been opened: #24336

@itikhono itikhono closed this May 1, 2024
ilya-lavrenov pushed a commit that referenced this pull request May 2, 2024
### Details:
Ported the SDPA-to-PagedAttention transformation from Python to C++.

Related PRs:
#24127
#24177

Tested model scope:
- [x] "hf-internal-testing/tiny-random-BloomForCausalLM"
- [x] "hf-internal-testing/tiny-random-FalconForCausalLM"
- [x] "hf-internal-testing/tiny-random-Starcoder2ForCausalLM"
- [x] "hf-internal-testing/tiny-random-GPTJForCausalLM"
- [x] "hf-internal-testing/tiny-random-StableLmForCausalLM"
- [x] "hf-internal-testing/tiny-random-LlamaForCausalLM"
- [x] "hf-internal-testing/tiny-random-MistralForCausalLM"
- [x] "hf-internal-testing/tiny-random-OPTForCausalLM"
- [x] "hf-internal-testing/tiny-random-PhiForCausalLM"
- [x] "hf-internal-testing/tiny-random-StableLmForCausalLM"
- [x] "facebook/opt-125m"
- [x] "llama2"
- [x] "bigcode/starcoder2-7b"
- [ ] "mosaicml/mpt-7b-chat" (FAILED in both py/c++) - acceptable for this PR
  Issue: RuntimeError: Check '(axis_range_min <= axis) && (axis <= axis_range_max)' failed at src/core/src/validation_util.cpp:386: Concat Parameter axis 2 out of the tensor rank range [0, 0].
- [x] _means that the response to the dedicated prompt is the same for the py and c++ transformations._
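
The pass criterion stated in the list above (identical responses to a dedicated prompt from the py and c++ transformations) could be checked along these lines. This is only a sketch: `compare_transformations`, `generate_py` and `generate_cpp` are hypothetical names, not part of this PR or of OpenVINO, and the two callables stand in for whatever pipeline converts a model with each transformation and generates text.

```python
from typing import Callable, Dict, List


def compare_transformations(
    models: List[str],
    prompt: str,
    generate_py: Callable[[str, str], str],   # hypothetical: Python transformation path
    generate_cpp: Callable[[str, str], str],  # hypothetical: C++ transformation path
) -> Dict[str, bool]:
    """Return, per model id, whether both paths produced the same response."""
    results: Dict[str, bool] = {}
    for model_id in models:
        try:
            py_response = generate_py(model_id, prompt)
            cpp_response = generate_cpp(model_id, prompt)
            results[model_id] = py_response == cpp_response
        except RuntimeError:
            # A failure in either path (as reported for mpt-7b-chat)
            # leaves the box unchecked.
            results[model_id] = False
    return results
```

A model would get `[x]` when its entry is `True` and `[ ]` otherwise.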




### Tickets:
 - *CVS-138664*

---------

Co-authored-by: Sergey Lyalin <[email protected]>
Co-authored-by: Andrii Staikov <[email protected]>