Habana-LLM-Viewer is a tool that provides a Roofline model, LLM performance prediction, and memory analysis for the Intel Gaudi platform. Inspired by LLM-Viewer, Habana-LLM-Viewer can be used to estimate the performance of models such as Llama2-13B, Qwen-7B, and Mixtral-8x7B on the Intel Gaudi platform.
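The Roofline model bounds attainable throughput by the compute peak and by memory bandwidth times arithmetic intensity (FLOPs per byte moved). A minimal sketch of that idea, with illustrative numbers only (not official Gaudi specs) and a hypothetical helper that is not part of this tool:

```python
def roofline_tflops(peak_tflops, bandwidth_tbps, arithmetic_intensity):
    """Attainable TFLOP/s for a kernel with the given FLOP/byte ratio."""
    return min(peak_tflops, bandwidth_tbps * arithmetic_intensity)

# Illustrative device numbers only: assumed 400 TFLOP/s BF16 peak,
# 2.45 TB/s HBM bandwidth.
peak, bw = 400.0, 2.45
print(roofline_tflops(peak, bw, 4.0))     # low intensity: memory-bound
print(roofline_tflops(peak, bw, 1000.0))  # high intensity: compute-bound, hits peak
```

Decode steps of an LLM tend to sit in the memory-bound region, while prefill matmuls can reach the compute roof.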



- Simply run habana_viewer.py and the results will be shown on localhost.
- Simply run run_model_projection.py and the results will be saved to the folder "data/model".
```
python run_model_projection.py \
    --device IntelGaudi2 \
    --device-type B \
    --model Llama2-7B \
    --data-type BF16 \
    --batch-size BATCH_SIZE \
    --context-input CONTEXT_INPUT \
    --context-output CONTEXT_OUTPUT \
    --kvcache-bucket 256 \
    --vec-bmm
```
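The --kvcache-bucket 256 option presumably models KV-cache growth during decode in fixed-size buckets, i.e. the effective sequence length is rounded up to the next multiple of 256. A sketch of that assumption (round_to_bucket is a hypothetical helper, not code from this repo):

```python
def round_to_bucket(seq_len, bucket=256):
    """Round a sequence length up to the next bucket multiple."""
    return ((seq_len + bucket - 1) // bucket) * bucket

print(round_to_bucket(300))   # -> 512
print(round_to_bucket(1024))  # -> 1024 (already a multiple)
```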
- Simply run run_op_projection.py and the results will be saved to the folder "data/operation". As with model projection, one can modify proj_cfg in main.
```
python run_op_projection.py \
    --device IntelGaudi2 \
    --device-type B \
    --op Matmul \
    --data-type BF16 \
    --m-list m1 m2 ... \
    --n-list n1 n2 ... \
    --k-list k1 k2 ...
```
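For a Matmul of shape (m, k) x (k, n) in BF16 (2 bytes per element), the arithmetic intensity that drives a Roofline estimate can be sketched as below (matmul_intensity_bf16 is a hypothetical helper, not this repo's code):

```python
def matmul_intensity_bf16(m, n, k):
    """FLOPs per byte for an (m x k) @ (k x n) matmul in BF16."""
    flops = 2 * m * n * k                      # one multiply + one add per MAC
    bytes_moved = 2 * (m * k + k * n + m * n)  # read A and B, write C, 2 B/elem
    return flops / bytes_moved

print(matmul_intensity_bf16(1, 4096, 4096))     # decode-like GEMV: intensity ~1
print(matmul_intensity_bf16(1024, 4096, 4096))  # large GEMM: high intensity
```

This is why small-batch decode matmuls land in the memory-bound region of the roofline while large prefill GEMMs are compute-bound.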
| Op Name | Projected Data |
| ------- | -------------- |
| Matmul  | Link |
- Currently covers only single-card performance projection; multi-card / multi-node support is planned.
- More models and operations will be covered.