Skip to content

Conversation

@AllenHaoHuang
Copy link
Collaborator

@AllenHaoHuang AllenHaoHuang commented Mar 22, 2025

vLLM model code is based off https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/llama.py

Used the following checkpoint /capstor/store/cscs/swissai/a06/main_run_megatron/Megatron-LM/logs/Meg-Runs/main-runs-v1/apertus3-1b-21-nodes/checkpoints/ to debug with and uploaded it to HF here https://huggingface.co/Saesara/swissai

Code passes unit tests matching the outputs of HF https://github.com/swiss-ai/transformers/tree/swissai-model and vLLM for the SwissAI Model

For a clearer view of the files modified, refer to vllm-project/vllm@main...AllenHaoHuang:vllm:swissai-model

Example setup on an interactive node

python -m pip install --upgrade pip

git clone https://github.com/swiss-ai/transformers.git
cd transformers
git checkout swissai-model
pip install -e .
cd ..

git clone https://github.com/AllenHaoHuang/vllm.git
cd vllm
git checkout swissai-model
# The following is necessary for setting up vllm on GH200 GPUs
# See "Use an existing PyTorch installation" https://docs.vllm.ai/en/latest/getting_started/installation/gpu.html
python use_existing_torch.py 
pip install -r requirements/build.txt
export MAX_JOBS=40 # OOM if MAX_JOBS>48
pip install -e . --no-build-isolation # Takes quite some time

Docker Image with transformers and vllm setup for swissai-model but not editable is available at

/capstor/store/cscs/swissai/a06/containers/vllm/swissai_vllm.sqsh

Tests for SwissAI model

# Registry test - Passes
cd vllm/tests/models/
pytest test_registry.py

# HF and vLLM output matching test - Passes
cd vllm/tests/models/decoder_only/language/
pytest -v -s test_models.py # Would recommend commenting out models other than swissai

russellb and others added 30 commits March 11, 2025 20:21
Signed-off-by: <[email protected]>
Signed-off-by: Richard Liu <[email protected]>
Signed-off-by: Randy Chen <[email protected]>
Signed-off-by: Cody Yu <[email protected]>
Co-authored-by: Cody Yu <[email protected]>
Signed-off-by: Jennifer Zhao <[email protected]>
Co-authored-by: Jennifer Zhao <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: DarkLight1337 <[email protected]>
…ash Attention to enable embedding models. (vllm-project#14664)

Signed-off-by: tjtanaa <[email protected]>
… and add KVTransferConfig.kv_connector_extra_config (vllm-project#14367)

Signed-off-by: Mathis Felardos <[email protected]>
cyang49 and others added 28 commits March 20, 2025 19:21
… capture sizes (vllm-project#15308)

Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Naitong Yu <[email protected]>
Signed-off-by: jiangxin <[email protected]>
Co-authored-by: Jason Fang <[email protected]>
Co-authored-by: jiangxin <[email protected]>
AllenHaoHuang added a commit that referenced this pull request Aug 9, 2025
Bugfixes in swiss-ai/main
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.