
Conversation

@noooop
Collaborator

@noooop noooop commented Mar 3, 2025

TLDR

Use model_redirect to redirect a model name to a local folder.
This lets code refer to the model by name without hard-coding the model path.

Usage

One redirect rule per line, with model_name and redirect_name separated by a tab (\t).

For example, given a redirection facebook/opt-125m -> /data/LLM-model/opt-125m

echo -e "facebook/opt-125m\t/data/LLM-model/opt-125m\n" > .model.redirect

VLLM_MODEL_REDIRECT_PATH=".model.redirect" vllm serve facebook/opt-125m

should be equivalent to

vllm serve /data/LLM-model/opt-125m --served-model-name facebook/opt-125m

Use Case

  1. Serve a local model (e.g. your own fine-tuned model) by model name instead of model path.
  2. Allow models from different sources (e.g. ModelScope and Hugging Face), models stored at non-standard paths, and local models to coexist harmoniously.
  3. Offline mode. When a model name is used, vLLM needs to contact Hugging Face several times to resolve the model path and check whether the model is up to date. Redirecting the model name to a local folder avoids network access entirely.
  4. Pin a model version. Using the model name will automatically query Hugging Face and download the latest model version, but sometimes you need a specific one. Redirecting the model name to a local folder lets you avoid hardcoding revisions.
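The lookup behind this can be sketched as a small helper. This is an illustrative sketch, not vLLM's actual implementation; the function name maybe_model_redirect follows the name adopted during review below, and the file format is the one described above (one rule per line, tab-separated):

```python
import os


def maybe_model_redirect(model: str) -> str:
    """Return the redirected local path for `model` if a rule exists,
    otherwise return `model` unchanged. (Illustrative sketch only.)"""
    redirect_path = os.getenv("VLLM_MODEL_REDIRECT_PATH")
    if not redirect_path or not os.path.isfile(redirect_path):
        return model
    with open(redirect_path) as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            # One rule per line: model_name and redirect_name separated by \t
            name, sep, target = line.partition("\t")
            if sep and name == model:
                return target
    return model
```

With the example rule above, maybe_model_redirect("facebook/opt-125m") would return /data/LLM-model/opt-125m, while any name without a matching rule passes through unchanged.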

@noooop noooop marked this pull request as draft March 3, 2025 06:09
@github-actions

github-actions bot commented Mar 3, 2025

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, they only run fastcheck CI, which runs a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

@noooop noooop force-pushed the model_overwrite branch 3 times, most recently from 50acdc7 to 5fecb39 Compare March 3, 2025 06:49
@noooop noooop marked this pull request as ready for review March 3, 2025 07:08
@noooop noooop changed the title [WIP] Use model_overwrite to redirect the model name to a local folder. [Misc] Use model_overwrite to redirect the model name to a local folder. Mar 3, 2025
@noooop noooop marked this pull request as draft March 3, 2025 07:39
@noooop noooop marked this pull request as ready for review March 3, 2025 08:33
@DarkLight1337
Member

I think this is not necessary anymore since we have migrated the CI/CD to using our own file storage which has pre-downloaded models.

@noooop
Collaborator Author

noooop commented Mar 25, 2025

@DarkLight1337

This feature is very convenient for everyone who needs to load models locally: no need to type out the model path every time.

It's not just for vLLM CI/CD.

@mergify

mergify bot commented Mar 25, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @noooop.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Mar 25, 2025
@DarkLight1337
Member

Don't we already have HF_HOME to change the directory containing locally downloaded HF repos?

@noooop
Collaborator Author

noooop commented Mar 25, 2025

Some of my models come from ModelScope, some from HF, and some, e.g. DeepSeek V3, need to be downloaded to another partition.

I found that using model_overwrite is very convenient.

@DarkLight1337 DarkLight1337 requested a review from Isotr0py March 25, 2025 06:10
@DarkLight1337
Member

Personally I don't use ModelScope, so I'll have @Isotr0py review this instead.

@noooop noooop closed this Mar 25, 2025
@noooop noooop reopened this Mar 25, 2025
@mergify mergify bot added documentation Improvements or additions to documentation and removed needs-rebase labels Mar 25, 2025
@noooop
Collaborator Author

noooop commented Mar 25, 2025

@Isotr0py
Ready for review.

examples/offline_inference/basic/use_model_overwrite.py will be deleted.

I think the config is too complicated, and it will also request hf multiple times.

Member

@Isotr0py Isotr0py left a comment


Hmmm, I'm fine with having a function to manually redirect the model_repo to a downloaded local directory, but I don't really like the name "overwrite" or the introduction of .model.overwrite (it doesn't seem to be a formal file used by either HF or ModelScope)...

Member


Suggested change
def model_overwrite(model: str):
def maybe_model_redirect(model: str):

I prefer to use maybe_model_redirect here.

Collaborator Author


maybe_model_redirect is fine.

Member


We should move the helper function to vllm.transformers_utils.utils if it's used by both config and tokenizer.

Comment on lines 270 to 272
Member


I think we just need to call the redirect function when initializing ModelConfig, so that we don't need to add it here and there.

Collaborator Author

@noooop noooop Mar 25, 2025


I don't want to override any model name in ModelConfig.

Yes, reading the config is too complicated, and I don't want to write it everywhere either.

Member


Is this a formal file in ModelScope? I can't find it in ModelScope's API documentation.

Collaborator Author


I named it myself.

@Isotr0py
Member

I think the config is too complicated, and it will also request hf multiple times.

I think we just need to redirect to local location when initializing ModelConfig, so that we don't need to request hf multiple times.

@noooop
Collaborator Author

noooop commented Mar 25, 2025

I think we just need to redirect to local location when initializing ModelConfig, so that we don't need to request hf multiple times.

  1. Redirect to the local location when initializing ModelConfig. Log output and served model names will be very weird, and it may even trigger strange bugs.

  2. Redirect in each submodule. The redirection logic ends up written in many places.

  3. Add a model_path parameter to each module? That's a huge project.

It's all very hacky anyway.

@Isotr0py

Maybe I should stop this stupid idea.

@Isotr0py
Member

Isotr0py commented Mar 25, 2025

Otherwise, log output and server name will be very weird.

IMO, if we redirect the model_repo to a local directory manually, we should also make sure the model's name is updated accordingly; otherwise it's a little bit hacky and makes bugs caused by outdated custom code difficult to find. (For HF, a model with custom code won't be updated automatically when loading an outdated local checkpoint.)

About the server name, we can use --served-model-name to use model repo name.

@noooop
Collaborator Author

noooop commented Mar 25, 2025

I personally prefer option 2:

2. Redirect in each submodule. The redirection logic ends up written in many places.

@Isotr0py

Looking forward to hearing your opinion.

Member

@Isotr0py Isotr0py left a comment


LGTM now!

@Isotr0py Isotr0py enabled auto-merge (squash) March 26, 2025 09:14
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 26, 2025
@noooop
Collaborator Author

noooop commented Mar 26, 2025

Thanks for reviewing the code!

auto-merge was automatically disabled March 27, 2025 03:12

Head branch was pushed to by a user without write access

@noooop noooop closed this Mar 27, 2025
auto-merge was automatically disabled March 27, 2025 03:12

Pull request was closed

@noooop noooop reopened this Mar 27, 2025
@Isotr0py Isotr0py enabled auto-merge (squash) March 27, 2025 04:18
@noooop
Collaborator Author

noooop commented Mar 27, 2025

@Isotr0py

Please restart the tests.

@noooop
Collaborator Author

noooop commented Mar 27, 2025

I'm not sure if this PR is the cause of the problem. It seems that these tests also reported errors yesterday.

QVQ

@Isotr0py
Member

The entrypoint test failure should be unrelated, I can confirm it's passed locally. The V1 test is flaky currently. 😅

@noooop
Collaborator Author

noooop commented Mar 27, 2025

The entrypoint test failure should be unrelated, I can confirm it's passed locally. The V1 test is flaky currently. 😅

QVQ

@vllm-bot vllm-bot merged commit 3f532cb into vllm-project:main Mar 27, 2025
31 of 33 checks passed
Alex4210987 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Apr 5, 2025
lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025
lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025
shreyankg pushed a commit to shreyankg/vllm that referenced this pull request May 3, 2025
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025
@noooop noooop deleted the model_overwrite branch July 10, 2025 04:47

Labels

documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
