Add Hugging Face Inference Endpoints as Model Provider #5642

ramonpzg · 2025-07-01T05:34:43Z

Add Hugging Face Inference Endpoints as Model Provider

This PR begins scaffolding proper integration with HF Inference Enpoints following issue #5281.

What's Added

Hugging Face provider in model providers list with proper branding
Manual endpoint configuration via Settings → Model Providers → Hugging Face
API key and endpoint URL settings with clear instructions
OpenAI-compatible API integration for seamless model communication
Comprehensive documentation with step-by-step setup guide
Empty models list requiring manual model configuration

Current Limitation

As noted by the HF team, the HF model router is not fully functioning yet due to an internal fix needed to proxy to the correct provider. This implementation provides a manual but functional approach that sets the foundation for a more streamlined experience once HF's auto-routing is ready.

Docs

Complete setup guide available at /docs/remote-models/huggingface showing users how to:

Deploy models via HF Inference Endpoints
Configure Jan with endpoint URLs and API keys
Add custom models manually
Troubleshoot common issues

Fixes: #5281

Important

Adds Hugging Face as a model provider with manual endpoint configuration and OpenAI-compatible API integration, including documentation and UI updates.

Behavior:
- Adds Hugging Face as a model provider with manual endpoint configuration in predefinedProviders in data.ts.
- Supports OpenAI-compatible API integration for model communication.
- Requires manual model configuration due to empty models list in huggingface.json.
Documentation:
- New setup guide in huggingface.mdx for integrating Hugging Face Inference Endpoints.
- Updates _meta.json to include Hugging Face documentation link.
Engine Management:
- Adds huggingface.json to engines.mjs for engine and model management.
- Defines Hugging Face engine metadata in resources/huggingface.json.
UI Updates:
- Adds Hugging Face logo and title handling in utils.ts.

^{This description was created by}^{for 216399d. You can customize this summary. It will automatically update as commits are pushed.}

ellipsis-dev

Caution

Changes requested ❌

Reviewed everything up to 216399d in 1 minute and 52 seconds. Click for details.

Reviewed 301 lines of code in 8 files
Skipped 12 files when reviewing.
Skipped posting 8 draft comments. View those below.
Modify your settings and rules to customize what types of comments Ellipsis leaves. And don't forget to react with 👍 or 👎 to teach Ellipsis.

1. docs/src/pages/docs/remote-models/_meta.json:30

Draft comment:
New Hugging Face entry added; verify href matches the documentation URL.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

2. extensions/engine-management-extension/engines.mjs:11

Draft comment:
Hugging Face provider imported and integrated into engines and models arrays correctly.
Reason this comment was not posted:
Confidence changes required: 10% <= threshold 50% None

3. extensions/engine-management-extension/models/huggingface.json:1

Draft comment:
Empty model list is intentional; ensure users know to add models manually.
Reason this comment was not posted:
Confidence changes required: 10% <= threshold 50% None

4. extensions/engine-management-extension/resources/huggingface.json:11

Draft comment:
The transform_req template is quite verbose; consider refactoring or externalizing the list of keys for maintainability.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

5. web-app/src/lib/utils.ts:34

Draft comment:
Hugging Face provider logo and title added; ensure the SVG asset '/images/model-provider/hugging-face.svg' exists.
Reason this comment was not posted:
Confidence changes required: 10% <= threshold 50% None

6. web-app/src/mock/data.ts:287

Draft comment:
Hugging Face provider settings added with correct placeholders; verify the base URL always ends with '/v1' per API requirements.
Reason this comment was not posted:
Confidence changes required: 10% <= threshold 50% None

7. docs/src/pages/docs/remote-models/huggingface.mdx:45

Draft comment:
Typo: "cick on Create Endpoint" should be "click on Create Endpoint".
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 10% vs. threshold = 50% While this is technically correct - there is a typo - the rules state we should not make purely informative comments. Typos in documentation, while worth fixing, are minor issues that don't affect functionality. The meaning is still clear despite the typo. The typo could potentially confuse non-native English speakers following the documentation. Documentation quality is important for user experience. While documentation quality matters, this particular typo is extremely minor and the meaning is obvious from context. This type of minor editorial feedback creates noise in the PR review process. Delete this comment as it's a minor documentation typo that doesn't affect functionality or comprehension significantly.

8. docs/src/pages/docs/remote-models/huggingface.mdx:64

Draft comment:
Typo: "alongside you endpoint URL" should be "alongside your endpoint URL".
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 20% vs. threshold = 50% While this is a real typo, typo fixes in documentation are very minor issues. The meaning is still clear despite the typo. Documentation typos don't affect functionality. However, since this is a new file being added, catching typos before they make it to production could improve quality. Documentation quality and correctness matters for user experience. Typos can make documentation look unprofessional. While documentation quality matters, this particular typo is extremely minor and obvious. The PR author will likely catch it in their own review. This comment, while technically correct, is too minor to be worth keeping in the PR review. It adds noise without significant value.

Workflow ID: wflow_CTRBK99NHyf2QnoC

^{You can customize}^{by changing your verbosity settings, reacting with 👍 or 👎, replying to comments, or adding code review rules.}

ellipsis-dev · 2025-07-01T05:36:44Z

docs/src/pages/docs/remote-models/huggingface.mdx

+<br/>
+
+This will take you to the deployment set up page. For this example, we will leave the default settings 
+as they are under the GPU tab and cick on **Create Endpoint**.


Typo alert: 'cick' should be 'click'.

Suggested change

as they are under the GPU tab and cick on **Create Endpoint**.

as they are under the GPU tab and click on **Create Endpoint**.

github-actions · 2025-07-01T05:37:38Z

Preview URL: https://e4c09b01.docs-9ba.pages.dev

LazyYuuki · 2025-07-19T15:56:55Z

The objective has changed from Inference Endpoint to Inference Provider as HF team has expressed that on cost basis it is not suitable for casual user which is the main group of user for Jan right now.

That being said, we can still cherry pick the example from the docs for and add it as instruction to create Custom Provider with Inference Endpoint instead.

Refer to here for the current implementation in discussion: #5808

ramonpzg added 3 commits July 1, 2025 15:20

Updated documentation for HF Provider

cb97be0

Adding HF as a Model Provider to Jan

a527747

chore: update yarn.lock and ignore flatpak build artifacts

216399d

ramonpzg requested review from louis-jan and urmauur July 1, 2025 05:34

ramonpzg self-assigned this Jul 1, 2025

ramonpzg added this to Jan Jul 1, 2025

ellipsis-dev bot reviewed Jul 1, 2025

View reviewed changes

github-actions bot deployed to docs (Preview) July 1, 2025 05:37 View deployment

ramonpzg mentioned this pull request Jul 1, 2025

goal: Add HuggingFace Inference Provider #5281

Closed

1 task

LazyYuuki added this to the v0.6.5 milestone Jul 4, 2025

LazyYuuki moved this to Todo in Jan Jul 4, 2025

LazyYuuki moved this from Todo to In Progress in Jan Jul 4, 2025

dan-menlo moved this from In Progress to Blocked in Jan Jul 4, 2025

LazyYuuki removed this from the v0.6.5 milestone Jul 5, 2025

ramonpzg marked this pull request as draft July 7, 2025 13:31

LazyYuuki closed this Jul 19, 2025

github-project-automation bot moved this from Blocked to Done in Jan Jul 19, 2025

louis-jan deleted the feature/hf-model-provider branch August 19, 2025 03:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Hugging Face Inference Endpoints as Model Provider #5642

Add Hugging Face Inference Endpoints as Model Provider #5642

Uh oh!

ramonpzg commented Jul 1, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

ellipsis-dev bot left a comment

Uh oh!

ellipsis-dev bot Jul 1, 2025

Uh oh!

github-actions bot commented Jul 1, 2025

Uh oh!

LazyYuuki commented Jul 19, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	as they are under the GPU tab and cick on Create Endpoint.
	as they are under the GPU tab and click on Create Endpoint.

Add Hugging Face Inference Endpoints as Model Provider #5642

Add Hugging Face Inference Endpoints as Model Provider #5642

Uh oh!

Conversation

ramonpzg commented Jul 1, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!