Add /upstream endpoint #31

mostlygeek · 2024-12-17T22:35:26Z

This adds an /upstream/:model_name endpoint which is a reverse proxy for the loaded inference server. Useful for accessing the UI of the upstream, or other endpoints, if it has one.

New functionality:

automatically load/swap the model depending on the ID in the URL
/upstream shows an index of available models

This PR also introduces the unlisted: true configuration for models to omit them from /v1/models and /upstream.

- add /upstream endpoint to show a list of available models - add `unlisted` configuration option to omit a model from /v1/models and /upstream lists

mostlygeek added 5 commits December 17, 2024 12:59

remove catch-all route to upstream proxy

adf291c

add handle so /upstream/:model_id gets routed to upstream HTTP server

a114e21

Add /upstream HTML endpoint and unlisted option

8d569ca

- add /upstream endpoint to show a list of available models - add `unlisted` configuration option to omit a model from /v1/models and /upstream lists

update README

eabc641

add favicon.ico

436d20a

mostlygeek merged commit 891f6a5 into main Dec 17, 2024

mostlygeek deleted the add-upstream branch December 17, 2024 22:37

mostlygeek changed the title ~~Add /upstream endpoint (#30)~~ Add /upstream endpoint Dec 17, 2024

mostlygeek mentioned this pull request Dec 17, 2024

Some observations and a question about error messages #30

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add /upstream endpoint #31

Add /upstream endpoint #31

Uh oh!

mostlygeek commented Dec 17, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add /upstream endpoint #31

Add /upstream endpoint #31

Uh oh!

Conversation

mostlygeek commented Dec 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mostlygeek commented Dec 17, 2024 •

edited

Loading