[Feature]: Add embeddings api for Llama 

currently I load openai api server using the command
python3 -m vllm.entrypoints.openai.api_server --model Llama3-8B-Instruct --dtype auto --host 0.0.0.0 --port 8051 --gpu-memory-utilization 0.8 --enforce-eager
I want to try embedding using llama3 but. after loading i can see that embedding API is not loaded
![image](https://github.com/user-attachments/assets/17018d2d-90b4-4b67-bc0d-7d335806d0a2)

I couldn't find any parm to enable embeddings.

Help me to enable embeddings API


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature]: Add embeddings api for Llama #6947

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Feature]: Add embeddings api for Llama #6947

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions