-
Running PrivateGPT with the recommended setup ("ui llms-ollama embeddings-ollama vector-stores-qdrant") on WSL (Ubuntu, Windows 11, 32 GB RAM, i7, Nvidia GeForce RTX 4060). LLM chat (no context from files) works well, but when I try to upload a small (1 KB) text file, it gets stuck at 0% while generating embeddings.
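One way to narrow this down is to ask Ollama for an embedding directly, bypassing PrivateGPT. A minimal sketch, assuming Ollama's default port (11434) and the `nomic-embed-text` model that the Ollama profile typically uses; swap in whatever embedding model your setup actually pulled:

```python
# Minimal sanity check: request an embedding from Ollama's API directly.
# Port 11434 and the nomic-embed-text model are assumptions; adjust to
# match your own configuration.
import json
import urllib.request

payload = json.dumps({"model": "nomic-embed-text", "prompt": "hello world"}).encode()
req = urllib.request.Request(
    "http://localhost:11434/api/embeddings",
    data=payload,
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req, timeout=30) as resp:
    result = json.load(resp)

print(f"got embedding of length {len(result.get('embedding', []))}")
# If this call also hangs or errors, the stall is on the Ollama/WSL side,
# not in PrivateGPT's ingestion pipeline.
```

If this request hangs the same way, the problem is between Ollama and WSL (model not pulled, GPU passthrough, etc.) rather than in PrivateGPT itself.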
-
Had the same issue with Ollama; it just wasn't generating the embeddings on WSL. Worked fine when I switched to llama.cpp though.
-
@d1g33k how would I try this with llama.cpp? I am running on Ubuntu using a Docker container.
-
If you are using
-
In my case, I don't use Ollama for embeddings; I use Gemini instead. Here is the relevant part of my settings:

```yaml
embedding:
  mode: gemini
  # Should be matching the value above in most cases
  ingest_mode: pipeline  # the fastest mode according to docs
  # ingest_mode: simple
  count_workers: 16  # depends on your machine
  # count_workers: 4
  # count_workers: 8
  embed_dim: 768

gemini:
  api_key: ${GOOGLE_API_KEY:}
  model: models/gemini-1.5-pro
  # model: models/gemini-1.5-flash
  embedding_model: models/text-embedding-004
```

To solve the slow ingestion for this file, I split each row of the tabular data into its own single-row table file, then ingest all those files at once. The ingestion performance for the whole table really speeds up. Another thing I think we should investigate is the

Just hope these notes could help someone out there using this great project.
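For anyone who wants to try the same row-splitting trick, here is a minimal sketch; the paths and file names are hypothetical, and a plain CSV input is assumed:

```python
# Rough sketch of the row-splitting trick described above: write each data
# row of a CSV to its own single-row file so the rows can be ingested in
# one batch. Paths and names are placeholders; adapt to your data.
import csv
from pathlib import Path

src = Path("data/table.csv")        # the original tabular file
out_dir = Path("data/table_rows")   # one file per row goes here
out_dir.mkdir(parents=True, exist_ok=True)

with src.open(newline="") as f:
    reader = csv.reader(f)
    header = next(reader)           # repeat the header in every file
    for i, row in enumerate(reader):
        with (out_dir / f"row_{i:05d}.csv").open("w", newline="") as out:
            writer = csv.writer(out)
            writer.writerow(header)
            writer.writerow(row)
```

The whole output folder can then be ingested at once (PrivateGPT ships a bulk-ingest helper, `scripts/ingest_folder.py`, for this), which is where the `pipeline` ingest mode and a higher `count_workers` pay off.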