Allow converting HF Falcon models with only one shard in memory at a time (#1)
Merged
jploski merged 1 commit into jploski:falcon40b (Jun 14, 2023)
Conversation
Just as a side note, I could convert the 40B model on 64GB RAM; you just need plenty of fast swap and a bit of patience. In any case, great work splitting it up :)
Author
It's a shame I possess neither of those things! But yeah, assuming you're willing to throw enough time and swap at the problem, there's no memory issue that's insurmountable. :)
Some changes to the conversion script to allow loading just one part of the multi-part model into memory at a time. From what I have heard, Transformers shards don't split individual tensors across files, so this should be safe.
Without this change, I wasn't able to convert the real 40B Falcon model with 64GB RAM. I verified that the mini-Shakespeare model gets converted correctly and that I get reasonable output from the quantized version.
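The shard-at-a-time idea can be sketched roughly as follows. This is a minimal illustration, not the actual conversion script: it uses pickle files to stand in for the real Transformers checkpoint shards, and `convert_tensor`, the shard naming pattern, and the output format are all hypothetical placeholders. The point is only the memory discipline: load one shard, write its converted tensors out, then drop every reference to it before touching the next shard.

```python
import gc
import glob
import pickle


def convert_tensor(tensor):
    # Placeholder for whatever per-tensor work the real script does
    # (dtype conversion, transposition, quantization, ...).
    return tensor


def convert_shards(shard_pattern, out_path):
    """Convert a multi-part checkpoint one shard at a time.

    Only one shard's tensors are resident in memory at any moment;
    each shard is released before the next one is loaded, so peak
    memory is bounded by the largest single shard, not the full model.
    """
    converted_names = []
    with open(out_path, "wb") as out:
        for shard_file in sorted(glob.glob(shard_pattern)):
            with open(shard_file, "rb") as f:
                shard = pickle.load(f)  # load just this one part
            for name, tensor in shard.items():
                pickle.dump((name, convert_tensor(tensor)), out)
                converted_names.append(name)
            del shard       # drop references before loading the next part
            gc.collect()    # encourage immediate reclamation
    return converted_names
```

This only works because, as noted above, no single tensor is split across shard files; each shard can be processed in isolation and its results streamed to the output.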