You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/gguf.md
+15-1Lines changed: 15 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -20,7 +20,7 @@ The key difference between GGJT and GGUF is the use of a key-value structure for
20
20
21
21
### GGUF Naming Convention
22
22
23
-
GGUF follow a naming convention of `<Model>-<Version>-<ExpertsCount>x<Parameters>-<EncodingScheme>.gguf`
23
+
GGUF follow a naming convention of `<Model>-<Version>-<ExpertsCount>x<Parameters>-<EncodingScheme>-<Shard>.gguf`
24
24
25
25
The components are:
26
26
1.**Model**: A descriptive name for the model type or architecture.
@@ -34,6 +34,9 @@ The components are:
34
34
-`M`: Million parameters.
35
35
-`K`: Thousand parameters.
36
36
5.**EncodingScheme**: Indicates the weights encoding scheme that was applied to the model. Content, type mixture and arrangement however are determined by user code and can vary depending on project needs.
37
+
6.**Shard**: (Optional) Indicates and denotes that the model has been split into multiple shards, formatted as `<ShardNum>-of-<ShardTotal>`.
38
+
-*ShardNum* : Shard position in this model. Must be at least 5 digits padded by zeros.
39
+
-*ShardTotal* : Total number of shards in this model. Must be at least 5 digits padded by zeros.
0 commit comments