Commit 79bba4f
[NVIDIA] Add support for cudnn fp4 gemm via flashinfer (vllm-project#26107)
Signed-off-by: kaixih <[email protected]>
Signed-off-by: mgoin <[email protected]>
Co-authored-by: mgoin <[email protected]>
Signed-off-by: 0xrushi <[email protected]>
1 parent dc6a8bc commit 79bba4f
3 files changed
Lines changed: 57 additions & 38 deletions
File tree
- vllm
  - model_executor/layers/quantization
    - compressed_tensors/schemes
Lines changed: 11 additions & 6 deletions
| Hunk | Original file lines | New file lines | Additions | Deletions |
|---|---|---|---|---|
| 1 | 191-196 | 191-197 | 1 | 0 |
| 2 | 1292-1302 | 1293-1307 | 9 | 5 |
| 3 | 1492-1498 | 1497-1502 | 0 | 1 |
| 4 | 1524-1529 | 1528-1534 | 1 | 0 |
Lines changed: 25 additions & 15 deletions
| Hunk | Original file lines | New file lines | Additions | Deletions |
|---|---|---|---|---|
| 1 | 14-20 | 14-23 | 4 | 1 |
| 2 | 29-38 | 32-43 | 6 | 4 |
| 3 | 42-53 | 47-63 | 11 | 6 |
| 4 | 184-193 | 194-202 | 3 | 4 |
| 5 | 198-203 | 207-213 | 1 | 0 |
Lines changed: 21 additions & 17 deletions
| Hunk | Original file lines | New file lines | Additions | Deletions |
|---|---|---|---|---|
| 1 | 926-947 | 926-951 | 17 | 13 |
| 2 | 1109-1119 | 1113-1123 | 4 | 4 |