Commit e2bac43
[ModelOpt] Load w13/w2_input_scale for all experts, nvfp4 (vllm-project#26135)
Signed-off-by: Shu Wang <[email protected]>
Signed-off-by: Shu Wang. <[email protected]>
Co-authored-by: Michael Goin <[email protected]>1 parent 79090ca commit e2bac43
File tree
3 files changed
+58
-9
lines changed- vllm/model_executor/layers
- fused_moe
- quantization
- utils
3 files changed
+58
-9
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
49 | 49 | | |
50 | 50 | | |
51 | 51 | | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
52 | 55 | | |
53 | 56 | | |
54 | 57 | | |
| |||
1289 | 1292 | | |
1290 | 1293 | | |
1291 | 1294 | | |
| 1295 | + | |
1292 | 1296 | | |
1293 | 1297 | | |
1294 | 1298 | | |
| |||
1632 | 1636 | | |
1633 | 1637 | | |
1634 | 1638 | | |
1635 | | - | |
1636 | | - | |
| 1639 | + | |
| 1640 | + | |
| 1641 | + | |
| 1642 | + | |
| 1643 | + | |
| 1644 | + | |
| 1645 | + | |
| 1646 | + | |
| 1647 | + | |
| 1648 | + | |
| 1649 | + | |
| 1650 | + | |
| 1651 | + | |
| 1652 | + | |
| 1653 | + | |
1637 | 1654 | | |
1638 | 1655 | | |
1639 | 1656 | | |
1640 | 1657 | | |
1641 | | - | |
1642 | 1658 | | |
1643 | 1659 | | |
1644 | 1660 | | |
| |||
1723 | 1739 | | |
1724 | 1740 | | |
1725 | 1741 | | |
1726 | | - | |
| 1742 | + | |
| 1743 | + | |
| 1744 | + | |
1727 | 1745 | | |
1728 | 1746 | | |
1729 | 1747 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
49 | 49 | | |
50 | 50 | | |
51 | 51 | | |
| 52 | + | |
52 | 53 | | |
53 | 54 | | |
54 | 55 | | |
| |||
1217 | 1218 | | |
1218 | 1219 | | |
1219 | 1220 | | |
| 1221 | + | |
1220 | 1222 | | |
1221 | 1223 | | |
1222 | 1224 | | |
| |||
1295 | 1297 | | |
1296 | 1298 | | |
1297 | 1299 | | |
| 1300 | + | |
| 1301 | + | |
| 1302 | + | |
| 1303 | + | |
| 1304 | + | |
1298 | 1305 | | |
1299 | | - | |
| 1306 | + | |
1300 | 1307 | | |
1301 | 1308 | | |
1302 | 1309 | | |
1303 | 1310 | | |
1304 | 1311 | | |
1305 | | - | |
| 1312 | + | |
1306 | 1313 | | |
1307 | 1314 | | |
1308 | 1315 | | |
| |||
1457 | 1464 | | |
1458 | 1465 | | |
1459 | 1466 | | |
1460 | | - | |
| 1467 | + | |
| 1468 | + | |
| 1469 | + | |
| 1470 | + | |
| 1471 | + | |
| 1472 | + | |
| 1473 | + | |
| 1474 | + | |
| 1475 | + | |
| 1476 | + | |
| 1477 | + | |
1461 | 1478 | | |
1462 | 1479 | | |
1463 | 1480 | | |
| |||
1469 | 1486 | | |
1470 | 1487 | | |
1471 | 1488 | | |
| 1489 | + | |
| 1490 | + | |
| 1491 | + | |
| 1492 | + | |
| 1493 | + | |
| 1494 | + | |
| 1495 | + | |
| 1496 | + | |
1472 | 1497 | | |
1473 | | - | |
| 1498 | + | |
1474 | 1499 | | |
1475 | 1500 | | |
1476 | 1501 | | |
1477 | 1502 | | |
1478 | 1503 | | |
1479 | | - | |
| 1504 | + | |
1480 | 1505 | | |
1481 | 1506 | | |
1482 | 1507 | | |
| |||
Lines changed: 6 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
263 | 263 | | |
264 | 264 | | |
265 | 265 | | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
0 commit comments