Commit c2b4565
ggml-hexagon: gelu operation (#17921)
* feat: initial support for gelu using sigmoid approximation
* snapshot: faster gelu using polynomial approximation
* test: disable l2-block prefetch in polynomial approximation
* Revert "test: disable l2-block prefetch in polynomial approximation"
This reverts commit 72339994d45b2bed887e79994403c378d90b62b5.
* Revert "snapshot: faster gelu using polynomial approximation"
This reverts commit 2a787a61d11f9e63e5943a2e6d134b2f0c402ace.
* debug: temporarily disable unnecessary log message for debug purpose
* Feat: optimized unaligned sigmoid_f32
* Feat: larger l2prefetch block
* feat: apply unaligned-load optimization on mul and mul_scalar
* Revert "debug: temporarily disable unnecessary log message for debug purpose"
This reverts commit 84f2f23aa9f17e2fa826db969cd825d0ab192995.
* refactor: cleanup commented unused code
* chore: reformat code with clang-formatter to pass cli test
* Revert "chore: reformat code with clang-formatter to pass cli test"
This reverts commit 952877ec24732b12010c7fa7ed3fc8de4b74e718.
* fix: fix loop overflow
* chore: fix formatting ci error
1 parent baa2a66 commit c2b4565
6 files changed, +256 -21 lines changed
- ggml/src/ggml-hexagon
- htp