Commit 424a3e9
authored
Fix binary quantizer host transform bounds and stream order (#1473)
Fix two issues with the binary quantizer's host transform:
1. Synchronize the stream after async GPU operations and before a host-only omp loop;
2. Fill the bytes between `minimul_out_dim` and `out_dim` with zeroes to match device transform behavior.
This PR is a prerequisite to merging rapidsai/raft#2835 (otherwise the tests fail due to host mdarray not initializing with zeroes anymore).
Authors:
- Artem M. Chirkin (https://github.com/achirkin)
Approvers:
- tsuki (https://github.com/enp1s0)
- Micka (https://github.com/lowener)
URL: #14731 parent 96b6e3c commit 424a3e9
1 file changed
Lines changed: 7 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
432 | 432 | | |
433 | 433 | | |
434 | 434 | | |
| 435 | + | |
| 436 | + | |
435 | 437 | | |
436 | 438 | | |
437 | 439 | | |
438 | 440 | | |
439 | | - | |
| 441 | + | |
| 442 | + | |
| 443 | + | |
| 444 | + | |
| 445 | + | |
440 | 446 | | |
441 | 447 | | |
442 | 448 | | |
| |||
0 commit comments