Commit 7187e18
cuda.parallel: Add optional stream argument to reduce_into() (NVIDIA#3348)
* Add optional stream argument to reduce_into()
* Add tests to check for reduce_into() stream behavior
* Move protocol related utils to separate file and rework __cuda_stream__ error messages
* Fix synchronization issue in stream test and add one more invalid stream test case
* Rename cuda stream validation function after removing leading underscore
* Unpack values from __cuda_stream__ instead of indexing
* Fix linting errors
* Handle TypeError when unpacking invalid __cuda_stream__ return
* Use stream to allocate cupy memory in new stream test1 parent aa1ca79 commit 7187e18
2 files changed
+10
-4
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
11 | 11 | | |
12 | 12 | | |
13 | 13 | | |
14 | | - | |
| 14 | + | |
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
| |||
Lines changed: 9 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
89 | 89 | | |
90 | 90 | | |
91 | 91 | | |
92 | | - | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
93 | 99 | | |
94 | 100 | | |
95 | 101 | | |
| |||
104 | 110 | | |
105 | 111 | | |
106 | 112 | | |
107 | | - | |
| 113 | + | |
108 | 114 | | |
109 | 115 | | |
110 | 116 | | |
| |||
126 | 132 | | |
127 | 133 | | |
128 | 134 | | |
129 | | - | |
| 135 | + | |
130 | 136 | | |
131 | 137 | | |
132 | 138 | | |
| |||
0 commit comments