Commit e087ccc
[Fix] Fix get_valid_count flaky test for cuda (apache#4901)
* get_valid_count accuracy issue fixed for individual tests but not for all tests running together
* minor fix
* initialize valid_count and PrefixSum buffers
* test updated
* udpate relay test as well
* update document
* fix lint
* address comment
* fix lint
* correct atomicAdd identifier name1 parent b9dc7db commit e087ccc
3 files changed
Lines changed: 166 additions & 255 deletions
File tree
- tests/python/relay
- topi
- python/topi/cuda
- tests/python
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
221 | 221 | | |
222 | 222 | | |
223 | 223 | | |
224 | | - | |
225 | | - | |
226 | 224 | | |
227 | 225 | | |
228 | 226 | | |
| |||
0 commit comments