Commit 5571cd8
Backport to 2.8: B200 tunings for histogram (#3728)
* Add b200 tunings for histogram (#3616)
Co-authored-by: Giannis Gonidelis <ggonidelis@nvidia.com>
* Fix SM100 histogram tunings (#3691)
The tuning data member names did not match the one used when selecting
tunings, so all SM100 tunings were SFINAE-ed out.
Also drop tunings with no benefit.
---------
Co-authored-by: Giannis Gonidelis <ggonidelis@nvidia.com>1 parent c8bda1a commit 5571cd8
File tree
2 files changed
+104
-12
lines changed- cub/cub/device/dispatch
- tuning
2 files changed
+104
-12
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
| 40 | + | |
| 41 | + | |
40 | 42 | | |
41 | 43 | | |
42 | 44 | | |
| |||
554 | 556 | | |
555 | 557 | | |
556 | 558 | | |
557 | | - | |
558 | | - | |
| 559 | + | |
559 | 560 | | |
560 | 561 | | |
561 | 562 | | |
| |||
920 | 921 | | |
921 | 922 | | |
922 | 923 | | |
923 | | - | |
924 | | - | |
| 924 | + | |
| 925 | + | |
| 926 | + | |
| 927 | + | |
| 928 | + | |
| 929 | + | |
| 930 | + | |
| 931 | + | |
925 | 932 | | |
926 | 933 | | |
927 | 934 | | |
| |||
1124 | 1131 | | |
1125 | 1132 | | |
1126 | 1133 | | |
1127 | | - | |
1128 | | - | |
| 1134 | + | |
| 1135 | + | |
| 1136 | + | |
| 1137 | + | |
| 1138 | + | |
| 1139 | + | |
| 1140 | + | |
1129 | 1141 | | |
1130 | 1142 | | |
1131 | 1143 | | |
| |||
1292 | 1304 | | |
1293 | 1305 | | |
1294 | 1306 | | |
1295 | | - | |
1296 | | - | |
| 1307 | + | |
| 1308 | + | |
| 1309 | + | |
| 1310 | + | |
| 1311 | + | |
| 1312 | + | |
| 1313 | + | |
1297 | 1314 | | |
1298 | 1315 | | |
1299 | 1316 | | |
| |||
1513 | 1530 | | |
1514 | 1531 | | |
1515 | 1532 | | |
1516 | | - | |
1517 | | - | |
| 1533 | + | |
| 1534 | + | |
| 1535 | + | |
| 1536 | + | |
| 1537 | + | |
| 1538 | + | |
| 1539 | + | |
1518 | 1540 | | |
1519 | 1541 | | |
1520 | 1542 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
60 | 60 | | |
61 | 61 | | |
62 | 62 | | |
| 63 | + | |
| 64 | + | |
63 | 65 | | |
64 | 66 | | |
65 | 67 | | |
| |||
125 | 127 | | |
126 | 128 | | |
127 | 129 | | |
128 | | - | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
129 | 176 | | |
130 | 177 | | |
131 | 178 | | |
| |||
173 | 220 | | |
174 | 221 | | |
175 | 222 | | |
176 | | - | |
| 223 | + | |
| 224 | + | |
| 225 | + | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
| 230 | + | |
| 231 | + | |
| 232 | + | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
| 236 | + | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
177 | 247 | | |
178 | 248 | | |
179 | 249 | | |
| |||
0 commit comments