Merged
Changes from 1 commit
Commits
58 commits
803e781
[libcudacxx] Experimental try_cancel exposure
gonzalobg Feb 4, 2025
3740f0f
Update documentation
gonzalobg Feb 4, 2025
09c48f8
Update try_cancel_blocks ABI
gonzalobg Feb 4, 2025
6d15626
Update ABI of __cluster_get_dim
gonzalobg Feb 4, 2025
4b9f6a3
Use if target and provide SW fallback
gonzalobg Feb 4, 2025
57dcd42
Use simple license
gonzalobg Feb 4, 2025
aba388b
Fix if guard
gonzalobg Feb 4, 2025
220cf29
Guard for C++20 or newer
gonzalobg Feb 4, 2025
d2baa62
Simplify API
gonzalobg Feb 6, 2025
27a6a52
Add tests
gonzalobg Feb 6, 2025
e1f090b
Clarify C++20 support in docs
gonzalobg Feb 6, 2025
7e99c76
Test main function should only run in host
gonzalobg Feb 6, 2025
0b62b46
Rename to for_each_cancelled_block and extend docs
gonzalobg Feb 6, 2025
0202b52
Fix typo in docs
gonzalobg Feb 6, 2025
a9b46bc
[pre-commit.ci] auto code formatting
pre-commit-ci[bot] Feb 7, 2025
dd6c971
Support C++17, move to different file, improve docs
gonzalobg Feb 18, 2025
cbd4885
Fix two typos
gonzalobg Feb 18, 2025
bdc1011
Free memory in doc example
gonzalobg Feb 18, 2025
ac02abc
Fix typos and add suggestions
gonzalobg Feb 18, 2025
ad807c1
[pre-commit.ci] auto code formatting
pre-commit-ci[bot] Feb 18, 2025
9e75ce2
remove dangling requires clauses
gonzalobg Feb 18, 2025
ab5f28b
More comments; initial arrive can be relaxed
gonzalobg Feb 18, 2025
b3a9933
cancelled 2 cancelled for consistency with PTX
gonzalobg Feb 18, 2025
299e4b3
Add missing invocable include
gonzalobg Feb 18, 2025
e20a87a
Add missing __detail namespace closing brace
gonzalobg Feb 18, 2025
1ddcc83
[pre-commit.ci] auto code formatting
pre-commit-ci[bot] Feb 18, 2025
d11cb32
Merge branch 'main' into try_cancel_api
gonzalobg Feb 19, 2025
c09a381
Stabilize API for CTK
gonzalobg Feb 19, 2025
aba1484
[pre-commit.ci] auto code formatting
pre-commit-ci[bot] Feb 19, 2025
69c5313
Update docs
gonzalobg Feb 19, 2025
59cf5e1
Update docs
gonzalobg Feb 19, 2025
9cb8360
Enable tests in C++17
gonzalobg Feb 19, 2025
c3da9b3
Update test
gonzalobg Feb 19, 2025
cb16ef3
Update test
gonzalobg Feb 19, 2025
898ae57
Run test on all silicon
gonzalobg Feb 19, 2025
ea9f3bb
Fix Bernhard suggestions
gonzalobg Feb 19, 2025
709671c
Improve docs clarity
gonzalobg Feb 19, 2025
1e5ed5d
This needs a cuda compiler
gonzalobg Feb 19, 2025
18d2cec
[pre-commit.ci] auto code formatting
pre-commit-ci[bot] Feb 19, 2025
57a6918
Use int as the size type
miscco Feb 19, 2025
2654baa
Use `assert` in tests
miscco Feb 19, 2025
828fbd1
Use functions from cuda::std
miscco Feb 19, 2025
01bb9ac
Reduce includes to necessary ones
miscco Feb 19, 2025
e788fb0
Use proper license
miscco Feb 19, 2025
f7fc53c
Drop unnecessary `__detail` namespace
miscco Feb 19, 2025
ca0e92a
Cleanup the test a bit more
miscco Feb 19, 2025
e03de4e
Drop unsupported dialects
miscco Feb 19, 2025
4104076
Move to `<cuda/functional>`
miscco Feb 19, 2025
a4b9783
clusterlaunchcontrol.try_cancel requires PTX 8.7
miscco Feb 19, 2025
0272ffe
Add missing include
miscco Feb 19, 2025
7ea70fb
Drop superfluous header guard
miscco Feb 19, 2025
284d84b
Use `NV_DISPATCH_TARGET` because that is more future proof
miscco Feb 19, 2025
8829e90
document requirement on PTX ISA 8.7
miscco Feb 19, 2025
0d1991e
Add check back ^^
miscco Feb 19, 2025
da83903
Fix namespace
miscco Feb 19, 2025
7068da5
Add pre PTX ISA 8.7 fallback and use invoke to support function pointers
miscco Feb 19, 2025
91f3e4d
Move to `<cuda/work_stealing>`
miscco Feb 20, 2025
990bb85
move test file
miscco Feb 20, 2025
Update docs
gonzalobg authored Feb 19, 2025
commit 69c53136f5c4c274451e4cb40620d58c7c7d5d4d
4 changes: 3 additions & 1 deletion docs/libcudacxx/extended_api/work_stealing.rst
@@ -62,8 +62,10 @@ This example shows how to perform work-stealing at thread-block granularity usin
   // - etc.
 
   cuda::for_each_canceled_block<1>([=](dim3 block_idx) {
+    // block_idx may be different than the built-in blockIdx variable, that is:
+    // assert(block_idx == blockIdx); // may fail!
+    // so we need to use "block_idx" consistently inside for_each_canceled:
     int idx = threadIdx.x + block_idx.x * blockDim.x;
-    // assert(block_idx == blockIdx); // May fail!
     if (idx < n) {
       c[idx] += a[idx] + b[idx];
     }
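
The comment added in this hunk says the callback's block_idx can differ from the built-in blockIdx. The sketch below models why this happens, in plain host C++ rather than CUDA. Everything in it is an assumption made for illustration: Dim3, for_each_claimed_block, run_model, and the atomic-counter claiming scheme are hypothetical stand-ins, not the libcudacxx implementation or its API. Each "resident block" runs the body for its own index and then for every index it manages to claim, so the body has to compute its offsets from the index it is handed.

```cpp
#include <atomic>
#include <cassert>
#include <vector>

// Hypothetical host-side model of block-level work stealing.
// Nothing here is the libcudacxx implementation or API.
struct Dim3 { unsigned x, y, z; };  // stand-in for CUDA's dim3

// One "resident block" runs the body for its own index first, then keeps
// claiming indices of blocks that have not started yet.
template <class Body>
void for_each_claimed_block(unsigned own_idx, unsigned grid_dim,
                            std::atomic<unsigned>& next_unclaimed, Body body) {
  for (unsigned idx = own_idx; idx < grid_dim; idx = next_unclaimed.fetch_add(1)) {
    body(Dim3{idx, 0, 0});
  }
}

struct ModelResult {
  bool all_correct;         // every element was updated exactly once
  unsigned index_mismatch;  // body calls where block_idx != the worker's own index
};

ModelResult run_model() {
  constexpr unsigned grid_dim = 8, block_dim = 4, resident = 2;
  constexpr unsigned n = 30;  // deliberately not a multiple of block_dim
  std::vector<int> a(n, 1), b(n, 2), c(n, 0);
  // Indices 0..resident-1 are treated as already running.
  std::atomic<unsigned> next_unclaimed{resident};
  unsigned mismatch = 0;

  // Workers run one after another to keep the model deterministic.
  for (unsigned own = 0; own < resident; ++own) {
    for_each_claimed_block(own, grid_dim, next_unclaimed, [&](Dim3 block_idx) {
      assert(block_idx.x < grid_dim);
      if (block_idx.x != own) ++mismatch;  // the "may fail" case from the diff
      for (unsigned tid = 0; tid < block_dim; ++tid) {  // models threadIdx.x
        unsigned idx = tid + block_idx.x * block_dim;   // uses block_idx, not own
        if (idx < n) c[idx] += a[idx] + b[idx];
      }
    });
  }

  bool ok = true;
  for (int v : c) ok = ok && (v == 3);
  return {ok, mismatch};
}
```

Calling run_model() from a test or a main function exercises the model. Because the body indexes with block_idx, every element is updated exactly once even though one worker handles most of the logical blocks; indexing with the worker's own index instead would update the same few elements repeatedly and leave the rest untouched.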