Pull requests: fw-ai/cookbook

Pull requests list

Preflight training smoke secret setup
#320 opened Apr 11, 2026 by benjibc Contributor Draft
fix(infra): skip pod-identity wait on same-trainer reattach
#319 opened Apr 10, 2026 by Hecate0821 Contributor
3 of 4 tasks
fix(infra): detect UPDATING→READY in reattach settle to avoid timeout
#318 opened Apr 10, 2026 by Hecate0821 Contributor
2 tasks
feat(checkpoint): validate checkpoint entries before resume
#314 opened Apr 9, 2026 by hershalb Contributor
14 of 23 tasks
Add cookbook release workflows and runbook
#307 opened Apr 7, 2026 by benjibc Contributor
Add KV-cache key compression reproducible comparison
#303 opened Apr 6, 2026 by yi-fireworks
4 of 5 tasks
feat: LoRA self-reference RL — use policy trainer as KL reference
#299 opened Apr 6, 2026 by mayinghan Contributor
use parsed content as output
#296 opened Apr 3, 2026 by hershalb Contributor
[codex] Fix training SDK smoke compatibility
#292 opened Apr 2, 2026 by benjibc Contributor
docs(training): update cookbook install guidance
#289 opened Apr 1, 2026 by benjibc Contributor
3 tasks done
feat: IGPO training recipe and multi-hop QA example
#284 opened Mar 31, 2026 by morgendave
6 tasks done
training: bound sampler checkpoint export
#270 opened Mar 27, 2026 by benjibc Contributor
Reduce cleanup grace period from 30s to 5s
#267 opened Mar 26, 2026 by mayinghan Contributor
2 tasks
feat(training): async RL loop with AReaL-style streaming pipeline
#243 opened Mar 19, 2026 by Hecate0821 Contributor
4 tasks done
fix(training): harden cleanup and improve SFT observability
#228 opened Mar 17, 2026 by mayinghan Contributor
fix: point deepmath launcher at validated grpo path
#214 opened Mar 15, 2026 by benjibc Contributor
fix(training): separate gspo and gspo-token losses
#212 opened Mar 15, 2026 by benjibc Contributor
2 of 3 tasks
fix(rl): reuse policy trainer for lora grpo references
#211 opened Mar 15, 2026 by benjibc Contributor
2 of 3 tasks
Update run script for Qwen 3-8b model
#210 opened Mar 14, 2026 by mayinghan Contributor
fix(frozen-lake): harden gateway-backed smoke path
#206 opened Mar 13, 2026 by benjibc Contributor
1 task done
Update run.sh for qwen3-32b model configuration
#203 opened Mar 13, 2026 by mayinghan Contributor
fix: pass fw_api_key in ORPO/DPO recipes and guard wandb entity
#197 opened Mar 12, 2026 by renfeichen-fw Contributor
2 of 4 tasks