Commit 19cbf69
feat(trial): add BF16 to quant harness; fix igllama json_mode crash
- trial_quant.zig: add 2B-BF16 and 9B-BF16 to QUANT_MODELS; covers
Q4/Q5/Q6/Q8/BF16 precision curve for both weight classes
- trial_quant.zig: change response_format from json_object to text;
igllama grammar sampler (llama_sampler_init_grammar) crashes during
generation for 2B+ vocab sizes; system prompt alone is sufficient
for constrained JSON output at 2B+
- trial_quant.zig: remove --mlock spawn flag (optimization, not required)
- igllama v0.3.10: fix streaming handler use-after-free in json_mode;
loadGrammar + defer free -> direct JSON_GRAMMAR comptime const
- LICENSE: add MIT license file
- CHANGELOG.md: v0.2.7 entry
- CLAUDE.md: version 0.2.7, roadmap items 14-15
- showcase.smd: document igllama json_mode crash finding, update
engineering requirements, expand trial task suite to T01-T17,
update framework version to v0.2.7/v0.3.10
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
1 parent 252eae9 commit 19cbf69
6 files changed, +68 -754 lines changed
File tree
- examples
- website/content